Consulta ofertas de otros estudios y especialidades
L'objectiu del projecte consisteix en classificar un conjunt d'espècies a partir d'imatges reals d'ocells. S'estudiaran diferents solucions basades en característiques locals i/o deep learning.
Els detalls es donaran a l'alumne personalment
State-of-the-art models such as LLMs are too large to fit in a single compute node (GPU, NPU, CPU), both for training and inference on a device (e.g., phone, laptop, tablet) or in larger-scale data centers. There is a need to develop optimization techniques to split and place these models onto a distributed set of compute nodes so that the overall system performance is maximized. The research will be focused on optimizing the placement of AI models onto distributed systems considering training time, energy consumption, and computational resources.
State-of-the-art models such as LLMs are too large to fit in a single compute node (GPU, NPU, CPU), both for training and inference on a device (e.g., phone, laptop, tablet) or in larger-scale data centers. There is a need to develop optimization techniques to split and place these models onto a distributed set of compute nodes so that the overall system performance is maximized. The research will be focused on optimizing the placement of AI models onto distributed systems considering training time, energy consumption, and computational resources.
In this thesis, the student will model and simulate distributed AI workloads using both mathematical frameworks and simulators. This encompasses the modeling of network, compute, and memory components within a distributed architecture. Developing new modules to enhance the modeling process. Evaluating and optimizing various parallelization techniques to improve overall system performance.
The Universitat Politècnica de Catalunya · BarcelonaTech offers Master thesis fellowships in the field of LLM training. The research will be supported by Qualcomm and will be carried out in an environment with a strong interaction with leading experts in the field, with opportunities for doing internships in the company.
More information here: https://www.cs.upc.edu/~jordicf/priv/eda/llm_qc.html
In the proposed project, we are interested in the mobile setting, and propose the use of depth information, on top of the usual RGB (Red, Green, Blue) pixel data acquired by mobile device cameras, to track and quantify visual attention.
Automatic gaze tracking methods have been used in the recent years for various purposes, including measuring ad attention or enabling for gesture-less user-device interactions. Most of these methods have been tested in the desktop setting, while the few reported attempts that addressed the mobile setting report low accuracy and require continuous calibration. Additionally, the aforementioned methods rely on expensive sensors, such as infrared eye trackers, that limit their scaling capacity. Other methods employ touch-based interactions (e.g., tracking zoom/pinch gestures and scroll activity) to produce an estimation of the user gaze which, at best, are weakly correlated with visual attention. However, 42% of the time spent on websites is by mobile users, while similar trends are reported for the percentage of page views per visit, which creates an opportunity for a novel attention measurement technology to take root; one which can offer accurate, reliable, and scalable measurements.
In the proposed project, we are interested in the mobile setting, and propose the use of depth information, on top of the usual RGB (Red, Green, Blue) pixel data acquired by mobile device cameras, to track and quantify visual attention. Depth information can either be estimated with post computer visual processing or acquired directly with high precision, using advanced LiDAR sensors that newer smartphone models are currently equipped with. Furthermore, we propose the use of "lightweight" Deep Learning (DL) techniques to achieve an accurate and robust gaze point estimation without the need of extensive calibration. Unlike prior art, we will consider the use of historical information to inform future predictions and will examine how such models can be tailored to each user by injecting personalized data. In addition, we propose a more lightweight Neural Network (NN) architecture, suitable for the processing power of mobile devices, that also leverages the use of video or photo frames with depth sensory data.
The candidate will:
© Facultat d'Informàtica de Barcelona - Universitat Politècnica de Catalunya - Avíso legal sobre esta web - Configuración de privacidad