In this course we will analyze the paradigm of automatic object recognition from a Computer Vision and Machine Learning points of view. We will review past and recent challenges in object recognition, such as multi-modal, multi-part, multi-scale, multi-view, multi-class, multi-label, and large scale object recognition, including recent deep learning architectures. We will also review current trends for a particular and complex kind of objects: people in visual data'. We will deal with the problem of human pose recovery and automatic behavior analysis, describing potential applications as well as future lines of research in the field.
Person in charge
Simone Balocco (
Sergio Escalera (
Generic Technical Competences
CG2 - Capability to lead, plan and supervise multidisciplinary teams.
Technical Competences of each Specialization
CEA6 - Capability to understand the basic operation principles of Computational Vision main techniques, and to know how to use in the environment of an intelligent system or service.
CEA14 - Capability to understand the advanced techniques of Vision, Perception and Robotics, and to know how to design, implement and apply these techniques in the development of intelligent applications, services or systems.
CEP6 - Capability to assimilate and integrate the changing economic, social and technological environment to the objectives and procedures of informatic work in intelligent systems.
CEP8 - Capability to respect the surrounding environment and design and develop sustainable intelligent systems.
Appropiate attitude towards work
CT5 - Capability to be motivated for professional development, to meet new challenges and for continuous improvement. Capability to work in situations with lack of information.
CB7 - Ability to integrate knowledges and handle the complexity of making judgments based on information which, being incomplete or limited, includes considerations on social and ethical responsibilities linked to the application of their knowledge and judgments.
Introduction to object and human recognition
Multi-modal object recognition
Multi-part object recognition
Multi-scale object recognition
Multi-view object recognition
Multi-class object recognition
Multi-label object recognition
Multi-ple data: deep-learning for large scale object recognition
Object Recognition in context: scene understanding and grammars
Human Pose Recovery
Human Behavior Analysis
T Each week it will be a 1.5h theoretical topic exposition class.
P Each week it will be a 1h practical session.
The rest of the course are devoted to autonomous lectures, programming, and studying.
The course will follow a continuous evaluation consisting in four practical reports (PR) and two in-class presentations (PS). At the end of the course a test exam will be performed (TS). The final score (FS) will be computed as follows:
FS = 0.5 * PR + 0.3 * PS + 0.2 * TS
A minimum score of 3 over 10 points is required for each part PR, PS, and TS in order to compute the final score FS.
This is an addendum to the OR's course guide and it is valid during the exceptional period that was declared by the "Royal Decree 463/2020, of 14 March 2020, that declares the state of alarm in order to manage the health crisis caused by the COVID-19"
Changes, compared to the version of the OR's Course Guide at the beginning of the semester, regarding contents, activities and teaching methodology:
BEYOND THE ADAPTATION OF THE COURSE TO A VIRTUAL METHODOLOGY, THERE ARE NO SIGNIFICANT CHANGES.
Changes, compared to the version of the OR's Course Guide at the beginning of the semester, regarding the evaluation method:
BEYOND THE ADAPTATION OF THE COURSE TO A VIRTUAL METHODOLOGY, THERE ARE NO CHANGES IN THE EVALUATION METHOD
THE ADAPTATION OF THE COURSE TO A VIRTUAL METHODOLOGY
Where we are
B6 Building Campus Nord
C/Jordi Girona Salgado,1-3
08034 BARCELONA Spain
Tel: (+34) 93 401 70 00