Object Recognition

Teachers
Weekly hours
Competences
Objectives
Contents
Activities
Teaching methodology
Evaluation methodology
Bibliography

Credits

Types

Elective

Requirements

This subject has not requirements

Department

UB;CS

In this course we will analyze the paradigm of automatic object recognition from a Computer Vision and Machine Learning points of view. We will review past and recent challenges in object recognition, such as multi-modal, multi-part, multi-scale, multi-view, multi-class, multi-label, and large scale object recognition, including recent deep learning architectures. We will also review current trends for a particular and complex kind of objects: people in visual data'. We will deal with the problem of human pose recovery and automatic behavior analysis, describing potential applications as well as future lines of research in the field.

Teachers

Person in charge

Sergio Escalera Guerrero ( )

Others

Meysam Madadi ( )

Weekly hours

Theory

1.5

Problems

Laboratory

Guided learning

Autonomous learning

Competences

Generic Technical Competences

Generic

CG2 - Capability to lead, plan and supervise multidisciplinary teams.

Technical Competences of each Specialization

Academic

CEA3 - Capability to understand the basic operation principles of Machine Learning main techniques, and to know how to use on the environment of an intelligent system or service.
CEA4 - Capability to understand the basic operation principles of Computational Intelligence main techniques, and to know how to use in the environment of an intelligent system or service.
CEA6 - Capability to understand the basic operation principles of Computational Vision main techniques, and to know how to use in the environment of an intelligent system or service.
CEA8 - Capability to research in new techniques, methodologies, architectures, services or systems in the area of ??Artificial Intelligence.
CEA13 - Capability to understand advanced techniques of Modeling , Reasoning and Problem Solving, and to know how to design, implement and apply these techniques in the development of intelligent applications, services or systems.
CEA14 - Capability to understand the advanced techniques of Vision, Perception and Robotics, and to know how to design, implement and apply these techniques in the development of intelligent applications, services or systems.

Professional

CEP3 - Capacity for applying Artificial Intelligence techniques in technological and industrial environments to improve quality and productivity.
CEP6 - Capability to assimilate and integrate the changing economic, social and technological environment to the objectives and procedures of informatic work in intelligent systems.
CEP8 - Capability to respect the surrounding environment and design and develop sustainable intelligent systems.

Transversal Competences

Appropiate attitude towards work

CT5 - Capability to be motivated for professional development, to meet new challenges and for continuous improvement. Capability to work in situations with lack of information.

Basic

CB7 - Ability to integrate knowledges and handle the complexity of making judgments based on information which, being incomplete or limited, includes considerations on social and ethical responsibilities linked to the application of their knowledge and judgments.

Objectives

Introduction to object and human recognition
Multi-modal object recognition
Multi-part object recognition
Multi-scale object recognition
Multi-view object recognition
Multi-class object recognition
Multi-label object recognition
Multi-ple data: deep-learning for large scale object recognition
Object Recognition in context: scene understanding and grammars
Human Pose Recovery
Human Behavior Analysis
Related competences: CT5, CEA13, CEA14, CEA3, CEA4, CEA6, CB7, CEA8, CEP3, CEP6, CEP8, CG2,

Introduction to object and human recognition
Convolutional neural networks
Recurrent Neural Networks in Vision
Object detection and segmentation
Human pose estimation
Human Behavior
Transformers / self-attention in Vision
Graph Neural Networks in Vision

Activities

Activity Evaluation act

Paper presentation

Objectives: 1
Week: 10

Theory

1.5h

Problems

Laboratory

Guided learning

Autonomous learning

Paper presentation 2

Week: 14

Theory

1.5h

Problems

Laboratory

Guided learning

Autonomous learning

Exam

Week: 15 (Outside class hours)

Theory

Problems

Laboratory

Guided learning

Autonomous learning

30h

Laboratory 1

Week: 2

Theory

Problems

Laboratory

Guided learning

Autonomous learning

Laboratory 2

Week: 5

Theory

Problems

Laboratory

Guided learning

Autonomous learning

Laboratory 3

Week: 8

Theory

Problems

Laboratory

Guided learning

Autonomous learning

Laboratory 4

Week: 12

Theory

Problems

Laboratory

Guided learning

Autonomous learning

Theoretical class

Theory

22.5h

Problems

Laboratory

Guided learning

Autonomous learning

Practical sessions

Theory

Problems

Laboratory

15h

Guided learning

Autonomous learning

Teaching methodology

T Each week it will be a 1.5h theoretical topic exposition class.
P Each week it will be a 1h practical session.
The rest of the course are devoted to autonomous lectures, programming, and studying.

Evaluation methodology

The course will follow a continuous evaluation consisting in four practical reports (PR) and two in-class presentations (PS). At the end of the course a test exam will be performed (TS). The final score (FS) will be computed as follows:
FS = 0.5 * PR + 0.3 * PS + 0.2 * TS
A minimum score of 3 over 10 points is required for each part PR, PS, and TS in order to compute the final score FS.

Bibliography

Basic:

Computer vision: a modern approach - Forsyth, D.A.; Ponce, J, Pearson Education, 2012. ISBN: 0273764144
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991003948569706711&context=L&vid=34CSUC_UPC:VU1&lang=ca
Computer vision: algorithms and applications - Szeliski, R, Springer, 2022. ISBN: 9783030343712
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991005130575906711&context=L&vid=34CSUC_UPC:VU1&lang=ca

Complementary:

Articulated motion and deformable objects: 7th International Workshop, AMDO 2012: proceedings - Sergio Escalera, , .
http://cataleg.upc.edu/record=b1280808~S1*cat
IEEE Transactions on Pattern Analysis and Machine Intelligence - P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan, , .
http://cataleg.upc.edu/record=b1203811~S1*cat
Pattern recognition - C. Gatta, E. Puertas, and O. Pujol, , .
http://cataleg.upc.edu/record=b1243124~S1*cat
Multiple view geometry in computer vision - Hartley, R.; Zisserman, A, Cambridge University Press , 2003. ISBN: 0521540518
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991002686969706711&context=L&vid=34CSUC_UPC:VU1&lang=ca
IEEE Transactions on Pattern Analysis and Machine Intelligence - Sergio Escalera, Oriol Pujol, and Petia Radeva, , .
http://cataleg.upc.edu/record=b1203811~S1*cat
IEEE Transactions in Pattern Analysis and Machine Intelligence - Sergio Escalera, David Tax, Oriol Pujol, Petia Radeva, and Robert Duin, , .
http://cataleg.upc.edu/record=b1203811~S1*cat
Pattern Recognition - Gjorgji Madjarov, Dragi Kocev, Author Vitae, Dejan Gjorgjevikj, Sao Deroski, , .
http://cataleg.upc.edu/record=b1243124~S1*cat
Proceedings of the British Machine Vision Conference (BMVA) - Clocksin, W.F.; Fitzgibbon, A.W.; Torr, P.H.S. (eds.), British Machine Vision Association , 2005.
IEEE Transactions on Pattern Analysis and Machine Intelligence - A. Torralba, R. Fergus, W. T. Freeman, , .
http://cataleg.upc.edu/record=b1203811~S1*cat
Trends in Cognitive Sciences - Oliva, A. Torralba, , .
http://cataleg.upc.edu/record=b1243234~S1*cat
IEEE 11th International Conference on Computer Vision, 14-21 Oct. 2007 - Rabinovich, A. Vedaldi, C. Galleguillos, E. Wiewiora and S. Belongie, IEEE Computer Society , 2007.
A stochastic grammar of images - Zhu, S.-C.; Mumford, D, Now Publishers , 2007. ISBN: 9781601980601
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991004093519706711&context=L&vid=34CSUC_UPC:VU1&lang=ca
Conference on Computer Vision and Pattern Recognition (CVPR), 20-25 June 2011 - Y. Yang, D. Ramanan, IEEE , 2011.
IEEE transactions on pattern analysis and machine intelligence - T. Starner, J. Weaver, and A. Pentland, , .
http://cataleg.upc.edu/record=b1203811~S1*cat
Proceedings of the IEEE - L. Rabiner, , .
http://cataleg.upc.edu/record=b1203818~S1*cat
European Conference on Computer Vision: ECCV 1998: Computer Vision - Burkhardt, H.; Neumann, B. (eds), Springer , 1998.
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991000301229706711&context=L&vid=34CSUC_UPC:VU1&lang=ca
ICML '05: Proceedings of the 22nd international conference on machine learning - S. Calinon, A. Billard, International Machine Learning Society , 2005.
Deep learning - Goodfellow, I.; Courville, A.; Bengio, Y, The MIT Press , 2016. ISBN: 9780262035613
https://www.deeplearningbook.org/
Nature - LeCun, Yann, Yoshua Bengio, and Geoffrey Hinton, , .
http://cataleg.upc.edu/record=b1317875~S1*cat

Object Recognition

Teachers

Person in charge

Others

Weekly hours

Competences

Generic Technical Competences

Generic

Technical Competences of each Specialization

Academic

Professional

Transversal Competences

Appropiate attitude towards work

Basic

Objectives

Contents

Activities

Paper presentation

Paper presentation 2

Exam

Laboratory 1

Laboratory 2

Laboratory 3

Laboratory 4

Theoretical class

Practical sessions

Teaching methodology

Evaluation methodology

Bibliography

Basic:

Complementary:

Where we are

Contact with us

Object Recognition

You are here

Teachers

Person in charge

Others

Weekly hours

Competences

Generic Technical Competences

Generic

Technical Competences of each Specialization

Academic

Professional

Transversal Competences

Appropiate attitude towards work

Basic

Objectives

Contents

Activities

Paper presentation

Paper presentation 2

Exam

Laboratory 1

Laboratory 2

Laboratory 3

Laboratory 4

Theoretical class

Practical sessions

Teaching methodology

Evaluation methodology

Bibliography

Basic:

Complementary:

Where we are

Contact with us