Credits
6
Types
Compulsory
Requirements
- Prerequisite: IE-GIA
Department
EIO
Web
https://www.fib.upc.edu/ca/estudis/graus/grau-en-intelligencia-artificial/pla-destudis/assignatures/
Teachers
Person in charge
- Jordi Cortés Martínez ( jordi.cortes-martinez@upc.edu )
Others
- Dante Conti ( dante.conti@upc.edu )
- Karina Gibert Oliveras ( karina.gibert@upc.edu )
Weekly hours
Theory
2
Problems
0
Laboratory
2
Guided learning
0
Autonomous learning
6
Competences
Transversals
Basic
Especifics
Generic
Objectives
-
Design solvent and goal-oriented test and training games
Related competences: CG8, CT8, CB3, CE09, -
Identify which predictive model is appropriate for a specific problem and specific data
Related competences: CG4, CE01, CE09, CE20, -
Construct and interpret valid models for the temporal evolution of a numerical variable
Related competences: CG4, CT3, CT4, CE01, CE09, CE20, -
Identify classes in a data set and know how to validate and interpret them conceptually
Related competences: CG2, CG4, CT3, CT4, CE01, CE09, CE20, -
Characterize multivariate relationships in a data set with factor analysis techniques
Related competences: CG4, CT3, CT4, CE01, CE09, CE20, -
Be able to do basic unsupervised analysis of a textual database with basic techniques of topic modeling and multivariate analysis by textual data
Related competences: CG4, CT3, CT4, CE01, CE09, CE20, -
Know how to build and validate the right model for a new real situation
Related competences: CG2, CG4, CT3, CT4, CE01, CE09, CE20, -
Know how to integrate the contents of the different topics of this course and the previous ones in a global solution for a complex problem
Related competences: CG2, CE01, CE09, CE20, -
Know how to plan in the long term the modeling of a real complex problem and solve it throughout the course as a team
Related competences: CT3, CT4, CB4,
Contents
-
Generalized linear models
Introduction to the concepts of generalized linear models. Logistics models -
Time series
Introduction to stochastic processes. Timeline vs. Time Series Box-Jenkins MethodologyMain models of time series: MA, AR, ARIMA, SARIMA (concept and case study) -
Factorial analysis
Dimensionality reduction methods -
Clustering
Introduction. Main classification models. Distances. -
Profiling
Description of the classifications from the study of significance of variables -
Experimental design
Complete and fractional 2k designs. Sensitivity and explicability analysis of the models. Identification of main effects and interactions. Design of training sets for machine learning. Design of test sets for validation of data models
Activities
Activity Evaluation act
Teamwork
Students are organized into groups and look for real data that meet certain requirements set by the teacher. They use them to apply the techniques and methodologies that are seen throughout the course. At the end they present a report with the results and make an oral presentation with the most relevant results of the studyObjectives: 1 2 3 4 5 6 7 8 9
Contents:
Theory
0h
Problems
0h
Laboratory
11h
Guided learning
0h
Autonomous learning
27.5h
Practical application syllabus subject
Run R code on the concepts seen in theory.
Theory
0h
Problems
0h
Laboratory
12.5h
Guided learning
0h
Autonomous learning
0h
Quiz 1
During the course there will be short answer tests to fix learning pieces. It will be done at the end of certain lab classesObjectives: 2
Week: 4
Theory
0h
Problems
0h
Laboratory
0h
Guided learning
0h
Autonomous learning
0h
Quiz 4
During the course there will be short answer tests to fix learning pieces. It will be done at the end of certain lab classesObjectives: 4
Week: 11
Theory
0h
Problems
0h
Laboratory
0h
Guided learning
0h
Autonomous learning
0h
Quiz 5
During the course there will be short answer tests to fix learning pieces. It will be done at the end of certain lab classesObjectives: 1
Week: 13
Theory
0h
Problems
0h
Laboratory
0h
Guided learning
0h
Autonomous learning
0h
Teaching methodology
The subject consists of two theory hours and two laboratory hours per weekThe subject's website will contain the subject's calendar and the materials to prepare each class. The theory class will be mainly dedicated to explaining concepts and presenting cases and developing interactive activities with students such as discussing cases, developing problems.
In groups of 4, the students will carry out practical work with data that they will look for themselves and that will meet certain characteristics set by the teachers. With this data, each team will carry out practice sessions, each week applying the techniques of the topic worked on in the theory session. The teacher will monitor all the work teams weekly in the laboratory sessions.
In the middle and at the end of the course, the teams will present their results in a sharing session where all the projects will be discussed together.
Evaluation methodology
Ordinary Evaluation:---------------------
(Q) Quizzes. 20%
(P) Project. 30%
(EF) Final Exam. 50%
Ordinary Final Grade = 0,2 * Q + 0,3 * P + 0,5 * EF
P. It consists of 5 individual and face-to-face questions with the same weight on the final Q grade. These questionnaires will be completed in person and cannot be taken on a day other than the scheduled date even for a justified reason.
Q = (Q1 + Q2 + Q3 + Q4 + Q5)/5
P. Group project where the following competences will be assessed: (P1) Data collection, analysis and interpretation of results and Transmission of results (80%); (P2) Oral and written communication (20%)
P = 0,8 * P1 + 0,2 * P2
You must obtain a minimum grade of 3.5 in the individual and face-to-face tests, i.e.,
2/7 * Q + 5/7 * EF > 3.5 to pass the course. On the other hand, the completion of the project will be mandatory in order to pass during the ordinary evaluation.
Extraordinary evaluation:
---------------------------------
Only those people who, having taken the final exam and failed it, can take the Extraordinary Final Exam.
(EF) Extraordinary Final Exam
Extraordinary Grade = Min{7, Max{EE, 0,2 * Q + 0,3 * P + 0,5 * EE}}
In this exam, there will be no minimum passing grade. The maximum grade for this exam is a 7.
Bibliography
Basic
-
Practical statistics for data scientists: 50+ essential concepts using R and Python
- Bruce, Peter; Bruce, Andrew; Gedeck, Peter,
O'Reilly,
[2020].
ISBN: 9781492072942
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991004946307706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Data analysis and graphics using R : an example-based approach
- Maindonald, J. H; Braun, John,
Cambridge University,
2010.
ISBN: 9780521762939
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991003210549706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Forecasting: principles and practice
- Hyndman, R.J.; Athanasopoulos, G,
O Texts,
2021.
ISBN: 9780987507136
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991005164678006711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Practical time series analysis: prediction with statistics and machine learning
- Nielsen, Aileen,
O'Reilly Media, Inc,
2019.
ISBN: 9781492041658
Previous capacities
Introduction to StatisticsProbability theory
statistical inference
simple statistical models
data visualization
basic programming
R basic skills
Algebra