Credits
6
Types
Compulsory
Requirements
This subject has not requirements
, but it has got previous capacities
Department
EIO
Teachers
Person in charge
- Jan Graffelman ( jan.graffelman@upc.edu )
- Jose Antonio Sánchez Espigares ( josep.a.sanchez@upc.edu )
Others
- Nihan Acar Denizli ( nihan.acar.denizli@upc.edu )
Weekly hours
Theory
2
Problems
0
Laboratory
2
Guided learning
0
Autonomous learning
6
Competences
Technical competencies
Transversals
Basic
Generic
Objectives
-
Exploratory Data Analysis
Related competences: CB2, CB4, CT3, CT5, CT6, CT7, CE1, CE2, CE3, CE4, CE8, CG1, CG3, CG4,
Subcompetences- Clustering. Profiling.
- Pre-processing. Outliers, missing values. Transformations
- PCA, SVD, Factor Analysis. Multidmensional Scaling.
- Correspondence Analysis. Multiple Correspondence Analysis.
-
Discriminant Analysis with probabilistic hypothesis
Related competences: CT3, CT4, CT5, CT6, CT7, CE1, CE3, CE8, CG2, CG3,
Subcompetences- Linear Discriminat Analisis, Discriminació de Fisher. Quadratic Discriminant Analisis.
- Normal multivariate distribution. Sampling distributions.
-
Multivariate modeling
Related competences: CT4, CT6, CT7, CE1, CE3, CE8, CG1, CG2, CB2, CG4,
Subcompetences- Multivariate Regression
- Canonical Correlation Analysis
- Principal Component Regression, Partial Least Squares Regression
-
Time series
Related competences: CT6, CE1, CE3, CE8,
Subcompetences- Outlier, Calendar Effects and Intervention Analysis
- Univariate models of time series
- Applications of the Kalman Filter
Contents
-
Data preprocessing
Outliers, missing data and transformations -
Principal component analysis
Multivariate description of a table of continous variables. Regression with principal components. -
Factor analysis
The singular value decomposition, biplots, factor analysis -
Multidimensional scaling (MDS)
Distance measures. Metric multidimensional scaling. Algorithms. -
Cluster analysis
Hierarchical clustering techniques. Agglomeration methods. Ward's criterion. Dendrogram. -
Correspondence analysis
Contingency tables. Row and column profiles. Independence and chi-square statistics. Simple correspondence analysis. Biplot. -
Discriminant analysis
Multivariate normal distribution. Fisher's linear discriminant analysis. -
Univariate time series models
Exponential smoothing, ARIMA models -
Intervention analysis
Outliers, seasonal effects, intervention analysis.
Activities
Activity Evaluation act
Theory
4h
Problems
0h
Laboratory
4h
Guided learning
0h
Autonomous learning
4h
Principal component analysis
Application of principal component analysis in practical data analysisObjectives: 1
Contents:
Theory
4h
Problems
0h
Laboratory
4h
Guided learning
0h
Autonomous learning
6h
Theory
2h
Problems
0h
Laboratory
3h
Guided learning
0h
Autonomous learning
4h
Theory
2h
Problems
0h
Laboratory
2h
Guided learning
0h
Autonomous learning
4h
Clustering
Application of the method to quantitative data matrices.
Theory
4h
Problems
0h
Laboratory
4h
Guided learning
0h
Autonomous learning
4h
Theory
2h
Problems
0h
Laboratory
2h
Guided learning
0h
Autonomous learning
4h
Theory
4h
Problems
0h
Laboratory
4h
Guided learning
0h
Autonomous learning
4h
Univariate time series models
Fitting time series models to data sets on the computerObjectives: 4
Contents:
Theory
4h
Problems
0h
Laboratory
4h
Guided learning
0h
Autonomous learning
6h
Intervention analysis
Application of intervention analysis to real data setsObjectives: 4
Contents:
Theory
2h
Problems
0h
Laboratory
3h
Guided learning
0h
Autonomous learning
4h
Project
Students realize, in couples, a complete multivariate study of a certain dataset using the techniques they studied during the course, and hand in a written report about it.Objectives: 1 2 3 4
Week: 15 (Outside class hours)
Theory
0h
Problems
0h
Laboratory
0h
Guided learning
0h
Autonomous learning
0h
Teaching methodology
The learning process is a combination of theoretical explanation and practical application. The theory classes are used to explain the basic scientific contents of the course, whereas the laboratory sessions work on their application to solve real-life problems.Practicals and project form the basis for working out the transversal competences of the students, related to team-work and public presentation of results. Practicals and project also serve to integrate the different pieces of knowledge of the course.
For hands-on computer training we use the R statistical environment.
Evaluation methodology
The student's final grade for the course is based on grades obtained for weekly homework assignments (25%), a partial exam half-way the course (25%), a final exam covering the second half of the course (25%) and a project (25%).Each weekly assignments consists of resolving a questionnaire. These assigments aim at consolidating knowledge of the techniques exposed in the theoretical sessions. The assignments require analysis of datasets in the statistical environment R.
A project is carried out by a group of two students, and students have to show they can resolve problems with the techniques they have learned during the course. Each group hands in a written report about their project at the end of the course.
The two exams will be programmed according to the calendar of the faculty, and evaluate if students have assimilated the basic concepts of the material of the course.
For the resit exam, the student can choose to do a re-examination of only the first partial (25%), or of only the second partial (25%), or of both partials (50%). The re-evaluation thus represents at most 50% of the final course grade.
Bibliography
Basic
-
Multivariate statistical methods: a primer
- Manly, B.F.J.; Navarro, J.A,
CRC Press, Taylor & Francis Group,
2017.
ISBN: 9781498728966
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991004178359706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Applied multivariate statistical analysis
- Johnson, R.A.; Wichern, D.W,
Pearson,
2014.
ISBN: 9781292024943
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991004175889706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Análisis de datos multivariantes
- Peña, D,
McGraw-Hill,
cop. 2002.
ISBN: 9788448136109
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991002497609706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Nuevos métodos de análisis multivariante
- Cuadras, C.M,
CMC Ediciones,
2012.
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991000916409706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Time series analysis and its applications: with R examples
- Shumway, R.H.; Stoffer, D.S,
Springer,
2017.
ISBN: 9783319524511
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991004156569706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Course slides for Multivariate Analysis (in English)
- Graffelman, Jan,
Complementary
-
Multivariate analysis
- Mardia, K.V; Kent, J.T; Bibby, J.M,
Academic Press,
1979.
ISBN: 0124712509
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991000218529706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
An introduction to multivariate statistical analysis
- Anderson, T.W,
Wiley,
2003.
ISBN: 0471360910
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991002604589706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Aprender de los datos: el análisis de componentes principales: una aproximación desde el Data Mining
- Aluja, T.; Morineau, A,
EUB,
1999.
ISBN: 8483120224
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991001877509706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Time series analysis: forecasting and control
- Box, G.E.P.; Jenkins, G.M.; Reinsel, G.C.; Ljung, G.M,
Wiley,
2016.
ISBN: 9781118675021
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991004156549706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Análisis de series temporales
- Peña, D,
Alianza,
2010.
ISBN: 9788420669458
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991004087859706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Time series: theory and methods
- Brockwell, P.J.; Davis, R.A,
Springer-Verlag,
1991.
ISBN: 9781441903198
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991000762229706711&context=L&vid=34CSUC_UPC:VU1&lang=ca