Teachers
Person in charge
- José Antonio González Alastrue ( jose.a.gonzalez@upc.edu )
Others
- Bhumika Ashvinbhai Patel ( bhumika.patel@upc.edu )
- Jan Moreno Monteys ( jan.moreno.monteys@upc.edu )
- Jordi Cortés Martínez ( jordi.cortes-martinez@upc.edu )
- Klaus Gerhard Langohr ( klaus.langohr@upc.edu )
- Mario Agustiño Batet ( mario.agustino@upc.edu )
- Mercè Vila Albacete ( merce.vila.albacete@upc.edu )
- Mireia Besalú Mayol ( mireia.besalu@upc.edu )
- Mireia Lopez Beltran ( mireia.lopez.beltran@upc.edu )
- Nuria Perez Alvarez ( nuria.perez@upc.edu )
Weekly hours
Theory: 2.4
Problems: 2
Laboratory: 0
Guided learning: 0
Autonomous learning: 5.6
Competences
Common technical competencies
- CT1.2A - To interpret, select and evaluate concepts, theories, uses and technological developments related to computer science and its applications, based on the necessary fundamentals of mathematics, statistics and physics. Capacity to solve the mathematical problems that arise in engineering. Ability to apply knowledge of algebra, differential and integral calculus, numerical methods, statistics and optimization.
- CT8.3 - To demonstrate knowledge and be able to apply appropriate techniques for modelling and analysing different kinds of decisions.
Reasoning
- G9.2 - Analysis and synthesis capacity, capacity to solve problems in its field, and to interpret the results in a critical way. Abstraction capacity: capacity to create and use models which reflect real situations. Capacity to design and perform simple experiments and to analyse and interpret their results in a critical way.
Objectives
- Define and calculate probabilities for a random experiment.
  Related competences: CT1.2A
- Calculate conditional and joint probabilities, detect whether a random experiment with two variables shows (in)dependence, and apply Bayes' theorem to find the conditional probabilities for the other variable.
  Related competences: CT1.2A
- Graphically represent a random experiment.
  Related competences: CT1.2A
- Calculate mean and variance for given probability and distribution functions for a discrete random variable.
  Related competences: CT1.2A
- Identify the most appropriate theoretical model to represent a given random variable from among the following: Bernoulli, binomial, Poisson, geometric, Normal, uniform and exponential.
  Related competences: CT1.2A
- Calculate cumulative probabilities for given values and parameters of the theoretical models with the help of tables or R; conversely, find the random variable values that correspond to desired cumulative probabilities.
  Related competences: CT1.2A, CT8.3, G9.2
  Subcompetences: They should also be able to identify and analyse theoretical models suitable for different IT situations.
- Calculate and interpret the covariance and correlation values for two random variables.
  Related competences: CT1.2A
- Calculate, using sample data, statistics that reflect central tendency (mean) and dispersion (variance and standard deviation).
  Related competences: CT1.2A
- From sample statistics obtained from a simple random sample (s.r.s.), compute confidence intervals for certain parameters. For example: from the mean, the standard deviation and the sample size of a Normally distributed variable, calculate the 95% CI.
  Related competences: CT1.2A
- Based on a hypothesis and the sample mean and standard deviation for a Normally distributed variable, calculate the P-value and justify the evidence against the hypothesis.
  Related competences: CT1.2A, CT8.3
- From the data of a comparative test (e.g., performance of two computer products), use a confidence interval to obtain a range of plausible values for the difference in the outcome.
  Related competences: CT1.2A, CT8.3
- Using the summary of the model, obtain and interpret the estimators of the model, compute and interpret the R-squared coefficient, obtain the estimators of the uncertainty of the estimate and build a CI for the population values.
  Related competences: CT1.2A
- Make predictions and assess their degree of uncertainty using summary data from the fitted model.
  Related competences: CT1.2A, CT8.3
- Based on the graphs of the fitted model, analyze the assumptions of the model and, if necessary, propose transformations of the variables.
  Related competences: CT1.2A, CT8.3
- Design a prediction study, collect data and analyse and interpret results.
  Related competences: CT1.2A, CT8.3, G9.2
  Subcompetences: They should also be able to use the collected data to describe tendency and dispersion characteristics in numerical and graphical terms.
- Identify, for a deterministic process, variability sources and magnitudes.
  Related competences: CT1.2A, CT8.3, G9.2
  Subcompetences: They should also be able to use collected data to describe tendency and dispersion characteristics in numerical and graphical terms.
- Design a comparative test of computer products, collect data and analyse and interpret results.
  Related competences: CT1.2A, CT8.3, G9.2
  Subcompetences: They should also be able to use the collected data to describe tendency and dispersion characteristics in numerical and graphical terms.
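As an illustration of the confidence-interval and P-value objectives above, the following is a minimal sketch in Python (the course itself works with R). It assumes a large sample, so that the Normal quantile can stand in for the Student t quantile; the function names and the sample figures are hypothetical and not taken from the course material.

```python
from math import sqrt
from statistics import NormalDist


def ci95_mean(mean, sd, n):
    """Approximate 95% CI for a population mean from summary statistics.

    Assumption: n is large enough to use the Normal quantile (about 1.96).
    """
    z = NormalDist().inv_cdf(0.975)
    half_width = z * sd / sqrt(n)
    return mean - half_width, mean + half_width


def p_value_two_sided(mean, sd, n, mu0):
    """Two-sided P-value for H0: mu = mu0, using the Normal approximation."""
    z = (mean - mu0) / (sd / sqrt(n))
    return 2 * (1 - NormalDist().cdf(abs(z)))


# Hypothetical sample: 100 response times with mean 52 ms and sd 10 ms.
low, high = ci95_mean(52.0, 10.0, 100)
p = p_value_two_sided(52.0, 10.0, 100, mu0=50.0)
print(f"CI95%: ({low:.2f}, {high:.2f})")
print(f"P-value: {p:.4f}")
```

For small samples the t distribution would be the usual choice; it is left out here only to keep the sketch within the Python standard library.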
Contents
- Block A. Probability and random variables
  Random experiment. Probability, conditional probability, joint probability. Definition of random variable and types. Probability function, probability density function and probability distribution function. Joint probability function. Indicators: expectation, variance, standard deviation, covariance, correlation. Independence between two random variables.
- Block B. Probabilistic models
  Parameterised theoretical models of random variables. Direct and inverse probability computation with R. Introduction to simulation. Sample mean distribution. Central Limit Theorem, Normal approximations.
- Block C. Basis of statistics
  Population and sample. Parameter, statistic and estimator. Bias of an estimator. Confidence interval for a parameter, and for the difference of two parameters. Hypothesis testing.
- Block D. Statistical models and forecasting
  Comparison of two groups, paired design, independent samples. Linear model. Goodness-of-fit indicators. Validation of assumptions. Introduction to data science. Experimental and observational studies. Ethics of science, research waste.
- Block T. Application
  Identifying sources of variability in computer processes. Design of a study with planning of the goal, data collection, statistical analysis with R and interpretation of results.
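The direct and inverse probability computations listed under Block B can be sketched as follows. This is a hedged illustration in Python rather than the R used in class; the helper names `binomial_cdf` and `exponential_cdf` are hypothetical, and the comments name the R functions that play the same role.

```python
from math import comb, exp
from statistics import NormalDist


def binomial_cdf(k, n, p):
    """Direct probability P(X <= k) for X ~ Binomial(n, p); R: pbinom(k, n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k + 1))


def exponential_cdf(x, rate):
    """Direct probability P(X <= x) for X ~ Exponential(rate); R: pexp(x, rate)."""
    return 1 - exp(-rate * x)


standard_normal = NormalDist(mu=0, sigma=1)

# Direct: value -> cumulative probability (R: pnorm).
direct = standard_normal.cdf(1.96)

# Inverse: cumulative probability -> value (R: qnorm).
inverse = standard_normal.inv_cdf(0.975)

print(f"P(Z <= 1.96) = {direct:.4f}")
print(f"z with P(Z <= z) = 0.975: {inverse:.4f}")
print(f"P(X <= 1), X ~ Bin(2, 0.5): {binomial_cdf(1, 2, 0.5):.2f}")
```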
Activities
Activity Evaluation act
Block A activities. Probability and random variables
Situate probability and statistics, especially in the IT field. Provide a grounding in probability. Be able to calculate and analyze joint and conditional probabilities. Analyze whether there is independence or not. Define random variable (RV), discrete and continuous RV. Define probability function, cumulative probability function and joint probability function. Relate RV indicators to sample indicators.
- Theory: Tests to monitor pre-reading and study. Explanation of topics: foundations of probability and statistics; independence and conditional and joint probabilities; definition of random variable and of probability and probability distribution functions; random variable metrics and relationship to sample indicators.
- Problems: Model examples of the topics. Follow-up tests. Completion of set exercises. Discussion of results.
- Guided learning: Problem resolution in a mid-semester or final exam.
- Autonomous learning: Study of materials before the theory sessions. Problem completion in E-status.
Theory: 6h
Problems: 6h
Laboratory: 0h
Guided learning: 0h
Autonomous learning: 15h
Block B activities. Probabilistic models
Define the theoretical, discrete and continuous models typically used in the IT field and their characteristics and parameters.
- Theory: Tests to monitor pre-reading and study. Explanation of topics: define the theoretical, discrete and continuous models typically used in the IT field and their characteristics and parameters, and calculate direct and inverse probabilities with the defined models.
- Problems: Model examples of the topics. Follow-up tests. Completion of set exercises. Discussion of results.
- Guided learning: Problem resolution in a mid-semester or final exam.
- Autonomous learning: Study of materials before the theory sessions. Problem completion in E-status.
Theory: 6h
Problems: 6h
Laboratory: 0h
Guided learning: 0h
Autonomous learning: 15h
Block C activities. Basis of statistics
Basic population, sampling, parameter and estimator concepts. Introduction to statistics; definition and linking of confidence intervals (CI) and hypothesis testing (HT).
- Theory: Tests to monitor pre-reading and study. Explanation of topics: definition of sample, parameter, estimator and statistic for constructing confidence intervals (CI) and description of the statistics defining the more interesting CIs and HTs in an IT setting.
- Problems: Model examples of the topics. Follow-up tests. Completion of set exercises. Discussion of results.
- Guided learning: Problem resolution in a mid-semester or final exam.
- Autonomous learning: Study of materials before the theory sessions. Problem completion in E-status.
Theory: 6h
Problems: 6h
Laboratory: 0h
Guided learning: 0h
Autonomous learning: 15h
Block D activities. Statistical models and forecasting
Definition of statistical models. Analysis of variability. Paired design and independent samples. Linear model. Validation of assumptions, possible transformations, predictions. Some models for data science. Implications of research.
- Theory: Tests to monitor pre-reading and study. Explanation of topics: define the suitable model, validate it and analyse transformations, obtain effect estimates and make predictions.
- Problems: Model examples of the topics. Follow-up tests. Completion of set exercises. Discussion of results.
- Guided learning: Problem resolution in a mid-semester or final exam.
- Autonomous learning: Study of materials before the theory sessions. Problem completion in E-status.
Theory: 6h
Problems: 6h
Laboratory: 0h
Guided learning: 0h
Autonomous learning: 15h
Application activities
Identify problems in the IT field for a probability or statistical study. Design a study, collect data and analyse and interpret results. Summarise conclusions critically.
- Theory: Propose and provide guidance for the probability and/or statistics studies performed by students. Monitor studies and encourage synthetic and critical evaluations.
- Problems: Guidance and monitoring of probability and/or statistics studies. Guidance and monitoring regarding practical probability and statistics components.
- Autonomous learning: Research computer situations where a probability or statistical study is necessary. Study design, data collection, results analysis and interpretation.
Theory: 6h
Problems: 6h
Laboratory: 0h
Guided learning: 0h
Autonomous learning: 12h
Final Exam
Covers all the topics.
Week: 15 (Outside class hours)
Theory: 0h
Problems: 0h
Laboratory: 0h
Guided learning: 0h
Autonomous learning: 0h
Teaching methodology
The subject is based on the student's active learning, guided by the teacher with the help of e-status (an interactive platform that, with individualized data in the exercises, supports both assessment and learning through immediate feedback).
The teaching scheme of the 4 specific blocks consists of repeated cycles of: presentation of theoretical concepts, numerical resolution of exercises, support for exercises with R (using laptops in the classroom or in the laboratory), follow-up tests set by the teacher, and autonomous practice of exercises.
The application block develops the transversal competence through group work on a specific case proposed by the students, analysed with R under the teacher's direction.
Evaluation methodology
The course grade is obtained through continuous assessment (AC) during the 15 weeks of class or through the final exam (EF).
PE is divided into 5 topics or blocks: 4 specific (A, B, C, D) and one cross-disciplinary statistical application topic (T).
Each block results in a Block Grade (NB.i, i = A,B,C,D,T). The following formula is applied in the AC:
AC = [ 3 NB.A + 3 NB.B + 3 NB.C + 3 NB.D + 2 NB.T ] / 14
If AC >= 5, the student is exempt from the final exam.
Please note that the EF may consider the grade for the transversal competence:
EF = max{ ef, (12 ef + 2 NB.T) / 14 }
where "ef" is the actual grade for the final exam.
The course grade of the subject PE is max(AC, EF).
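The AC, EF and course grade formulas above can be checked with a short sketch. It is written in Python purely for illustration; the function names and the example grades are hypothetical, and treating a missing final exam as `None` is an assumption of this sketch, not something the course guide specifies.

```python
def continuous_assessment(nb):
    """AC = [3 NB.A + 3 NB.B + 3 NB.C + 3 NB.D + 2 NB.T] / 14."""
    specific = nb["A"] + nb["B"] + nb["C"] + nb["D"]
    return (3 * specific + 2 * nb["T"]) / 14


def final_exam_grade(ef, nb_t):
    """EF = max{ ef, (12 ef + 2 NB.T) / 14 }, where ef is the raw exam mark."""
    return max(ef, (12 * ef + 2 * nb_t) / 14)


def course_grade(nb, ef=None):
    """Course grade max(AC, EF); assumption: ef=None means no final exam taken."""
    ac = continuous_assessment(nb)
    if ef is None:
        return ac
    return max(ac, final_exam_grade(ef, nb["T"]))


# Hypothetical block grades on a 0-10 scale.
block_grades = {"A": 6.0, "B": 5.0, "C": 7.0, "D": 4.0, "T": 8.0}
ac = continuous_assessment(block_grades)
grade = course_grade(block_grades, ef=6.0)
print(f"AC = {ac:.2f}, course grade = {grade:.2f}")
```

With these hypothetical marks, the T-block grade lifts the final exam result above the raw exam mark, which is the effect the EF formula is designed to allow.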
The grade for the transversal competence is:
A if NB.T >= 8.5; B if 6.5 <= NB.T < 8.5; C if 5 <= NB.T < 6.5; and D if NB.T < 5
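The thresholds for the transversal competence grade map onto a simple lookup. The sketch below is in Python for illustration only, and the function name is hypothetical.

```python
def transversal_letter(nb_t):
    """Map the T-block grade NB.T (0-10) to the letter grade A, B, C or D."""
    if nb_t >= 8.5:
        return "A"
    if nb_t >= 6.5:
        return "B"
    if nb_t >= 5:
        return "C"
    return "D"


# Hypothetical T-block grades, including the boundary values.
for value in (9.0, 8.5, 6.5, 5.0, 4.9):
    print(value, transversal_letter(value))
```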
Calculating NB.i grades:
- The first 4 blocks are assessed through a Block Problem (PB.i, i = A,B,C,D) in a mid-term exam outside class hours. Usually there are 2 exams, which together yield the grades for the 4 blocks.
In addition, a Block Monitoring factor (SB.i, i = A,B,C,D) is obtained for each of the four theoretical blocks, based on 3 tests: 2 written tests done in the classroom, and a mark for problems solved outside the classroom. The SB.i factor increases the grade for the corresponding Block Problem (PB.i) to obtain the Block Grade according to:
NB.i = min (10, PB.i * SB.i) for i = A,B,C,D
(SB.i factor is 1 + Sum pj, where pj is a number between 0 and 0.05, coming from the different block monitoring tests; the exact number of tests may be less than 3 if there are unforeseen changes to the school calendar, with consequent loss of classes).
- The T-Block grade (NB.T) is calculated on the basis of two reports and a final presentation.
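The Block Grade rule NB.i = min(10, PB.i * SB.i), with SB.i = 1 + Sum pj, can be sketched as below. This is a Python illustration with hypothetical function names and marks; rejecting bonuses outside the stated 0 to 0.05 range is a choice of this sketch.

```python
def monitoring_factor(bonuses):
    """SB.i = 1 + sum of pj, where each pj lies between 0 and 0.05."""
    for p in bonuses:
        if not 0 <= p <= 0.05:
            raise ValueError(f"bonus {p} is outside the range [0, 0.05]")
    return 1 + sum(bonuses)


def block_grade(pb, bonuses):
    """NB.i = min(10, PB.i * SB.i) for the specific blocks A, B, C, D."""
    return min(10, pb * monitoring_factor(bonuses))


# Hypothetical Block Problem mark of 7.0 with three monitoring tests.
nb_example = block_grade(7.0, [0.05, 0.03, 0.02])
# A high mark is capped at 10 even with the maximum bonus.
nb_capped = block_grade(9.8, [0.05, 0.05, 0.05])
print(f"NB = {nb_example:.2f}, capped NB = {nb_capped}")
```

With three tests the factor can reach at most 1.15, so monitoring can raise a Block Problem mark by up to 15%, subject to the cap at 10.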
Bibliography
Basic
- González, J.A.; Cobo, E.; Muñoz, P.; Martí-Recober, M. Estadística per a enginyers informàtics. Edicions UPC, 2008. ISBN: 9788483019535
  https://discovery.upc.edu/discovery/fulldisplay?docid=alma991003417199706711&context=L&vid=34CSUC_UPC:VU1&lang=ca
- Baron, M. Probability and statistics for computer scientists. CRC Press, 2019. ISBN: 9781138044487
  https://discovery.upc.edu/discovery/fulldisplay?docid=alma991004181089706711&context=L&vid=34CSUC_UPC:VU1&lang=ca
- Wonnacott, T.H.; Wonnacott, R.J. Introducción a la estadística. Limusa, 1997. ISBN: 9681845099
  https://discovery.upc.edu/discovery/fulldisplay?docid=alma991001638289706711&context=L&vid=34CSUC_UPC:VU1&lang=ca
Complementary
- Horgan, J.M. Probability with R: an introduction with computer science applications. Wiley, 2009. ISBN: 9780470280737
  https://discovery.upc.edu/discovery/fulldisplay?docid=alma991003780759706711&context=L&vid=34CSUC_UPC:VU1&lang=ca
- Peña, D. Estadística: modelos y métodos: vol.1: fundamentos. Alianza, 1986. ISBN: 8420689931
  https://discovery.upc.edu/discovery/fulldisplay?docid=alma991001319579706711&context=L&vid=34CSUC_UPC:VU1&lang=ca
- Ross, S.M. Probability models for computer science. Harcourt/Academic Press, 2002. ISBN: 9780125980517
  https://discovery.upc.edu/discovery/fulldisplay?docid=alma991003271789706711&context=L&vid=34CSUC_UPC:VU1&lang=ca
- Matloff, N.S. From algorithms to Z-Scores: probabilistic and statistical modeling in computer science. University Press of Florida, 2009. ISBN: 9781616100360
  https://discovery.upc.edu/discovery/fulldisplay?docid=alma991000807539706711&context=L&vid=34CSUC_UPC:VU1&lang=ca
Web links
- e-status server https://estatus.upc.edu
- Probability and Statistics web page, with assorted material https://www-eio.upc.edu/teaching/pe/
- Repository of Shiny apps https://www-eio.upc.edu/~josean/shinyweb/jag_shiny.php