Skip to main content

Statistical Modelling and Design of Experiments

Credits
6
Types
Compulsory
Requirements
This subject has not requirements , but it has got previous capacities
Department
EIO
The aim of the course is to provide students with the tools needed to cope with complex systems using statistical modeling techniques. The students also learn different techniques of experimental design.

Teachers

Person in charge

  • Pau Fonseca Casas (pau@fib.upc.edu)

Others

  • Lidia Montero Mercadé (lidia.montero@upc.edu)
  • Nihan Acar Denizli (nihan.acar.denizli@upc.edu)

Weekly hours

Theory
1
Problems
1
Laboratory
2
Guided learning
0
Autonomous learning
6

Competences

Computer networks and distributed systems

  • CEE2.3 - Capability to understand models, problems and mathematical tools to analyze, design and evaluate computer networks and distributed systems.
  • High performance computing

  • CEE4.1 - Capability to analyze, evaluate and design computers and to propose new techniques for improvement in its architecture.
  • Generic

  • CG1 - Capability to apply the scientific method to study and analyse of phenomena and systems in any area of Computer Science, and in the conception, design and implementation of innovative and original solutions.
  • CG3 - Capacity for mathematical modeling, calculation and experimental designing in technology and companies engineering centers, particularly in research and innovation in all areas of Computer Science.
  • Information literacy

  • CTR4 - Capability to manage the acquisition, structuring, analysis and visualization of data and information in the area of informatics engineering, and critically assess the results of this effort.
  • Reasoning

  • CTR6 - Capacity for critical, logical and mathematical reasoning. Capability to solve problems in their area of study. Capacity for abstraction: the capability to create and use models that reflect real situations. Capability to design and implement simple experiments, and analyze and interpret their results. Capacity for analysis, synthesis and evaluation.
  • Objectives

    1. Applying the mathematical formalism to solve problems involving uncertainty.
      Related competences: CTR4, CTR6, CG1, CG3,
    2. Applying the queuing models for computer systems performance evaluation and/or configurations analysis.
      Related competences: CEE2.3, CTR6, CEE4.1,
    3. Ability to design, conduct experiments and analyze results.
      Related competences: CTR4, CTR6, CG1, CG3,

    Contents

    1. Introduction to probability
      Students should feel comfortable with the use of set notation and basic statistical terminology. Likewise, the student should be able to write the sample space of simple experiments, including sampling with replacement (like throwing coins or throwing dice), sampling without replacement, from Bernoulli trials and with rules of detention. Likewise, the student should be able to calculate the probabilities in simple cases of the above type of experiment.
    2. Introduction to statistical estimation
      Estimation, in the framework of statistical inference, is the set of techniques with the aim of give an approximate value for a parameter of a population from data provided by a sample. From the different methods that exist (point estimate, estimate intervals, or Bayesian estimation) we focus on the point estimate.
    3. Analysis of data
      The main objective of the section is to know the procedures associated with the analysis of variance (ANOVA terminology in English) and when is useful to be applied.This activity also introduces MANOVA, as a technique useful when there are two or more dependent variables. We also work with the techniques of linear regression and PCA, completing the repertoire of tools for data analysis.
    4. Introduction to experimental design
      Statistical experimental design, a.k.a. design of experiments (DoE) is the methodology of how to conduct and plan experiments in order to extract the maximum amount of information in the fewest number of runs (saving resources). In this section we describe different techniques to achieve that.
    5. Introduction to queuing theory and simulation
      This section will introduce the student to use the techniques of operations research for systems analysis for making quantitative decision in the presence of uncertainty through their representation in terms of queuing models and simulation.

    Activities

    Activity Evaluation act


    Introduction to probability

    At the end of this activity the Student must be comfortable with using basic set notation and terminology. Also the Student must be capable of write down the sample space for simple experiments, including sampling with replacement (such as tossing coins or rolling dice), sampling without replacement, and Bernoulli trials with stopping rules. Also the Student must be capable of calculate probabilities in straightforward instances of the above types of experiment.
    Objectives: 1
    Contents:
    Theory
    1h
    Problems
    1h
    Laboratory
    2h
    Guided learning
    0h
    Autonomous learning
    5h

    Introduction to statistical estimation

    Estimation, in the framework of statistical inference, is the set of techniques with the aim of give an approximate value for a parameter of a population from data provided by a sample. From the different methods that exist (point estimate, estimate intervals, or Bayesian estimation) we focus on the point estimate.
    Objectives: 1
    Contents:
    Theory
    2h
    Problems
    2h
    Laboratory
    4h
    Guided learning
    0h
    Autonomous learning
    8h

    ANalysis Of VAriance

    The main objective of the activity is to know the procedures associated with the analysis of variance (ANOVA terminology in English) and when is useful to be applied.This activity also introduces MANOVA, as a technique useful when there are two or more dependent variables.
    Objectives: 1
    Contents:
    Theory
    1h
    Problems
    1h
    Laboratory
    2h
    Guided learning
    0h
    Autonomous learning
    5h

    Linear regression

    Linear regression is a mathematical method that models the relationship between a dependent variable Y, independent variables Xi and a random term. This section will examine this method and explain its applicability from different examples.
    Objectives: 1
    Contents:
    Theory
    1h
    Problems
    1h
    Laboratory
    2h
    Guided learning
    0h
    Autonomous learning
    6h

    Principal component analysis

    The principal component analysis (PCA, PCA in English), in statistics, is a technique that reduces the dimensionality of a dataset. This allows us to represent them graphically in two or three dimensional graphs of various variables grouped the data into factors, or components, consisting of the grouping variables. In this section we will work this technique from a practical point of view.
    Objectives: 1
    Contents:
    Theory
    1h
    Problems
    1h
    Laboratory
    2h
    Guided learning
    0h
    Autonomous learning
    6h

    Factorial design

    Many experiments are conducted to study the effects of two or more factors. in this case the factorial designs are more efficient, presented in this section.
    Objectives: 3
    Contents:
    Theory
    3h
    Problems
    3h
    Laboratory
    9h
    Guided learning
    0h
    Autonomous learning
    12h

    Randomized blocks, Latin squares and related designs

    In many research problems is necessary to design experiments that can systematically control the variability caused by different sources. This section will consider some experimental designs for solve these situations.
    Objectives: 3
    Contents:
    Theory
    1h
    Problems
    1h
    Laboratory
    2h
    Guided learning
    0h
    Autonomous learning
    6h

    Incomplete block design

    Description incomplete blocks design, useful when you can not develop all combinations of treatment within each block.
    Objectives: 3
    Contents:
    Theory
    1h
    Problems
    1h
    Laboratory
    2h
    Guided learning
    0h
    Autonomous learning
    6h

    General structure of queuing models

    Introduction to the theory of queue models. Notation Kendall. Discreet simulation using Event Schedulling.
    Objectives: 2
    Contents:
    Theory
    1h
    Problems
    1h
    Laboratory
    2h
    Guided learning
    0h
    Autonomous learning
    5h

    Queuing models based on birth and death processes

    Introduction to basic concepts and elements of the analysis of Markov processes. Markov queues.
    Objectives: 2
    Contents:
    Theory
    1h
    Problems
    1h
    Laboratory
    2h
    Guided learning
    0h
    Autonomous learning
    5h

    Generalized queuing patterns with non-exponential distributions and serial exponential queues.

    Networks of queues: open and closed networks. Introduction to general service distributions and multiple types of work.
    Objectives: 2
    Contents:
    Theory
    1h
    Problems
    1h
    Laboratory
    2h
    Guided learning
    0h
    Autonomous learning
    5h

    Validation Verification and Accreditation

    Techniques to Verify, Validate and do the Accreditation of the models.
    Objectives: 2
    Contents:
    Theory
    1h
    Problems
    1h
    Laboratory
    2h
    Guided learning
    0h
    Autonomous learning
    5h

    First report


    Objectives: 1
    Week: 5
    Theory
    0h
    Problems
    0h
    Laboratory
    0h
    Guided learning
    0h
    Autonomous learning
    0h

    Second report


    Objectives: 2
    Week: 10
    Theory
    0h
    Problems
    0h
    Laboratory
    0h
    Guided learning
    0h
    Autonomous learning
    0h

    Third report


    Objectives: 3
    Week: 15
    Theory
    0h
    Problems
    0h
    Laboratory
    0h
    Guided learning
    0h
    Autonomous learning
    0h

    Final exam


    Objectives: 1 2 3
    Week: 15
    Theory
    0h
    Problems
    0h
    Laboratory
    0h
    Guided learning
    0h
    Autonomous learning
    0h

    Teaching methodology

    The course is practical and aims that students will be able, once the course is completed and from the work done in the sessions, to solve real problems similar to those developed in class.

    Evaluation methodology

    The course will have different exercises that the students must solve during the course (80% of the final grade).
    At the end there will be an exam that will weigh 20% of the final grade.

    Bibliography

    Basic

    Complementary

    Web links

    Previous capacities

    Students must have sufficient knowledge of algebra and mathematical analysis to assimilate the concepts related to algebra of sets, numerical series, functions of real variables of one or more dimensions, derivation and integration.