Introduction to Machine Learning

You are here

Credits
6
Types
Compulsory
Requirements
This subject has not requirements, but it has got previous capacities
Department
CS
This course introduces the concept and types of machine learning. It describes how to design and perform consistent experiments, and the most common methodologies for doing so. It also presents the main machine learning algorithms, for each of the main categories of existing problems.

Teachers

Person in charge

  • Dario García Gasulla ( )

Others

  • Jordi Luque Serrano ( )

Weekly hours

Theory
2
Problems
0
Laboratory
2
Guided learning
0
Autonomous learning
6

Competences

Transversal Competences

Transversals

  • CT2 [Avaluable] - Sustainability and Social Commitment. To know and understand the complexity of economic and social phenomena typical of the welfare society; Be able to relate well-being to globalization and sustainability; Achieve skills to use in a balanced and compatible way the technique, the technology, the economy and the sustainability.
  • CT5 [Avaluable] - Solvent use of information resources. Manage the acquisition, structuring, analysis and visualization of data and information in the field of specialty and critically evaluate the results of such management.
  • CT6 - Autonomous Learning. Detect deficiencies in one's own knowledge and overcome them through critical reflection and the choice of the best action to extend this knowledge.
  • CT8 [Avaluable] - Gender perspective. An awareness and understanding of sexual and gender inequalities in society in relation to the field of the degree, and the incorporation of different needs and preferences due to sex and gender when designing solutions and solving problems.

Basic

  • CB3 - That students have the ability to gather and interpret relevant data (usually within their area of ??study) to make judgments that include a reflection on relevant social, scientific or ethical issues.

Technical Competences

Especifics

  • CE03 - To identify and apply the basic algorithmic procedures of computer technologies to design solutions to problems by analyzing the suitability and complexity of the proposed algorithms.
  • CE04 - To design and use efficiently the most appropriate data types and structures to solve a problem.
  • CE09 - To ideate, design and integrate intelligent data analysis systems with their application in production and service environments.
  • CE15 - To acquire, formalize and represent human knowledge in a computable form for solving problems through a computer system in any field of application, particularly those related to aspects of computing, perception and performance in intelligent environments or environments.
  • CE20 - To select and put to use techniques of statistical modeling and data analysis, assessing the quality of the models, validating and interpreting.

Generic Technical Competences

Generic

  • CG1 - To ideate, draft, organize, plan and develop projects in the field of artificial intelligence.
  • CG2 - To use the fundamental knowledge and solid work methodologies acquired during the studies to adapt to the new technological scenarios of the future.
  • CG3 - To define, evaluate and select hardware and software platforms for the development and execution of computer systems, services and applications in the field of artificial intelligence.
  • CG4 - Reasoning, analyzing reality and designing algorithms and formulations that model it. To identify problems and construct valid algorithmic or mathematical solutions, eventually new, integrating the necessary multidisciplinary knowledge, evaluating different alternatives with a critical spirit, justifying the decisions taken, interpreting and synthesizing the results in the context of the application domain and establishing methodological generalizations based on specific applications.
  • CG6 - To identify opportunities for innovative applications of artificial intelligence and robotics in constantly evolving technological environments.
  • CG7 - To interpret and apply current legislation, as well as specifications, regulations and standards in the field of artificial intelligence.
  • CG8 - Perform an ethical exercise of the profession in all its facets, applying ethical criteria in the design of systems, algorithms, experiments, use of data, in accordance with the ethical systems recommended by national and international organizations, with special emphasis on security, robustness , privacy, transparency, traceability, prevention of bias (race, gender, religion, territory, etc.) and respect for human rights.
  • CG9 - To face new challenges with a broad vision of the possibilities of a professional career in the field of Artificial Intelligence. Develop the activity applying quality criteria and continuous improvement, and act rigorously in professional development. Adapt to organizational or technological changes. Work in situations of lack of information and / or with time and / or resource restrictions.

Objectives

  1. Learn the main methods of machine learning, and how to use them appropriately.
    Related competences: CG1, CG2, CG3, CG4, CG8, CT2, CT5, CB3, CE03, CE04, CE09, CE15, CE20,
  2. Interact in a critical and prudent manner with data and machine learning models
    Related competences: CG1, CG4, CG7, CG8, CG9, CT6, CT8, CE04,
    Subcompetences:
    • Keep a critic and skeptic view of model behavior
    • Identify biases in data
  3. Recognize in an easy manner the characteristics of a problem from the perspective of machine learning
    Related competences: CG1, CG2, CG4, CG6, CG9, CT5, CE03, CE04, CE09, CE15,
    Subcompetences:
    • Propose the most appropriate learning types for a problem
    • Identify analysis of relevance to be conducted on a data set

Contents

  1. Intro to machine learning.
    Basic tipes of learning. What can they be used for, purposes and main limitations. Includes a set of warnings and sanity checks to keep in mind while working with machine learning.
  2. Experimental design in machine learning
    Using data for learning. How to design, execute and evaluate experiments conducted with machine learning techniques.
  3. Data preprocessing
    Distributions, normalization and standardization of data. How and why to prepare data to be processed by machine learning algorithms.
  4. Applied Regression
    Practical cases of regression
  5. Dimensionality reduction
    Review of the main methods to reduce the dimensionality of data: PCA, UMAP, T-SNE, ...
  6. Classification: Basic concepts and review of basic methods
    Distance measurements are studied and related to the concept of similitude, which then allows us to build and compare a large number of methods. Revision of the K-Nearest Neighbor as a simple framework and extension to other methods.
  7. Classification methods based on other criteria
    Support Vector Machines, Neural Networks (classic architectures) and Decision Trees.
  8. Muilticlassification
    The main methods of combining "weak" learning methods are studied in order to obtain more robust models: Boosting, Bagging, GAMs, EBMs, Sets
  9. Explainability
    Relevance, use and methods of explicability. Several methods are studied in order to interpret and explain the operation and result of machine learning algorithms, a basic need for the implementation and acceptance of these methods. The foundations for Explainable AI (Explainable Artificial Intelligence) are being laid.
  10. Clustering
    The bases of the classical methods of obtaining significant data sets in the absence of class information and / or prior structures are reviewed. K-means, Hierarchical Clustering, Spectral Clustering, DBSCAN.
  11. Genetic algorisms
    Introduction to genetic algorithms, as a first vision of bio-inspired learning methods. The conceptual and mathematical bases of the main mutation and crossover operators and their representational variants are reviewed.
  12. Machine learning in graphs
    The structure of the graph is widespread in various environments and has donated to a whole discipline, the Science of the Xarxes, on it is work on the structural properties of the graphs to derive properties and conclusions about the phenomenon or field that is studied. This type of learning is especially important in internet applications, near or recovery applications or knowledge detection. Detection of communities, prediction of accidents, etc.

Activities

Activity Evaluation act


Introduction to Machine Learning

Types, purposes and limitations
Objectives: 1 2
Contents:
Theory
6h
Problems
0h
Laboratory
0h
Guided learning
0h
Autonomous learning
6h

Data preprocessing and manipulation


Objectives: 1 3
Contents:
Theory
4h
Problems
0h
Laboratory
6h
Guided learning
0h
Autonomous learning
12h

Theory
12h
Problems
0h
Laboratory
20h
Guided learning
0h
Autonomous learning
18h

Other aspects of machine learning


Objectives: 1 2 3
Contents:
Theory
4h
Problems
0h
Laboratory
4h
Guided learning
0h
Autonomous learning
10h

First Mid-term Exam


Objectives: 1
Week: 8
Type: theory exam
Theory
2h
Problems
0h
Laboratory
0h
Guided learning
0h
Autonomous learning
12h

Final


Objectives: 1
Week: 15 (Outside class hours)
Type: theory exam
Theory
2h
Problems
0h
Laboratory
0h
Guided learning
0h
Autonomous learning
12h

Lab evaluation


Objectives: 1
Week: 15 (Outside class hours)
Type: lab exam
Theory
0h
Problems
0h
Laboratory
0h
Guided learning
0h
Autonomous learning
10h

Teaching methodology

Interactive classes of theoretical content. Relatively autonomous laboratory sessions of practical contingency.

Evaluation methodology

The course consists of one partial exam (P) and a final exam (F). The laboratory will be evaluated continuously (LC) and through a final delivery (LF).

Final score = (0.2*P) + (0.4*F) + (0.1*LC) + (0.3*LF)

Bibliography

Basic:

Previous capacities

Understand the computing flow within a software system.
Understand the basic concepts behind inference, deduction and evidence based reasoning.
Being familiarized with data distribution, basic data preprocessing, i how numerical variables can represent information.