This course will introduce to different advanced tecniques in machine learning, data mining and case based reasoning. The machine learning part is oriented to unsupervised learning algorithms for structured (sequences, streams, graphs) and unstructured data. The case based reasoning part os oriented to the design and developements of this systems
Weekly hours
Theory
3
Problems
0
Laboratory
0
Guided learning
0.21
Autonomous learning
5.7
Objectives
To known and use advanced unsupervised machine learning and data mining techniques for application on all the domains of engineering and science
Related competences:
CB7,
CT2,
CT4,
CEA12,
CEA13,
CEP1,
CG1,
CG3,
Contents
Data Mining, a global perspective
Brief introduction to what is Data Mining and Knowledge Discovery, the areas they are related to and the different techniques involved
Unsupervised data preprocessing/transformation
This topic will include different algorithms for unsupervised data preprocessing such as data normalization, discretization, dimensionality reduction and feature extraction (PCA, ICA, SVD, linear and non linear, multidimensional scalling and non negative matrix factorization)
Unsupervised Machine Learning/Numerical Taxonomy
This topic will include classical and current algorithms for unsupervised learning from machine learning and statistics including hierarchical and parititional algorithms (K-means,Fuzzy C-means, Gaussian EM, graph partitioning, density based algorithms, grid based algorithms, unsupervised ANN, affinity propagation, ...)
Semi supervised clustering
This topic will include current semi supervised algorithms for clustering data (based on constraints, based on rules, markov random fields)
Unsupervised methodologies in Knowledge Discovery and Data Mining
This topic will include current trends on knowledge discovery for data mining and big data, (scalability, any time clustering, one pass algorithms, approximation algorithms, distributed clustering, ..)
Association Rules
This topic will include and introduction to association rules algorithms and their relationship with unsupervised learning algorithms and clustering
Mining sequential and structured data
This topic will include algorithms for unsupervised learning with sequential data and structured data, such as mining frequent sequences, strings, time series clustering and frequent motifs, clustering data streams, clustering graphs and social networks and discovering frequent subgraphs
Fundamentals of Case-Based Reasoning
Cognitive theories. Basic cycle of CBR reasoning. Academic Demosntrators.
CBR System Components
Case Structure. Case Library Structure. Retrieval. Adaptation/Reuse. Evaluation/Repair. Learning/Retain
CBR Application
A complex real-world example. OPENCASE/GESCONDA-CBR: a domain-independent CBR System .
CBR Development Problems
Competence. Space Efficiency. Time Efficiency.
Reflective/Introspective Reasoning in CBR
Introspection reasoning. Case Base maintenance.