Advanced Human Language Technologies

Credits

Type

MAI: Elective
MIRI: Elective

Prerequisites

This course has no prerequisites, but it does assume prior skills.

Department

CS; TSC

Web

https://www.cs.upc.edu/~padro/ahlt/ahlt.html

This course offers in-depth coverage of the main basic tasks in Natural Language Processing. We will present fundamental models and tools to approach a variety of Natural Language Processing tasks, ranging from named entity recognition to syntactic processing and document classification. The course is organized along two main axes: (1) computational formalisms to describe natural language processes, and (2) statistical and machine learning methods to acquire linguistic models from large data collections and solve specific linguistic tasks.

Teaching staff

Coordinator

Lluis Padro Cirera

Weekly hours

Theory: 1.5
Problems: 0.5
Laboratory:
Guided learning:
Autonomous learning: 5.3333

Competences

General Technical Competences

Generic

CG3 - Capacity for modeling, calculation, simulation, development and implementation in technology and company engineering centers, particularly in research, development and innovation tasks in all areas related to Artificial Intelligence.

Technical Competences of each Specialization

Academic

CEA3 - Capacity to understand the basic operating principles of the main Machine Learning techniques, and to know how to use them in the environment of an intelligent system or service.
CEA5 - Capacity to understand the basic operating principles of Natural Language Processing techniques, and to know how to use them in the environment of an intelligent system or service.
CEA7 - Capacity to understand the problems, and the solutions to those problems, in the professional practice of applying Artificial Intelligence in business and industrial environments.
CEA10 - Capacity to understand advanced Human-Computer Interaction techniques, and to know how to design, implement and apply these techniques in the development of intelligent applications, services or systems.

Professional

CEP3 - Capacity to apply Artificial Intelligence techniques in technological and industrial environments to improve quality and productivity.
CEP4 - Capacity to design, write and present reports on computer science projects in the specific area of Artificial Intelligence.

Transversal Competences

Teamwork

CT3 - Ability to work as a member of an interdisciplinary team, either as a regular member or in a leadership role, in order to help develop projects with pragmatism and a sense of responsibility, making commitments that take the available resources into account.

Reasoning

CT6 - Capacity to evaluate and analyze, in a reasoned and critical way, situations, projects, proposals, reports and studies of a scientific-technical nature. Capacity to argue the reasons that explain or justify such situations, proposals, etc.

Analysis and synthesis

CT7 - Capacity for the analysis and resolution of complex technical problems.

Basic

CB6 - That students know how to apply the knowledge they have acquired, and their problem-solving ability, in new or unfamiliar environments within broader (or multidisciplinary) contexts related to their field of study.
CB8 - That students know how to communicate their conclusions, and the knowledge and underlying reasons that support them, to specialized and non-specialized audiences clearly and unambiguously.
CB9 - That students possess the learning skills that enable them to continue studying in a way that will be largely self-directed or autonomous.

Objectives

1. Learn to apply statistical methods for NLP in a practical application
   Related competences: CB6, CB8, CT3, CEA10, CEA3, CEA5, CEA7
2. Understand statistical and machine learning techniques applied to NLP
   Related competences: CT6, CT7, CEA3, CEP3, CG3, CB6
3. Develop the ability to solve technical problems related to statistical and algorithmic problems in NLP
   Related competences: CB6, CB8, CB9, CT7, CEA10, CEA3, CEA5, CEA7, CG3
4. Understand fundamental methods of Natural Language Processing from a computational perspective
   Related competences: CB6, CT7, CEA5, CEP4

Contents

1. Statistical Models for NLP
   Introduction to statistical modelling for language. Maximum Likelihood models and smoothing. Maximum entropy estimation. Log-linear models.
2. Distances and Similarities
   Distances (and similarities) between linguistic units. Textual, semantic, and distributional distances. Semantic spaces (WordNet, Wikipedia, Freebase, DBpedia).
3. Sequence Prediction
   Prediction in word sequences: PoS tagging, NERC. Local classifiers, HMM, global predictors, log-linear models.
4. Syntactic Parsing
   Parsing constituent trees: PCFG, CKY vs Inside/Outside.
   Parsing dependency trees: CRFs for parsing. Earley algorithm.
5. Word Embeddings
   Static word embeddings: Word2Vec, GloVe.
   Limitations: the need for contextual embeddings.
6. Recurrent Neural Networks
   RNNs for language modeling and sequence labeling.
   Bottleneck problem. LSTMs.
   Vanishing gradient problem.
   LSTM-based word embeddings: ELMo.
7. Convolutional Neural Networks
   CNNs for NLP. 1D kernels vs 2D kernels.
   Stride, padding.
   Pooling.
   NLP tasks suitable for CNNs vs RNNs.
8. Transformers
   Vanishing gradient problem in RNN/LSTM.
   Attention.
   Transformer architecture.
9. Large Language Models
   Large Language Models: origin and evolution.
   Reinforcement Learning from Human Feedback.
   Foundational vs instructed LLMs.
   Use of LLMs in NLP applications: zero-shot, few-shot, fine-tuning.
   Optimization and efficiency issues.
10. Ethics - Limitations and Risks of LLMs
   Biases.
   Hallucinations.
   Security.
   Environmental costs.
   Social costs.

Activities

Statistical Language Models

Introduction to statistical modelling for language. Maximum Likelihood models and smoothing. Maximum entropy estimation. Log-linear models.
Objectives: 4, 2
Contents:

1. Statistical Models for NLP

Theory: 2.5h
Problems: 0.5h
Laboratory:
Guided learning:
Autonomous learning:

Distances and Similarities

Distances (and similarities) between linguistic units. Textual, semantic, and distributional distances. Semantic spaces (WordNet, Wikipedia, Freebase, DBpedia). Latent Semantic Analysis. Word embeddings.
Objectives: 4, 2
Contents:

2. Distances and Similarities

Theory:
Problems: 0.5h
Laboratory:
Guided learning:
Autonomous learning:

Sequence Prediction

These lectures will present sequence labeling models, an important set of tools that is used for sequential tasks. We will present this in the framework of structured prediction (later in the course we will see that the same framework is used for parsing and translation). We will focus on machine learning aspects, as well as algorithmic aspects. We will give special emphasis to Conditional Random Fields.
Objectives: 4, 2
Contents:

3. Sequence Prediction

Theory:
Problems:
Laboratory:
Guided learning:
Autonomous learning:

Syntax and Parsing

We will present statistical models for syntactic structure, and in general tree structures. The focus will be on probabilistic context-free grammars and dependency grammars, two standard formalisms. We will see relevant algorithms, as well as methods to learn grammars from data based on the structured prediction framework.
Objectives: 4, 2
Contents:

4. Syntactic Parsing

Theory:
Problems:
Laboratory:
Guided learning:
Autonomous learning:

Word Embeddings

Static word embeddings: Word2Vec, GloVe, FastText.
Objectives: 4, 2
Contents:

5. Word Embeddings

Theory:
Problems: 0.5h
Laboratory:
Guided learning:
Autonomous learning:

Recurrent Neural Networks

Recurrent Neural Networks. Bottleneck problem. LSTMs. Vanishing Gradient problem
Objectives: 4, 2
Contents:

6. Recurrent Neural Networks

Theory:
Problems:
Laboratory:
Guided learning:
Autonomous learning:

Convolutional Neural Networks

CNNs for NLP. 1D kernels vs 2D kernels. Stride, padding, pooling.
Objectives: 2
Contents:

7. Convolutional Neural Networks

Theory:
Problems:
Laboratory:
Guided learning:
Autonomous learning:

Transformers

Attention. Transformers
Objectives: 3
Contents:

8. Transformers

Theory:
Problems:
Laboratory:
Guided learning:
Autonomous learning:

Large Language Models

Large Language Models. Origin and evolution. Usage: zero-shot, few-shot, fine-tuning. Optimization.
Objectives: 3, 1
Contents:

9. Large Language Models
10. Ethics - Limitations and Risks of LLMs

Theory:
Problems:
Laboratory:
Guided learning:
Autonomous learning:

Lab project

Objectives: 3, 1
Week: 15 (outside class hours)

Theory:
Problems:
Laboratory:
Guided learning:
Autonomous learning: 30h

Final exam

Week: 15 (outside class hours)

Theory:
Problems:
Laboratory:
Guided learning:
Autonomous learning: 15h

Teaching methodology

The course will be structured around four different linguistic analysis levels: word level, phrase level, sentence level, and document level. Typical NLP tasks and solutions corresponding to each level will be presented.
The first half of the course is devoted to "classical" statistical and ML approaches. The second half of the course revisits the same levels from a deep learning perspective.

Theoretical background and practical exercises will be developed in class.

Finally, students will develop a practical project in teams of two. The goal of the project is to put into practice the methods learned in class and to learn the experimental methodology used in the NLP field. Students have to identify existing components (i.e. data and tools) that can be used to build a system, and run experiments to analyze some statistical NLP method empirically.

Assessment method

Final grade = 0.5*FE + 0.5*LP

where

FE is the grade of the final exam

LP is the grade of the lab project
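
As a quick illustration of the formula above, here is a minimal Python sketch. The function name `final_grade` and the sample grades are hypothetical and chosen only for illustration; the syllabus does not state a grading scale or any rounding rule, so none is applied here.

```python
def final_grade(fe: float, lp: float) -> float:
    """Combine the final exam (FE) and lab project (LP) grades with equal weights."""
    return 0.5 * fe + 0.5 * lp


# Hypothetical grades, for illustration only.
print(final_grade(7.0, 9.0))  # 8.0
```

Since both components carry the same weight, the final grade is simply the average of the exam and project grades.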

Bibliography

Basic:

Handbook of natural language processing - Dale, R.; Moisl, H.; Somers, H. (eds.), Marcel Dekker, 2000. ISBN: 0824790006
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991002071619706711&context=L&vid=34CSUC_UPC:VU1&lang=ca
Handbook of natural language processing - Indurkhya, N.; Damerau, F.J. (eds.), Chapman and Hall/CRC, 2010. ISBN: 9781420085938
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991001234699706711&context=L&vid=34CSUC_UPC:VU1&lang=ca
Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition - Jurafsky, D.; Martin, J.H., Prentice Hall, 2008. ISBN: 9332518416
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991003460299706711&context=L&vid=34CSUC_UPC:VU1&lang=ca
The Oxford handbook of computational linguistics - Mitkov, R. (ed.), Oxford University Press, 2003. ISBN: 0198238827
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991002689009706711&context=L&vid=34CSUC_UPC:VU1&lang=ca
Foundations of statistical natural language processing - Manning, C.D.; Schütze, H., MIT Press, 1999. ISBN: 0262133601
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991001994779706711&context=L&vid=34CSUC_UPC:VU1&lang=ca
Linguistic structure prediction - Smith, N.A., Morgan & Claypool, 2011. ISBN: 9781608454051
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991004001819706711&context=L&vid=34CSUC_UPC:VU1&lang=ca
Natural language processing with deep learning - Manning, C.; See, A., Stanford University.
https://web.stanford.edu/class/archive/cs/cs224n/cs224n.1194/
Natural language processing - Collins, M., Columbia University.
http://www.cs.columbia.edu/~cs4705/
Natural language processing - Titov, I., Universiteit van Amsterdam.
http://ivan-titov.org/teaching/nlp1-15/
Syntactic analysis in language technology: syntactic parsing - Stymne, S.; Lhoneux, Miryam de, Uppsala Universitet, 2017.
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991001688389706711&context=L&vid=34CSUC_UPC:VU1&lang=ca
The handbook of computational linguistics and natural language processing - Clark, A.; Fox, C.; Lappin, S. (eds.), Wiley-Blackwell, 2010. ISBN: 9781444324044
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991001686059706711&context=L&vid=34CSUC_UPC:VU1&lang=ca

Web links

The course website includes lecture slides and links to relevant bibliography and resources: http://www.lsi.upc.edu/~ageno/anlp

Prior skills

- Although not mandatory, familiarity with basic concepts and methods of Natural Language Processing is strongly recommended.

- Good understanding of basic concepts and methods of Machine Learning.

- Advanced programming skills.
