Advanced Human Languages Technologies

Credits
5
Types
  • MIRI: Specialization complementary (Data Science)
  • MAI: Elective
Requirements
This subject has no prerequisites, but it assumes some previous capacities (see below).
Department
CS;TSC
This course offers in-depth coverage of the main basic tasks of Natural Language Processing. We will present fundamental models and tools to approach a variety of Natural Language Processing tasks, ranging from named entity recognition to syntactic processing and document classification. The course flows along two main axes: (1) computational formalisms to describe natural language processes, and (2) statistical and machine learning methods to acquire linguistic models from large data collections and solve specific linguistic tasks.

Teachers

Person in charge

  • Lluis Padro Cirera

Weekly hours

Theory
1.5
Problems
0.5
Laboratory
1
Guided learning
0
Autonomous learning
5.3333

Competences

Generic Technical Competences

Generic

  • CG3 - Capacity for modeling, calculation, simulation, development and implementation in technology and company engineering centers, particularly in research, development and innovation in all areas related to Artificial Intelligence.

Technical Competences of each Specialization

Academic

  • CEA3 - Capability to understand the basic operation principles of the main Machine Learning techniques, and to know how to use them in the environment of an intelligent system or service.
  • CEA5 - Capability to understand the basic operation principles of the main Natural Language Processing techniques, and to know how to use them in the environment of an intelligent system or service.

Transversal Competences

Teamwork

  • CT3 - Ability to work as a member of an interdisciplinary team, either as a regular member or performing management tasks, in order to develop projects with pragmatism and a sense of responsibility, making commitments that take the available resources into account.

Reasoning

  • CT6 - Capability to evaluate and analyze, in a reasoned and critical way, situations, projects, proposals, reports and scientific-technical surveys, and to argue the reasons that explain or justify such situations, proposals, etc.

Analysis and synthesis

  • CT7 - Capability to analyze and solve complex technical problems.

Basic

  • CB6 - Ability to apply the acquired knowledge and capacity for solving problems in new or unknown environments within broader (or multidisciplinary) contexts related to their area of study.
  • CB8 - Capability to communicate their conclusions, and the knowledge and rationale underpinning these, to both skilled and unskilled public in a clear and unambiguous way.
  • CB9 - Possession of the learning skills that enable the students to continue studying in a way that will be mainly self-directed or autonomous.

Objectives

  1. Learn to apply statistical methods for NLP in a practical application
    Related competences: CEA3, CEA5, CEA7, CEA10, CT3, CB6, CB8
  2. Understand statistical and machine learning techniques applied to NLP
    Related competences: CEA3, CG3, CEP3, CT6, CT7, CB6
  3. Develop the ability to solve technical problems related to statistical and algorithmic problems in NLP
    Related competences: CEA3, CEA5, CEA7, CEA10, CG3, CT7, CB6, CB8, CB9
  4. Understand fundamental methods of Natural Language Processing from a computational perspective
    Related competences: CEA5, CEP4, CT7, CB6

Contents

  1. Statistical Models for NLP
    Introduction to statistical modelling for language. Maximum likelihood models and smoothing. Maximum entropy estimation. Log-linear models.
  2. Distances and Similarities
    Distances (and similarities) between linguistic units. Textual, semantic, and distributional distances. Semantic spaces (WordNet, Wikipedia, Freebase, DBpedia).
  3. Sequence Prediction
    Prediction over word sequences: PoS tagging, NERC. Local classifiers, HMMs, global predictors, log-linear models.
  4. Syntactic Parsing
    Parsing constituent trees: PCFG, CKY vs Inside/outside
    Parsing dependency trees: CRFs for parsing. Earley algorithm
  5. Word Embeddings
    Static word embeddings: Word2Vec, GloVe
    Limitations and the need for contextual embeddings
  6. Recurrent Neural Networks
    RNNs for Language Modeling and sequence labeling
    Bottleneck problem. LSTMs
    Vanishing gradient problem
    LSTM-based word embeddings: ELMo
  7. Convolutional Neural Networks
    CNNs for NLP. 1D kernels vs 2D kernels.
    Stride, padding
    Pooling
    NLP tasks suitable for CNNs vs RNNs
  8. Transformers
    Vanishing gradient problem in RNN/LSTM
    Attention
    Transformer architecture
  9. Large Language Models
    Large Language Models: origin and evolution
    Reinforcement Learning from Human Feedback
    Foundational vs. instruction-tuned LLMs
    Use of LLMs in NLP applications: zero-shot, few-shot, fine-tuning
    Optimization and efficiency issues
  10. Ethics - Limitations and Risks of LLMs
    Biases
    Hallucinations
    Security
    Environmental costs
    Social costs

Activities


Statistical Language Models

Introduction to statistical modelling for language. Maximum likelihood models and smoothing. Maximum entropy estimation. Log-linear models (see the sketch below).
Objectives: 4 2
Contents:
Theory
2.5h
Problems
0.5h
Laboratory
0h
Guided learning
0h
Autonomous learning
0h
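
As a flavour of this block, here is a minimal Python sketch of a maximum likelihood bigram model with add-alpha (Laplace) smoothing. The toy corpus and the smoothing constant are illustrative assumptions, not course material.

    from collections import Counter

    def bigram_model(corpus, alpha=1.0):
        # Smoothed bigram probabilities P(w2 | w1): maximum likelihood
        # counts with add-alpha (Laplace) smoothing.
        unigrams = Counter(corpus)
        bigrams = Counter(zip(corpus, corpus[1:]))
        vocab_size = len(unigrams)

        def prob(w1, w2):
            # (count(w1 w2) + alpha) / (count(w1) + alpha * |V|)
            return (bigrams[(w1, w2)] + alpha) / (unigrams[w1] + alpha * vocab_size)

        return prob

    # Toy usage: probability of "language" following "natural"
    corpus = "natural language processing deals with natural language".split()
    p = bigram_model(corpus)
    print(p("natural", "language"))  # 3/7 with alpha = 1 on this toy corpus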

Distances and Similarities

Distances (and similarities) between linguistic units. Textual, semantic, and distributional distances. Semantic spaces (WordNet, Wikipedia, Freebase, DBpedia). Latent Semantic Analysis. Word embeddings (see the cosine-similarity sketch below).
Objectives: 4 2
Contents:
Theory
2h
Problems
0.5h
Laboratory
0h
Guided learning
0h
Autonomous learning
0h
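
The distributional part of this block can be illustrated in a few lines of Python: represent words by co-occurrence counts and compare them with cosine similarity. The tiny count table is invented for the example.

    import numpy as np

    def cosine(u, v):
        # cos(u, v) = u.v / (||u|| ||v||)
        return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

    # Invented co-occurrence counts over three context words: drink, eat, drive
    vectors = {
        "coffee": np.array([8.0, 2.0, 0.0]),
        "tea":    np.array([7.0, 1.0, 0.0]),
        "car":    np.array([0.0, 0.0, 9.0]),
    }
    print(cosine(vectors["coffee"], vectors["tea"]))  # close to 1 (shared contexts)
    print(cosine(vectors["coffee"], vectors["car"]))  # 0.0 (no shared contexts)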

Sequence Prediction

These lectures will present sequence labeling models, an important set of tools used for sequential tasks. We will present them in the framework of structured prediction (later in the course we will see that the same framework is used for parsing and translation). We will focus on machine learning aspects as well as algorithmic aspects, with special emphasis on Conditional Random Fields (a Viterbi decoding sketch follows below).
Objectives: 4 2
Contents:
Theory
2h
Problems
1h
Laboratory
4h
Guided learning
0h
Autonomous learning
8h
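
As an illustration of the decoding side of these models, here is a log-space Viterbi decoder for an HMM tagger; the two-tag, three-word parameters are invented for the example.

    import numpy as np

    def viterbi(obs, pi, A, B):
        # pi: initial log-probs (S,); A: transition log-probs (S, S);
        # B: emission log-probs (S, V); obs: observation indices.
        T, S = len(obs), len(pi)
        delta = np.zeros((T, S))  # best log-prob of any path ending in state j at time t
        back = np.zeros((T, S), dtype=int)
        delta[0] = pi + B[:, obs[0]]
        for t in range(1, T):
            scores = delta[t - 1][:, None] + A + B[:, obs[t]][None, :]
            back[t] = scores.argmax(axis=0)
            delta[t] = scores.max(axis=0)
        path = [int(delta[-1].argmax())]  # follow back-pointers from the best final state
        for t in range(T - 1, 0, -1):
            path.append(int(back[t][path[-1]]))
        return path[::-1]

    # Toy usage: 2 tags, vocabulary of 3 word types, sentence of 3 words
    pi = np.log([0.6, 0.4])
    A = np.log([[0.7, 0.3], [0.4, 0.6]])
    B = np.log([[0.5, 0.4, 0.1], [0.1, 0.3, 0.6]])
    print(viterbi([0, 1, 2], pi, A, B))  # -> [0, 0, 1]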

Syntax and Parsing

We will present statistical models for syntactic structure and, in general, tree structures. The focus will be on probabilistic context-free grammars and dependency grammars, two standard formalisms. We will see the relevant algorithms, as well as methods to learn grammars from data based on the structured prediction framework (a CKY sketch follows below).
Objectives: 4 2
Contents:
Theory
3h
Problems
1h
Laboratory
4h
Guided learning
0h
Autonomous learning
8h
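
One of the relevant algorithms here is CKY; a compact recognizer computing the best-parse probability under a CNF PCFG can be sketched as follows (the two-rule grammar is a toy assumption).

    from collections import defaultdict

    def cky_best(words, lexical, binary, start="S"):
        # lexical: {(A, word): prob}; binary: {(A, B, C): prob} for rules A -> B C
        n = len(words)
        chart = defaultdict(float)  # (i, j, A) -> best prob of A spanning words[i:j]
        for i, w in enumerate(words):
            for (A, word), p in lexical.items():
                if word == w:
                    chart[(i, i + 1, A)] = max(chart[(i, i + 1, A)], p)
        for span in range(2, n + 1):
            for i in range(n - span + 1):
                j = i + span
                for k in range(i + 1, j):  # split point
                    for (A, B, C), p in binary.items():
                        score = p * chart[(i, k, B)] * chart[(k, j, C)]
                        chart[(i, j, A)] = max(chart[(i, j, A)], score)
        return chart[(0, n, start)]

    # Toy grammar: S -> NP VP (prob 1), NP -> "dogs", VP -> "bark"
    print(cky_best(["dogs", "bark"],
                   lexical={("NP", "dogs"): 1.0, ("VP", "bark"): 1.0},
                   binary={("S", "NP", "VP"): 1.0}))  # 1.0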

Word Embeddings

Static word embeddings: Word2Vec, GloVe, FastText (see the nearest-neighbour sketch below).
Objectives: 4 2
Contents:
Theory
2h
Problems
0.5h
Laboratory
0h
Guided learning
0h
Autonomous learning
0h
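
A typical first exercise with static embeddings is a nearest-neighbour lookup by cosine similarity. The sketch below uses random vectors as a stand-in for a pretrained Word2Vec/GloVe table, so its output is arbitrary; with real vectors the neighbours would be semantically related words.

    import numpy as np

    def nearest(word, emb, k=3):
        # Rank all words by cosine similarity to `word`'s embedding.
        words = list(emb)
        W = np.stack([emb[w] for w in words])
        W = W / np.linalg.norm(W, axis=1, keepdims=True)  # unit-normalize rows
        q = emb[word] / np.linalg.norm(emb[word])
        sims = W @ q
        order = sims.argsort()[::-1]
        return [(words[i], float(sims[i])) for i in order if words[i] != word][:k]

    # Random stand-in for a pretrained embedding table (50-dim vectors)
    rng = np.random.default_rng(0)
    emb = {w: rng.normal(size=50) for w in ["cat", "dog", "car", "bus"]}
    print(nearest("cat", emb, k=2))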

Recurrent Neural Networks

Recurrent Neural Networks. Bottleneck problem. LSTMs. Vanishing gradient problem (see the sketch below).
Objectives: 4 2
Contents:
Theory
2h
Problems
1h
Laboratory
4h
Guided learning
0h
Autonomous learning
8h
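
The recurrence at the heart of this block fits in a few lines; the sketch below is a plain (Elman) RNN forward pass with made-up dimensions. The repeated multiplication by the same matrix Whh across time steps is precisely what makes gradients vanish or explode, which motivates the LSTM.

    import numpy as np

    def rnn_forward(xs, Wxh, Whh, bh):
        # Elman RNN: h_t = tanh(Wxh x_t + Whh h_{t-1} + bh)
        h = np.zeros(Whh.shape[0])
        states = []
        for x in xs:
            h = np.tanh(Wxh @ x + Whh @ h + bh)
            states.append(h)
        return states  # one hidden state per time step

    # Toy dimensions: 4-dim inputs, 3-dim hidden state, sequence length 5
    rng = np.random.default_rng(0)
    Wxh, Whh, bh = rng.normal(size=(3, 4)), rng.normal(size=(3, 3)), np.zeros(3)
    xs = [rng.normal(size=4) for _ in range(5)]
    print(len(rnn_forward(xs, Wxh, Whh, bh)))  # 5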

Convolutional Neural Networks

CNNs for NLP. 1D vs. 2D kernels; stride, padding, pooling (see the sketch below).
Objectives: 2
Contents:
Theory
2h
Problems
1h
Laboratory
3h
Guided learning
0h
Autonomous learning
6h
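
The mechanics of stride, padding and pooling can be seen on a toy 1D signal; in NLP the kernel would slide over a sequence of embedding vectors rather than scalars, but the arithmetic is the same.

    import numpy as np

    def conv1d(x, kernel, stride=1, pad=0):
        # 1D convolution (cross-correlation) over a sequence of scalars
        x = np.pad(x, pad)
        k = len(kernel)
        out_len = (len(x) - k) // stride + 1
        return np.array([x[i * stride : i * stride + k] @ kernel
                         for i in range(out_len)])

    def max_pool(x, size=2):
        # Non-overlapping max pooling
        return np.array([x[i:i + size].max()
                         for i in range(0, len(x) - size + 1, size)])

    x = np.array([1.0, 3.0, 2.0, 5.0, 4.0, 0.0])
    feat = conv1d(x, np.array([1.0, -1.0]), stride=1, pad=1)  # local differences
    print(feat)            # [-1. -2.  1. -3.  1.  4.  0.]
    print(max_pool(feat))  # [-1.  1.  4.]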

Transformers

Attention. The Transformer architecture (see the attention sketch below).
Objectives: 3
Contents:
Theory
2h
Problems
1h
Laboratory
0h
Guided learning
0h
Autonomous learning
3h
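
The core operation behind the Transformer is scaled dot-product attention, softmax(Q K^T / sqrt(d_k)) V; a minimal single-head sketch with made-up shapes:

    import numpy as np

    def attention(Q, K, V):
        # Scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V
        d_k = Q.shape[-1]
        scores = Q @ K.T / np.sqrt(d_k)
        w = np.exp(scores - scores.max(axis=-1, keepdims=True))  # stable softmax
        w = w / w.sum(axis=-1, keepdims=True)
        return w @ V

    # Toy: 3 query positions attending over 4 key/value positions, d_k = 8
    rng = np.random.default_rng(0)
    Q, K, V = rng.normal(size=(3, 8)), rng.normal(size=(4, 8)), rng.normal(size=(4, 8))
    print(attention(Q, K, V).shape)  # (3, 8)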

Large Language Models

Large Language Models: origin and evolution. Usage: zero-shot, few-shot, fine-tuning. Optimization and efficiency issues (see the prompt sketch below).
Objectives: 3 1
Contents:
Theory
3h
Problems
1h
Laboratory
0h
Guided learning
0h
Autonomous learning
2h
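
The difference between zero-shot and few-shot usage lies mostly in how the prompt is built; the schematic below uses an illustrative prompt format, not a fixed standard.

    def make_prompt(task, examples, query):
        # Zero-shot when `examples` is empty; few-shot otherwise.
        parts = [task]
        for text, label in examples:
            parts.append(f"Text: {text}\nLabel: {label}")
        parts.append(f"Text: {query}\nLabel:")
        return "\n\n".join(parts)

    task = "Classify the sentiment of each text as positive or negative."
    print(make_prompt(task, [], "The plot was dull."))        # zero-shot
    print(make_prompt(task, [("Great acting!", "positive"),
                             ("Waste of time.", "negative")],
                      "The plot was dull."))                  # few-shot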

Lab project


Objectives: 3 1
Week: 15 (Outside class hours)
Theory
0h
Problems
0h
Laboratory
0h
Guided learning
0h
Autonomous learning
30h

Final exam



Week: 15 (Outside class hours)
Theory
2h
Problems
0h
Laboratory
0h
Guided learning
0h
Autonomous learning
15h

Teaching methodology

The course will be structured around four different linguistic analysis levels: word level, phrase level, sentence level, and document level. Typical NLP tasks and solutions corresponding to each level will be presented.
The first half of the course is devoted to "classical" statistical and ML approaches. The second half revisits the same levels from a deep learning perspective.

Theoretical background and practical exercises will be developed in class.

Finally, students will develop a practical project in teams of two. The goal of the project is to put into practice the methods learned in class and to learn the experimental methodology used in the NLP field. Students have to identify existing components (i.e., data and tools) that can be used to build a system, and run experiments in order to carry out an empirical analysis of some statistical NLP method.

Evaluation methodology

Final grade = 0.5*FE + 0.5*LP

where

FE is the grade of the final exam

LP is the grade of the lab project

Previous capacities

- Although not mandatory, familiarity with basic concepts and methods of Natural Language Processing is strongly recommended.

- Good understanding of basic concepts and methods of Machine Learning.

- Advanced programming skills.