Credits
6
Types
Compulsory
Requirements
This subject has not requirements
, but it has got previous capacities
Department
CS
Web
https://www.cs.upc.edu/~turmo/mud/plan0a6/MUD.html
The flow of the course is along two main axis: (1) computational formalisms to describe natural language processes, and (2) statistical and machine learning methods to acquire linguistic models from large data collections and solve specific linguistic tasks
Teachers
Person in charge
- Jordi Turmo Borrás ( turmo@cs.upc.edu )
Others
- Carlos Escolano Peinado ( carlos.escolano@upc.edu )
- Salvador Medina Herrera ( salvador.medina.herrera@upc.edu )
Weekly hours
Theory
1.5
Problems
0.5
Laboratory
2
Guided learning
0
Autonomous learning
7.11
Competences
Information literacy
Third language
Entrepreneurship and innovation
Basic
Generic
Especifics
Objectives
-
Know and understand basic NLP tasks and their application to text analysis.
Related competences: CT4, CT1, CG2, CE6, CE7, CE11, CB6, CB7, CB10, -
Know, understand, and apply text mining techniques, including entity recognition, sentiment analysis, and document retrieval.
Related competences: CT4, CT5, CE11, CE12, CB6, CB7, CB8, CB9, -
Know, understand, and apply basic principles of deep learning in unstructured data tasks, such as natural language processing, or computer vision.
Related competences: CT4, CT5, CG2, CE6, CE7, CE11, CE13, CB6, CB7, CB8, CB9, CB10,
Contents
-
Natural language processing and its application to text analysis
Introduction: What is NLP and its applications -
natural language processing stages
Text segmentation: sentence splitting, tokenization; morpholigcal analysis, PoS tagging, syntactic parsing -
text classification, text similarity.
Similarity measures for text. String edit based distances. Vector and set distance measures, distributional semantics. Document retrieval.
Text classification: Sentiment analysis -
Information extraction: Entity recognition, relation extraction
-
Deep learning techniques for the analysis of non-structured data
Word embeddings, neural language processing -
Main deep learning architectures for non-structured data
Recurrent NN, Convolutional NN, Transformers
Activities
Activity Evaluation act
Theory
0h
Problems
0h
Laboratory
0h
Guided learning
0h
Autonomous learning
0h
NLP and its applications
Introduction. What is NLP, tasks, components, and applications.Objectives: 1
Contents:
Theory
2h
Problems
0h
Laboratory
0h
Guided learning
0h
Autonomous learning
0h
natural language processing stages
Text segmentation: sentence splitting/tokenization; morphological analysis; PoS tagging; syntactic parsing.Objectives: 1
Contents:
Theory
7.3h
Problems
2.5h
Laboratory
0h
Guided learning
0h
Autonomous learning
0h
Text classification, text similarity
Similarity measures for text. String edit based distances. Vector and set distance measures, distributional semantics. Document retrieval. Text classification: Sentiment analysisObjectives: 2
Contents:
Theory
1.5h
Problems
0.5h
Laboratory
0h
Guided learning
0h
Autonomous learning
0h
Deep learning techniques for the analysis of non-structured data
Word embeddings, neural language processingObjectives: 3
Contents:
Theory
4.5h
Problems
2h
Laboratory
0h
Guided learning
0h
Autonomous learning
0h
Main deep learning architectures for non-structured data
Recurrent NN, Convolutional NN, TransformersObjectives: 3
Contents:
Theory
3.5h
Problems
1.5h
Laboratory
0h
Guided learning
0h
Autonomous learning
0h
Theory
0h
Problems
0h
Laboratory
6h
Guided learning
0h
Autonomous learning
0h
Theory
0h
Problems
0h
Laboratory
0h
Guided learning
0h
Autonomous learning
0h
Teaching methodology
Participative lectures with theoretical and practical contentPractical sessions with student participation for the resolution of exercises related to the course contents
lab project - team work
Consulting sessions
Evaluation methodology
Lab projects 40% + partial exam 30% + final exam 30%Bibliography
Basic
-
Speech and language processing : an introduction to natural language processing, computational linguistics, and speech recognition
- Jurafsky, Dan; Martin, James H,
Prentice Hall,
2008.
ISBN: 9789332518414
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991003460299706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Handbook of natural language processing
- Indurkhya, Nitin ; Damerau, Fred J,
Taylor¬Francis,
2010.
ISBN: 9780429149207
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991004874386806711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Foundations of statistical natural language processing
- Manning, Christopher D; Schütze, Hinrich,
MIT Press,
1999.
ISBN: 0262133601
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991001994779706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
The Oxford handbook of computational linguistics
- Mitkov, Ruslan,
Oxford University Press,
2003.
ISBN: 0198238827
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991002689009706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Handbook of natural language processing
- Indurkhya, Nitin; Damerau, Frederick J,
Chapman and Hall/CRC,
2010.
ISBN: 9781420085938
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991001234699706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
The Handbook of computational linguistics and natural language processing
- Clark, Alexander; Fox, Chris; Lappin, Shalom,
Wiley-Blackwell,
2010.
ISBN: 9781444324044
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991001686059706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Natural language processing with deep learning
- Manning, C.; See, A,
Stanford University,
2022.
-
Natural language processing
- Collins, M,
Columbia University,
Previous capacities
Advanced skills on python programmingMath and statistics skills to the level of an engineering/tech/science university degree
Fundamentals of machine learning