NLP for economic decisions
ECTS : 3
Description du contenu de l'enseignement :
General introduction: aims and uses of NLP
Part I: translating texts into quantities
- Tokenization, preprocessing
- Word frequency and inverse-document frequency (tf and tf-idf)
- Word embeddings
Part II: Traditional NLP methods
Methods used before LLMs
- Text classification
- Topic modelling and text summarization
- Sentiment analysis
Part III: NLP with LLMs
- Short presentation of LLMs: transformers, encoders-only models, decoders-only models
- Text classification with LLM
- Topic modelling and text summarization
- Sentiment analysis
- Generating prompt for generative LLM
- Fine-tuning of LLM
Compétence à acquérir :
At the end of this course, the students should:
- have a good knowledge of the main methods used in Natural Language Processing
- be able to implement these methods to data with Python
- be able to interpret the results obtained through these examples
Mode de contrôle des connaissances :
The evaluation is based on a project made by groups of two students
Bibliographie, lectures recommandées :
- Albrecht, Jens, Ramachandran, Sidharth, Winkler, Christian. Blueprints for text analytics using Python machine learning-based solutions for common real world (NLP) applications, O'Reilly, 2020
- Alammar Jay and Maarten Grootendorst. Hands-On Large Language Models, Language Understanding and Generation, O'Reilly, 2024
- Useful website: Kaggle