Retour

Reinforcement learning

ECTS : 4

Volume horaire : 27

Description du contenu de l'enseignement :

Compétence à acquérir :

Reinforcement Learning (RL) refers to scenarios where the  learning algorithm operates in closed-loop, simultaneously using past  data to adjust its decisions and taking actions that will influence  future observations. Algorithms based on RL concepts are now commonly  used in programmatic marketing on the web, robotics or in computer game  playing. All models for RL share a common concern that in order to  attain one's long-term optimality goals, it is necessary to reach a  proper balance between exploration (discovery of yet uncertain  behaviors) and exploitation (focusing on the actions that have produced  the most relevant results so far).

The methods used in RL draw ideas from control, statistics and  machine learning. This introductory course will provide the main  methodological building blocks of RL, focussing on probabilistic methods  in the case where both the set of possible actions and the state space  of the system are finite. Some basic notions in probability theory are  required to follow the course. The course will imply some work on simple implementations of the algorithms, assuming familiarity with Python.

Mode de contrôle des connaissances :

Bibliographie, lectures recommandées :

Bibliographie, lectures recommandées

Université Paris Dauphine - PSL - Place du Maréchal de Lattre de Tassigny - 75775 PARIS Cedex 16 - 21/11/2024