Retour

Mathematics for data science

ECTS : 5

Description du contenu de l'enseignement :

Volume horaire : 
CM : 18h
TD : 18h

Data science relies heavily on mathematical concepts from analysis, linear algebra and statistics. In this course, we will investigate the theoretical foundations of data science, through two axes. The first part of the course will focus on (convex) optimization problems. Optimization is indeed at the heart of the key advances in machine learning, as it provides a framework in which data science tasks can be modeled and solved. The second part of the course will be concerned with statistical tools for data science, that are instrumental in studying the underlying distribution of data. We will cover statistical estimation in connection with regression tasks, as well as concentration inequalities for random vectors and random matrices.


Compétence à acquérir :

Bibliographie, lectures recommandées :

References:

S. Boyd et L. Vandenberghe, Convex optimization (2004)

M. Mahoney, J. C. Duchi, A. C. Gilbert (eds), The mathematics of data (2018)

J. A. Tropp, An introduction to matrix concentration inequalities (2015)

Université Paris Dauphine - PSL - Place du Maréchal de Lattre de Tassigny - 75775 PARIS Cedex 16 - 06/07/2024