2-Day Workshop - Introduction to Data Science in Python

Room Grace Hoper (Inria Saclay Ile-de-France)

Room Grace Hoper

Inria Saclay Ile-de-France

1 Rue Honoré d'Estienne d'Orves 91120 Palaiseau
Guillaume Lemaitre (Inria), Jiaping Liu (Inria), Loic Esteve (Inria), Maria Telenczuk (Inria)


Data science is gaining attention impacting many scientific fields and applications. Data science encompasses a large number of topics such as data mining, data wrangling, data visualisation, pattern recognition, or machine learning.

This workshop intends to give an introduction to some of these topics using Python and the PyData ecosystem. It is not a course on deep learning.


Day 1 - Data wrangling, exploration, and visualisation

Goal: introduce the PyData ecosystem to manipulate, explore, and visualize data.

  • Introduction to the basics of numpy, pandas, and matplotlib.

Day 2 - Machine learning

Goal: introduce the basics of machine learning using the  scikit-learn library.

  • Get familiar with general principles of machine learning;

  • Use these principles by using the scikit-learn library on some toy and real-world data examples.


The 2-day workshop is aimed at graduate students and researchers, who have a basic knowledge of Python or have experience in another scientific programming language such as R or Matlab.

Registration is free but mandatory due to limited space (see the link below to register). Registration will close on November 21.

Material and install

The material of the previous session can be found on the GitHub page of the Paris-Saclay Center for Data Science. Refer to the README file regarding the information to install the required packages which will be used during the workshop. This material will be updated soon.


This workshop will take place in Inria Saclay Ile-de-France (1 Rue Honoré d'Estienne d'Orves, 91120 Palaiseau, room Grace Hoper).


Loïc Esteve, Alexandre Gramfort, Jiaping Liu, Guillaume Lemaitre, Bartosz Telenczuk, Maria Telenczuk, Gaël Varoquaux.