Scientists from all disciplines are nowadays confronted with the necessity of diving into the world of scientific programming, whether to interpret basic data or make more advanced computation. However, developed code is often found to be sloppy and even buggy, leading to irreproducible research as well as making it difficult to collaborate and share resources.
Therefore, this workshop organized by the Paris-Saclay Center for Data Science aims at presenting, on the one hand, the PyData ecosystem widely used in scientific programming and, on the other hand, the software engineering best practises to achieve efficient and reusable code.
Event organized in collaboration with PROTO204.
Goal: introducing the most important packages for scientific computing and data analysis in Python. The main topics presented will be:
Introduction to the basics of numpy, pandas, and matplotlib;
Introduction to scientific computing toolbox: scipy, statsmodels, and scikit-learn.
Goal: moving from standalone scripts to efficient and reusable code with tutorial on writing modular, documented, tested, and shareable code.
Migrating from scripts to modular code;
Best practices through PEP8 and well documented code;
Test and continuous integration in a nutshell
The 2-day workshop is aimed at graduate students and researchers, who have a basic knowledge of Python or have experience in another scientific programming language R or matlab.
Registration is free but mandatory due to limited space (see the link below to register). Registration will close on June 23.
This workshop will take place in the Saclay region (details will be updated once location is confirmed).
Loïc Esteve, Alexandre Boucaud, Alexandre Gramfort, Balazs Kegl, Guillaume Lemaitre, Bartosz Telenczuk, Joris Van den Bossche, Gaël Varoquaux.