Publication date : 07/08/2025

Course : Python Data Science, manipulating and visualizing data

optional TOSA® certification

Practical course - 4d - 28h00 - Ref. IYT
Price : 2240 € E.T.

Python Data Science, manipulating and visualizing data

optional TOSA® certification


Required course

Data analytics is an ever-expanding multidisciplinary field. It relies on scientific methods, algorithms and processes that Python has mastered thanks to a particularly rich ecosystem. Today, it has become the reference language for data analysis, whatever the format. Our training course enables you to get to grips with Python's tools, libraries and modules, and rapidly acquire data science skills using this language.


INTER
IN-HOUSE
CUSTOM

Practical course in person or remote class
Available in English on request

Ref. IYT
  4d - 28h00
2240 € E.T.




Data analytics is an ever-expanding multidisciplinary field. It relies on scientific methods, algorithms and processes that Python has mastered thanks to a particularly rich ecosystem. Today, it has become the reference language for data analysis, whatever the format. Our training course enables you to get to grips with Python's tools, libraries and modules, and rapidly acquire data science skills using this language.


Teaching objectives
At the end of the training, the participant will be able to:
Get an overview of Python's scientific ecosystem
Learn about the essential scientific libraries for data analytics
Be able to manipulate large data sets with Python
Understanding the benefits of datavisualization
Visualizing data with Python

Intended audience
Engineers, developers, researchers, data scientists, data analysts and anyone who wants to learn about the scientific world of Python.

Prerequisites
Python language practice.

Practical details
Hands-on work
Numerous exercises are used to illustrate the topics.
Teaching methods
Active teaching, feedback and demonstrations are provided by the trainer to enable participants to put them into practice more quickly.

Course schedule

1
Introduction to the scientific Python ecosystem

  • Overview of Python's scientific ecosystem: must-have libraries.
  • Know where to find new bookstores and assess their sustainability.
  • The main open source tools and software for data analysis.
  • Why use scientific distribution, Anaconda.
  • Understand the benefits of a virtual environment and know how to use it.
  • IPython interpreter and Jupyter server.
  • Best practices for getting your data analytics project off to a good start with Python.
  • Scientific file formats and libraries for manipulating them.
Hands-on work
Setting up the development environment: installing Anaconda, creating a virtual environment, exporting and duplicating an environment, using Jupyter notebooks.

2
The SciPy Stack

  • The SciPy Stack is the foundation of the essential scientific libraries on which all the others are based.
  • NumPy: numerical calculation and linear algebra (vectors, matrices, images).
  • SciPy, based on NumPy for statistics, functional analysis, geospatial analysis, signal processing, etc.
  • Pandas: tabular data analysis (CSV, Excel, etc.), statistics, pivots, filters, search...
  • Matplotlib: the essential data visualization library.
Hands-on work
Measure the performance of the NumPy installed by your Linux and that of Anaconda. Image processing with NumPy. First plots. Statistical analysis of CSV files. First mapping elements. Fourier transforms.

3
Visualization libraries

  • Overview of Python visualization libraries: 2D/3D, desktop/web, statistics, cartography, big data...
  • Desktop-oriented libraries: Matplotlib, Pandas, Seaborn.
  • Web-oriented libraries: Bokeh, Altair, Plotly...
  • 3D libraries: Plotly, pythreejs, ipyvolume...
  • Cartographic libraries: Cartopy, folium, ipyleaflet, Bokeh, cesiumpy...
  • Big data libraries: datashader, Vaex...
Hands-on work
Multiple exercises with some of the libraries presented. Big data, cartographic, 2D and 3D visualization.

4
Data visualization

  • The benefits of datavisualization
  • Using PyViz and the HoloViz ecosystem.
  • Overview of SuperSet, Mayavi, Paraview and VisIt tools.
Hands-on work
Continue to use visualization libraries and manipulate tools.

5
Scientific file formats and the handling of voluminous data

  • Overview of the main scientific file formats: NetCDF, HDF5, GRIB, JSON, PARQUET, MATLAB, CGNS...
  • Handle big data with Dask, Vaex and Xarray.
Hands-on work
Handle data exceeding GB, read and write NetCDF/HDF5 files. Visualization of climate data, satellite images, creation of videos/graphic animations.


Customer reviews
4,6 / 5
Customer reviews are based on end-of-course evaluations. The score is calculated from all evaluations within the past year. Only reviews with a textual comment are displayed.
HUBERT STÉPHANE S.
14/10/25
5 / 5

Very interesting course. We were able to cover a wide range of topics, and the fact that we were able to do the exercise on our own and then as a group, step by step, really helped us to make progress. Thank you Benjamin!
FRÉDÉRIC P.
14/10/25
4 / 5

Interesting course. The trainer was very educational, clear in his explanations and attentive to questions. I particularly appreciated the good balance between theory and practical exercises.
HOUARI MOHAMED D.
14/10/25
5 / 5

Solid training with a good pedagogical progression. The trainer explains the concepts clearly. Practical exercises are relevant and provide a good basis for getting started in data science.



Dates and locations
Select your location or opt for the remote class then choose your date.
Remote class

Last places available
Guaranteed date, in person or remotely
Guaranteed session

REMOTE CLASS
2026 : 24 Mar., 31 Mar., 26 May, 23 June, 15 Sep., 29 Sep., 24 Nov., 15 Dec.

PARIS LA DÉFENSE
2026 : 31 Mar., 23 June, 29 Sep., 24 Nov.