Course : Data Science, the fundamentals

Synthesis course - 2d - 14h00 - Ref. DTX
Price : 1720 € E.T.

Data Science, the fundamentals



Required course

A major strategic challenge for organizations, data science uses mathematical tools to reveal the behavior of data and analyze the events they describe. This course covers the fundamentals of data science and provides an introduction to data analysis.


INTER
IN-HOUSE
CUSTOM

Synthesis course in person or remote class
Disponible en anglais, à la demande

Ref. DTX
  2d - 14h00
1720 € E.T.




A major strategic challenge for organizations, data science uses mathematical tools to reveal the behavior of data and analyze the events they describe. This course covers the fundamentals of data science and provides an introduction to data analysis.


Teaching objectives
At the end of the training, the participant will be able to:
Understand the basic principles of data science and how to organize the process
Understand the application of data science to solve problems and its limitations
Develop your ability to analyze and interpret figures through graphical representation
Understand how to use data science tools and develop models for professional use
Opening up to the challenge of exploiting data in a competitive and continuous improvement context
Understanding the organization and infrastructure for data science services and projects

Intended audience
IS managers, data analysis project managers, statistical research managers.

Prerequisites
No special knowledge required.

Practical details
Hands-on work
Guided practice of the fundamentals through exercises. MCQs and summary tables help you to position yourself.
Teaching methods
During this summary course, the trainer gives demonstrations that each participant reproduces to put the main concepts into practice.

Course schedule

1
What is data science?

  • Fundamentals: big data, data lake, data mining, artificial intelligence, machine and deep learning, text mining.
  • New challenges: the emergence and multiplication of new data sources.
  • Take into account data heterogeneity, real-time flows and the explosion of data volumes.
  • The big data technological ecosystem.
  • Demystify the world of data science: descriptive, predictive and prescriptive analysis.
  • The job, tools and methods of the data scientist.
  • Introduction to machine learning, supervised analysis and unsupervised analysis.
  • Notions of over- and underlearning.
Demonstration
Use cases for data science in a business value chain (customer behavior, product offering, etc.).

2
Data science methods and models

  • Data collection, preparation and exploration.
  • The importance of the data quality approach (cleaning, transforming, enriching).
  • Definition of metrics.
  • Basic statistical methods.
  • The main classes of supervised algorithms: decision trees, K-nearest neighbors, regression, Naive Bayes.
  • The main classes of unsupervised algorithms: clustering, PCA, CAH, neural networks.
  • Text mining and other families of algorithms.
Storyboarding workshops
Simple analyses with R or Python to illustrate supervised (regression and classification) and unsupervised (clustering, segmentation and anomaly detection) analysis techniques.

3
Graphical representation and data retrieval

  • R and Python statistical analysis languages.
  • Their development environments (R-Studio, Anaconda, PyCharm) and libraries (Pandas, machine learning).
  • DataViz tools (Power BI, Qlik, tableau, etc.).
  • Data modeling: representation of processes, flows, controls and conditions.
  • Data modeling: tools (Orange, Power BI).
  • Communicate results through data storytelling: organize visuals (diagrams, rankings, maps).
  • Communicating results through data storytelling: conveying the meaning of results.
Storyboarding workshops
Exercises in graphical data exploration, analyzing the position and extent of data (clouds, histograms, etc.).

4
Modeling a data science problem

  • Summary of the process.
  • Analysis of two business cases, customer relations and fraud detection for example, but others are also possible.
  • Case study 1: customer relations in the insurance industry.
  • Cibler les campagnes marketing. Comprendre les causes d’attrition client. Quels produits pour quels clients ?
  • Case study 2: fraud detection.
  • Compare research using classical statistics and data mining.
  • Detection by supervised method. Unsupervised detection.
Case study
Practical application of the storytelling method to business cases.


Customer reviews
4,3 / 5
Customer reviews are based on end-of-course evaluations. The score is calculated from all evaluations within the past year. Only reviews with a textual comment are displayed.
JULES B.
16/12/25
4 / 5

très bien mais un peu court. Il a manqué de partie TD/atelier technique
ELISE V.
16/12/25
5 / 5

Première approche de la Data Science très complète.
DIEGO L.
16/12/25
5 / 5

Le formateur sait vulgariser les sujets techniques complexes, en s’appuyant sur des exemples simples.



Dates and locations
Select your location or opt for the remote class then choose your date.
Remote class

Dernières places
Date garantie en présentiel ou à distance
Session garantie

REMOTE CLASS
2026 : 24 Mar., 14 Apr., 26 May, 23 June, 15 Sep., 29 Sep., 13 Oct., 3 Dec., 8 Dec.

PARIS LA DÉFENSE
2026 : 24 Mar., 26 May, 23 June, 29 Sep., 13 Oct., 8 Dec.