Course : Data mining in practice

Practical course - 3d - 21h00 - Ref. DMP
Price : 2010 € E.T.

Data mining in practice




Data mining involves discovering patterns, correspondences and motifs in a set of numerical or qualitative data. This activity is based on an algorithmic toolkit that will be presented in this course. The data mining approach will be illustrated on several projects, using R.


INTER
IN-HOUSE
CUSTOM

Practical course in person or remote class
Available in English on request

Ref. DMP
  3d - 21h00
2010 € E.T.




Data mining involves discovering patterns, correspondences and motifs in a set of numerical or qualitative data. This activity is based on an algorithmic toolkit that will be presented in this course. The data mining approach will be illustrated on several projects, using R.


Teaching objectives
At the end of the training, the participant will be able to:
Understand the benefits of the data mining approach
Translating and responding to a problem
Learn about the main data mining methods
Identify and use data mining tools
Pose a data mining problem and find the appropriate method
Ability to report results

Intended audience
Researchers, data analysis project managers, data center, marketing or quality managers, database users and business managers, future data scientists.

Prerequisites
Basic knowledge of statistics or knowledge equivalent to that acquired in the course "Descriptive statistics, introduction" (ref. UES).

Course schedule

1
The data mining project

  • The data scientist's problem: from data to information.
  • Vocabulary and concepts.
  • Descriptive exploration of the dataset.
  • Metadata for monitoring data mining projects.
  • Reminders about R software.
Hands-on work
Using R. Descriptive characterization, definition and entry of metadata for a dataset.

2
Data mining techniques

  • Classification-based method: identification of statistical groups of individuals.
  • Association method: highlighting a cause and a consequence.
  • Estimation method: addition of a number or frequency to a data set.
  • The benefits of data mining for processing large volumes of data.
  • Segmentation method: definition of criteria, extension of the classification method and k-means principle.
  • Forecasting methods: the importance of timing and assumptions.
Hands-on work
Understand the different methods depending on the needs expressed.

3
Statistical tools

  • Descriptive methods: correlation, classification, Kohonen networks, association rules.
  • Predictive methods: regression, decision trees, neural networks, K-nearest neighbors.
  • Implementation of k-means classification and CAH (Classification Ascendante Hiérarchique).
  • Principle of supervised methods.
Hands-on work
Put the different methods into practice in R.

4
Data visualization

  • Data visualization objectives.
  • The different types of quantitative data representation.
  • Design dashboards.
Hands-on work
Creating a dashboard with R using quantitative data. Represent quantitative and qualitative data with R.

5
Analysis of qualitative and textual data

  • Specifics of the problem and alternatives (factorial correspondence analysis, contingency table).
  • Introduction to instantiation, pattern, vector and heuristics.
  • How to use a vector, indexing and scoring space.
  • Different types of transformation and processing of a text document.
Hands-on work
Qualitative and textual data processing in R.


Customer reviews
5 / 5
Customer reviews are based on end-of-course evaluations. The score is calculated from all evaluations within the past year. Only reviews with a textual comment are displayed.
ALEXANDRE M.
15/10/25
5 / 5

Very interesting training, even for someone with little or no knowledge of data mining.
FAUSTINE C.
15/10/25
5 / 5

The course met my expectations and the exchanges with the trainer and the other participants were very constructive. The trainer was available and attentive to ensuring that each participant understood what was being said. He also made sure that the participants benefited fully from his professional experience.
SOPHIE D.
15/10/25
5 / 5

Very rich and operational.



Dates and locations
Select your location or opt for the remote class then choose your date.
Remote class

Dernières places
Date garantie en présentiel ou à distance
Session garantie

REMOTE CLASS
2026 : 8 June, 19 Oct.

PARIS LA DÉFENSE
2026 : 8 June, 19 Oct.