Course : Big Data, technical overview

Synthesis course - 2d - 14h00 - Ref. BAG
Price : 1720 € E.T.

Big Data, technical overview




This summary will introduce you to the challenges and benefits of big data, and the technologies available for its implementation. You'll follow the steps involved in a massive data project, from setting up a big data platform, ingesting and processing the data, to visualizing the results.


INTER
IN-HOUSE
CUSTOM

In person or remote class
Available in English on request

Ref. BAG
  2d - 14h00
1720 € E.T.




This summary will introduce you to the challenges and benefits of big data, and the technologies available for its implementation. You'll follow the steps involved in a massive data project, from setting up a big data platform, ingesting and processing the data, to visualizing the results.


Teaching objectives
At the end of the training, the participant will be able to:
Discover the key concepts of big data
Understanding the technological ecosystem of a big data project
Evaluate techniques for managing massive data flows
Implement statistical analysis models to meet business needs
Discover data visualization tools

Intended audience
Dataminers, statistical researchers, developers, project managers, business intelligence consultants.

Prerequisites
Basic knowledge of relational models, statistics and programming languages. Basic knowledge of business intelligence concepts.

Practical details
Demonstration
Present the Hadoop platform and its basic components, use an ETL to manage data, create analysis models and dashboards.

Course schedule

1
Understanding the key concepts and challenges of big data

  • The origins of big data.
  • The value of data: an important change.
  • Data as raw material.
  • Key market figures worldwide and in France.
  • The challenges of big data: ROI, organization, data confidentiality.
Demonstration
Introduction to big data architecture.

2
Big data technologies

  • Architecture and components of the Hadoop 2 platform.
  • Storage modes (NoSQL, HDFS).
  • How MapReduce and Yarn work...
  • Main Hadoop distributions: Hortonworks, Cloudera, MapR...
  • Technologies: Spark, Storm, Databrick, Azure Machine Learning...
  • How to install a Hadoop platform.
  • Presentation of specific Big Data technologies (Talend, Tableau, QlikView...).
Demonstration
Installation of a complete big data platform.

3
Big data processing

  • How Hadoop Distributed File System (HDFS) works.
  • Import data to HDFS.
  • Data processing with PIG.
  • SQL queries with HIVE.
  • Create massive data flows with an ETL.
Demonstration
Implementation of massive data flows.

4
Data analysis and processing methods for Big Data

  • Exploration methods.
  • Segmentation and classification.
  • machine learning, estimation and prediction.
  • Real time, artificial intelligence.
  • Model implementation.
Demonstration
Introduction to the Spark environment, Jupyter Notebook, R Notebook and Shiny. Setting up machine learning analyses with R, Python and Scala.

5
Data visualization, representing data visually

  • Market-leading solutions.
  • Going beyond static reports.
  • Data visualization and the art of telling numbers in a creative and entertaining way.
  • Measure e-reputation, brand awareness, customer experience and satisfaction...
Demonstration
Presentation and use of a data visualization tool to create dynamic analyses.

6
Conclusion

  • Conditions for success.
  • Synthesis of best practices.
  • Bibliography.


Customer reviews
5 / 5
Customer reviews are based on end-of-course evaluations. The score is calculated from all evaluations within the past year. Only reviews with a textual comment are displayed.
OLIVIER C.
16/12/25
5 / 5

A very rich and instructive course with a lot of concepts that I didn't know. Very rewarding training for a first approach to Big Data, even if there's still a long way to go before you can launch a project.



Publication date : 07/09/2025


Dates and locations
Select your location or opt for the remote class then choose your date.
Remote class

Last places available
Guaranteed date, in person or remotely
Guaranteed session

REMOTE CLASS
2026 : 2 Apr., 26 May, 8 Sep., 17 Nov.

PARIS LA DÉFENSE
2026 : 26 May, 8 Sep., 17 Nov.