Course : Big Data: Practical methods and solutions for data analysis

Practical course - 5d - 35h00 - Ref. BID
Price : 3660 CHF E.T.

Big Data: Practical methods and solutions for data analysis



Required course



INTER
IN-HOUSE
CUSTOM

Practical course in person or remote class
Available in English on request

Ref. BID
  5d - 35h00
3660 CHF E.T.






Teaching objectives
At the end of the training, the participant will be able to:
Understand the concepts and benefits of Big Data with respect to business challenges
Understand the technological ecosystem needed to carry out a Big Data project
Acquire the technical skills to manage massive, unstructured, complex data flows
Implement statistical analysis models to address business needs
Learn about a data visualization tool for reporting dynamic analyses

Practical details
Hands-on work
Set up a Hadoop platform and its basic components, use an ETL to manage the data, create analysis modules and dashboards.

Course schedule

1
Understanding the concepts and challenges of Big Data

  • Origins and definition of Big Data.
  • Key figures in the international and French markets.
  • The challenges of Big Data: ROI, organization, data privacy.
  • An example of Big Data architecture.

2
Big Data technologies

  • Description of the architecture and components of the Hadoop platform.
  • Storage methods (NoSQL, HDFS).
  • Operating principles of MapReduce, Spark, Storm, etc.
  • Most popular distributions on the market (Hortonworks, Cloudera, MapR, Elastic Map Reduce, Biginsights).
  • Installing a Hadoop platform.
  • Technologies for the data scientist.
Exercise
Exercise

3
Installing a Hadoop Big Data platform (via Cloudera Quickstart or other software).

  • Operating principles of the Hadoop Distributed File System (HDFS).
  • Importing outside data into HDFS.
  • Creating SQL requests with HIVE.
  • Using PIG to process the data.
  • Using an ETL to industrialize the creation of massive data flows.
  • Overview of Talend For Big Data.
Exercise
Operating principles of the Hadoop Distributed File System (HDFS).

4
Importing outside data into HDFS.

  • Creating SQL requests with HIVE.
  • Using PIG to process the data.
  • The principle of ETL (Talend, etc.).
  • Managing massive data streaming (NIFI, Kafka, Spark, Storm, etc.)
Exercise
Implementing massive data flows

5
Big Data Analytics techniques and methods

  • Machine Learning: A component of artificial intelligence.
  • Discovering the three families: Regression, Classification, and Clustering.
  • Data preparation, feature engineering.
  • Generating models in R or Python.
  • Ensemble Learning.
Exercise
Exercise

6
Setting up analyses with the tools studied.

  • Takeaways.
  • Summary of best practices.
  • Bibliography.


Customer reviews
4,2 / 5
Customer reviews are based on end-of-course evaluations. The score is calculated from all evaluations within the past year. Only reviews with a textual comment are displayed.
SÉBASTIEN V.
18/05/26
4 / 5

la formation sur 5jours est trop riche et son interet en terme d’impact réel sur les compétences de l’apprenant dépend justement du niveau de connaissances et d’expérience de l’apprenant ; le rythme soutenu et la quantité de nouvelles informations me concernant ont rendu la capacité à intégrer le sujet dans l’ensemble de sa complexité et surtout de sa diversité très difficile à partir du 3ème jour ; les cas pratiques des jours 3 et 4 ne m’ont pas permis de faire le
BOB B.
08/12/25
4 / 5

Overall, satisfied with the educational activities.
DIDIER M.
08/12/25
4 / 5

A good understanding of the HADOOP and Bigdata ecosystems



Publication date : 08/01/2024


Dates and locations

Last places available
Guaranteed date, in person or remotely
Guaranteed session
From 20 to 24 July 2026 *
FR
Remote class
Registration
From 21 to 25 September 2026 *
FR
Remote class
Registration
From 16 to 20 November 2026 *
FR
Remote class
Registration
From 16 to 20 November 2026
EN
Remote class
Registration

REMOTE CLASS
2026 : 20 July, 21 Sep., 16 Nov., 16 Nov.