Machine Learning with Python

Create powerful machine-learning applications with Python and scikit-learn.


Machine learning has become an essential component in many applications and projects that involve data. With the power of Python and the scikit-learn package, this exciting field is no longer exclusive to large companies with extensive research teams. If you use Python, even as a beginner, machine learning applications are limited only by your imagination.

During this workshop, we will take a hands-on approach to learning about machine learning algorithms. Topics include: regression, classification, outlier detection, dimensionality reduction, and clustering. During two days, we’ll explore various algorithms such as linear regression, logistic regression, decision trees, neural networks, and many more.

By the end of this workshop you’ll confidently select and employ machine learning algorithms using Python and scikit-learn. You’ll have gained a new understanding of the inner workings of machine learning algorithms and know how to leverage them to produce valuable results and insights.

About your instructor

Jeroen Janssens
Principal Instructor, Data Science Workshops

Jeroen is an RStudio Certified Instructor who enjoys visualizing data, building machine learning models, and automating things using either Python, R, or Bash. Previously, he was an assistant professor at Jheronimus Academy of Data Science and a data scientist at Elsevier in Amsterdam and various startups in New York City. He is the author of Data Science at the Command Line. Jeroen holds a PhD in machine learning from Tilburg University and an MSc in artificial intelligence from Maastricht University.

What you’ll learn

  • The fundamental concepts behind machine learning
  • An overview of various machine learning algorithms
  • How to use Jupyter Notebook, Python, and the scikit-learn package to perform machine learning
  • How to apply supervised machine learning, such as regression and classification
  • How to apply unsupervised machine learning, such as dimensionality reduction, clustering, and outlier detection

This workshop is for you because

  • You’re a programmer who wants to see what machine learning is all about, and how to apply it using Python and the scikit-learn package
  • You’re a data analyst who wants to leverage the power of machine learning to build new insights from their data
  • You want to take your machine learning knowledge to the next level and move beyond a “black box” understanding


Day 1:

  • Machine learning fundamentals
    • Features and labels
    • Training and testing
    • Types of machine learning
  • Scikit-learn API
    • Overview of modules and classes
    • Common process
  • Unsupervised machine learning
    • Outlier detection
    • Clustering
  • Feature engineering
    • Principal Components Analysis
    • Feature selection
    • One-hot encoding
    • Bag-of-words representation
    • TF-IDF

Day 2:

  • Classification
    • K-Nearest Neighbour
    • Decision Tree Classifier
    • Random Forest
    • Neural Network
  • Regression
    • Linear Regression
    • Polynomial Regression
    • Support Vector Regression
  • Model evaluation
    • Measuring performance
    • Overfitting and underfitting
    • Cross validation
    • Model selection
    • Pipelines and grid search
  • Where to go from here?


We’ve previously delivered this workshop at:

Jheronimus Academy of Data Science
eHealth Africa


You’re expected to have some experience with programming in Python. Our workshop Introduction to Programming in Python is one option that can help you with that. Roughly speaking, if you’re familiar with the following Python syntax and concepts, then you’ll be fine:

  • assignment, arithmetic, boolean expression, tuple unpacking
  • bool, int, float, list, tuple, dict, str, type casting
  • in operator, indexing, slicing
  • if, elif, else, for, while
  • range(), len(), zip()
  • def, (keyword) arguments, default values
  • import, import as, from import ...
  • lambda functions, list comprehension
  • JupyterLab or Jupyter Notebook

We’re going to use Python together with JupyterLab and the scikit-learn package. The recommended way to get everything set up is to download and install the Anaconda Distribution.

Photos and testimonials

Aboubacar Sidiki Douno
Senior Software Engineering Manager, eHealth Africa

Before the six-day workshop with Data Science Workshops, our team of engineers only had some theoretical knowledge of Data Science and we primarily used costly tools such as Tableau to do data analysis. However, after four days of interactive hands-on sessions with Jeroen, we were able to use Python, our preferred programming language at eHealth Africa, to analyse our data, create some amazing visualisations and even start making machine learning predictions. We moved from theory to real application in a very short period of time, making this workshop extremely valuable. I highly recommend Data Science Workshops.

Charles Verstegen
Head of Partner Sales and Data & Analytics, Transavia

Data Science Workshops facilitated a data hackathon for the data team of Transavia. They made sure it was inspiring, helpful, and leading to valuable insights in the way of working with Python for multiple projects and analyses that Transavia is currently implementing.

Sign up

One upcoming date:
Jun 30–Jul 1, 2021
No more spots left
We can also organise this hands-on workshop as an online training for your team. Learn more.