Machine learning has become an essential component in many applications and projects that involve data. With the power of Python and the scikit-learn package, this exciting field is no longer exclusive to large companies with extensive research teams. If you use Python, even as a beginner, machine learning applications are limited only by your imagination.
During this workshop, we will take a hands-on approach to learning about machine learning algorithms. Topics include: regression, classification, outlier detection, dimensionality reduction, and clustering. During two days, we’ll explore various algorithms such as linear regression, logistic regression, decision trees, neural networks, and many more.
By the end of this workshop you’ll confidently select and employ machine learning algorithms using Python and scikit-learn. You’ll have gained a new understanding of the inner workings of machine learning algorithms and know how to leverage them to produce valuable results and insights.
About your instructor
Jeroen is an RStudio Certified Instructor who enjoys visualizing data, building machine learning models, and automating things using either Python, R, or Bash. Previously, he was an assistant professor at Jheronimus Academy of Data Science and a data scientist at Elsevier in Amsterdam and various startups in New York City. He is the author of Data Science at the Command Line. Jeroen holds a PhD in machine learning from Tilburg University and an MSc in artificial intelligence from Maastricht University.
What you’ll learn
- The fundamental concepts behind machine learning
- An overview of various machine learning algorithms
- How to use Jupyter Notebook, Python, and the
scikit-learnpackage to perform machine learning
- How to apply supervised machine learning, such as regression and classification
- How to apply unsupervised machine learning, such as dimensionality reduction, clustering, and outlier detection
This workshop is for you because
- You’re a programmer who wants to see what machine learning is all about, and how to apply it using Python and the
- You’re a data analyst who wants to leverage the power of machine learning to build new insights from their data
- You want to take your machine learning knowledge to the next level and move beyond a “black box” understanding
- Machine learning fundamentals
- Features and labels
- Training and testing
- Types of machine learning
- Scikit-learn API
- Overview of modules and classes
- Common process
- Unsupervised machine learning
- Outlier detection
- Feature engineering
- Principal Components Analysis
- Feature selection
- One-hot encoding
- Bag-of-words representation
- K-Nearest Neighbour
- Decision Tree Classifier
- Random Forest
- Neural Network
- Linear Regression
- Polynomial Regression
- Support Vector Regression
- Model evaluation
- Measuring performance
- Overfitting and underfitting
- Cross validation
- Model selection
- Pipelines and grid search
- Where to go from here?
We’ve previously delivered this workshop at:
You’re expected to have some experience with programming in Python. Our workshop Introduction to Programming in Python is one option that can help you with that. Roughly speaking, if you’re familiar with the following Python syntax and concepts, then you’ll be fine:
- assignment, arithmetic, boolean expression, tuple unpacking
str, type casting
inoperator, indexing, slicing
def, (keyword) arguments, default values
from import ...
- lambda functions, list comprehension
- JupyterLab or Jupyter Notebook
We’re going to use Python together with JupyterLab and the
scikit-learn package. The recommended way to get everything set up is to download and install the Anaconda Distribution.
Photos and testimonials
“Before the six-day workshop with Data Science Workshops, our team of engineers only had some theoretical knowledge of Data Science and we primarily used costly tools such as Tableau to do data analysis. However, after four days of interactive hands-on sessions with Jeroen, we were able to use Python, our preferred programming language at eHealth Africa, to analyse our data, create some amazing visualisations and even start making machine learning predictions. We moved from theory to real application in a very short period of time, making this workshop extremely valuable. I highly recommend Data Science Workshops.”
“Data Science Workshops facilitated a data hackathon for the data team of Transavia. They made sure it was inspiring, helpful, and leading to valuable insights in the way of working with Python for multiple projects and analyses that Transavia is currently implementing.”