Training ProRail’s DataLab in Spark, Python, Linux, and Machine Learning

  • Industry: Transportation
  • Founded: 2003
  • Headquarters: Utrecht, the Netherlands
  • Employees: 3,000+

Utrecht, the Netherlands, 17 July 2017. ProRail and Nederlandse Spoorwegen each have a Hadoop cluster to analyse the vast amounts of data they produce and collect. To make the most of these clusters, thirteen of their data scientists participated in our PySpark training.

Over three days, we covered the Spark architecture, the RDD and DataFrame APIs, and machine learning. To put this new knowledge to the test, we worked through two real-world use cases based on the participants' own data. It’s wonderful to see these two organisations learning and working together.

Laurens Koppenol
Lead Data Scientist, ProRail

Our DataLab team enjoyed a three-day PySpark course from Jeroen. Jeroen’s approach is personal and professional. I recommend Data Science Workshops to anyone in the field of data science.