SQL for Data Scientists

Introduction

Description will be added soon.

Schedule

  • Understanding databases
    • Definition of database
    • Relational databases vs NoSQL databases
    • Tables, columns, rows, types, and primary keys
    • Normalization
  • Using SQLite
    • Introduction to SQLite
    • Differences between SQLite and other databases
    • Importing and navigating a database
  • SELECT
    • Obtaining and viewing data with SELECT
    • Expressions in SELECT statements
    • Getting unique rows with DISTINCT
    • Getting a sample with LIMIT
  • WHERE
    • Filtering rows with WHERE
    • AND, OR, and IN statements
    • Understanding booleans
    • Handling NULL
    • Working with dates and time
  • GROUP BY and ORDER BY
    • Grouping rows with GROUP BY
    • Aggregate functions
    • Filtering aggregates with HAVING
    • Ordering rows with ORDER BY
  • JOIN
    • Merging multiple tables
    • INNER JOIN
    • LEFT JOIN
    • Other JOIN types
  • Using SQL in a programming language
    • SQL in Python
    • SQL in R
    • SQL from the command line

About your instructor

Jeroen Janssens
Principal Instructor, Data Science Workshops

Jeroen is an RStudio Certified Instructor who enjoys visualizing data, building machine learning models, and automating things using either Python, R, or Bash. Previously, he was an assistant professor at Jheronimus Academy of Data Science and a data scientist at Elsevier in Amsterdam and various startups in New York City. He is the author of Data Science at the Command Line. Jeroen holds a PhD in machine learning from Tilburg University and an MSc in artificial intelligence from Maastricht University.

Clients

We’ve previously delivered this workshop at:

Venturn
KPN

Photos and testimonials

Sign up

One upcoming date:
We can also organise this hands-on workshop as an online training for your team. Learn more.