š± Beginner Data Engineer Roadmap
Start from zero and become job-ready. A step-by-step learning path covering SQL, Python, ETL basics, cloud fundamentals, and your first data pipeline projects.
This roadmap was created by data engineering professionals with 51 hands-on tasks covering production-ready skills used by companies like Netflix, Airbnb, and Spotify. Master Python, SQL, PostgreSQL and 5 more technologies.
Skills You'll Learn
- SQL
- Python
- ETL fundamentals
- Cloud basics
- Data modeling
- Version control
Tools You'll Use
- Python
- SQL
- PostgreSQL
- Docker
- Git
- DuckDB
- Airflow
- dbt
Projects to Build
- Local Data Engineering Environment with dlt, DuckDB & Jupyter
Set up a local development environment for data processing and analytics using Jupyter notebooks, dlt, and DuckDB. All tools are open-source and run locally.
- Scheduled GitHub ETL with Polars, DLT & DuckDB
Build a scheduled ETL pipeline that extracts GitHub repository data, transforms it with Polars, and stores results in DuckDB
- End-to-End Analytics Platform with DuckDB + Metabase
Build a modern, low-cost analytics stack using DuckDB, Metabase, and GitHub Actions for automated data updates and business-ready dashboards.