Data Analyst Roadmap 2026: From Zero to Job-Ready

    A free, step-by-step data analyst roadmap for 2026. SQL, Excel, Power BI, Tableau, Python, statistics, dbt, GA4, and the portfolio + interview prep you need to land your first or second analyst role.

    ✓ Expert-Designed Learning Path• Industry-Validated Curriculum• Real-World Application Focus

    This roadmap was created by data engineering professionals with 47 hands-on tasks covering production-ready skills used by companies like Netflix, Airbnb, and Spotify. Master SQL, Excel, Power BI and 6 more technologies.

    How long does it take? Most career-changers complete this roadmap in 6-9 months studying part-time (10-15 hours/week), or about 3-4 months full-time. The 11 sections contain 47 hands-on tasks.

    The 11 steps: (0) Prerequisites and the Analyst's Role · (1) SQL Fundamentals · (2) Excel and Spreadsheets · (3) Statistics for Analysts · (4) Python for Data Analysis · (5) Data Visualization with Power BI · (6) Tableau for Portfolio and Storytelling · (7) Modern Data Stack Literacy · (8) Product and Web Analytics · (9) Experimentation and Business Insight · (10) Portfolio and Job Search.

    Beginner to Intermediate
    11 sections • 47 tasks

    Skills You'll Learn

    • SQL
    • Excel
    • Power BI
    • Tableau
    • Python
    • Statistics
    • A/B testing
    • Data storytelling

    Tools You'll Use

    • SQL
    • Excel
    • Power BI
    • Tableau
    • Python
    • pandas
    • Jupyter
    • dbt
    • GA4

    Projects to Build

    • Tourism Recovery Dashboard (SQL + Power BI)

      Answer a real business question end to end: load Eurostat regional tourism data into DuckDB, model the metrics in SQL, and ship a one-page Power BI dashboard explaining where tourism recovered fastest between 2022 and 2025.

    • Airbnb Listings EDA (Python + pandas)

      Clean a real, messy Inside Airbnb listings dataset, run an exploratory analysis in a Jupyter notebook, and ship the result on GitHub with a README and one publication-quality chart.

    Step 0: Prerequisites and the Analyst's Role

    -Understand what a data analyst does day to day, and how the role differs from data scientist, data engineer, and analytics engineer
    -Set up your workstation: code editor (VS Code), terminal, a GitHub account, and a free SQL playground (DBeaver + Postgres, or DB Fiddle in the browser)
    -Learn the data-team mental model: warehouse → modeling → BI → stakeholder. Where the analyst sits and which numbers you owe to whom
    -Get oriented to the data analyst job market: typical sectors that hire (fintech, e-commerce, SaaS, consumer apps, consulting), the seniority ladder, and how to read a job description for what they really mean

    Step 1: SQL Fundamentals

    -Master SELECT, WHERE, ORDER BY, LIMIT, DISTINCT, IN, BETWEEN, LIKE, IS NULL. Learn the precedence rules and the NULL pitfalls.
    -Learn every JOIN type with concrete examples: INNER, LEFT, RIGHT, FULL OUTER, CROSS. When each one is the right call.
    -GROUP BY, HAVING, COUNT/SUM/AVG/MIN/MAX, and aggregating across multiple grouping columns
    -Subqueries, correlated subqueries, and Common Table Expressions (CTEs with `WITH`) for readable multi-step logic
    -Window functions: ROW_NUMBER, RANK, LAG/LEAD, running totals, PARTITION BY, frame clauses. This is the biggest interview filter for analyst roles.
    -Explore the [SQL fundamentals](/fundamentals/sql) page for deeper dive on syntax and patterns

    Step 2: Excel and Spreadsheets

    -Lookups: VLOOKUP, XLOOKUP, INDEX-MATCH. Know when each one wins and why XLOOKUP usually beats VLOOKUP in 2026.
    -PivotTables, slicers, and dashboard-style summaries built straight from a flat table
    -Spreadsheet hygiene: structured tables, named ranges, conditional formatting, and the Google Sheets crossover for shared work

    Step 3: Statistics for Analysts

    -Descriptive statistics that don't lie: mean, median, mode, variance, standard deviation, percentiles. When the average is misleading and what to report instead.
    -Distributions, sampling, central limit theorem, confidence intervals. Build intuition before formulas.
    -Hypothesis testing fundamentals: t-test, chi-square, p-values, statistical significance vs practical significance
    -Cognitive pitfalls that bite analysts: Simpson's paradox, survivorship bias, p-hacking, multiple comparisons

    Step 4: Python for Data Analysis

    -Install Python, set up a virtual environment, and run your first Jupyter notebook locally
    -pandas essentials: DataFrame, Series, indexing, filtering, groupby, merge, pivot_table
    -Data cleaning in pandas: missing values, type coercion, deduplication, string normalization, datetime parsing
    -Plotting with matplotlib and seaborn, plus a peek at plotly for interactive charts
    -Ship a mini-EDA end to end: load a public dataset, clean it, summarize it, plot three findings, write a one-paragraph conclusion
    -Explore the [Python fundamentals](/fundamentals/python) page if you need a deeper refresher on the language itself

    Step 5: Data Visualization with Power BI

    -Install Power BI Desktop, load a CSV or Excel file, and build your first report with three visuals
    -DAX basics: calculated columns vs measures, SUMX, CALCULATE, FILTER. The 80/20 of DAX for analyst work.
    -Page design and interactivity: slicers, cross-filtering, drillthrough, bookmarks, mobile layout
    -Publish to Power BI Service, share with stakeholders, schedule refresh, and understand workspace permissions

    Step 6: Tableau for Portfolio and Storytelling

    -Create a Tableau Public account, connect to your first dataset, build a viz using calculated fields
    -Level of Detail (LOD) expressions: FIXED, INCLUDE, EXCLUDE. The Tableau-specific power tool recruiters look for.
    -Storytelling dashboard: build a single Tableau Public dashboard around one business question, with title, three supporting visuals, and a clear conclusion

    Step 7: Modern Data Stack Literacy

    -Warehouse fundamentals: row-based vs columnar storage, partitioning, what makes Snowflake and BigQuery fast for analytics queries
    -dbt basics: what analytics engineering is, why analysts read (and increasingly write) dbt models, and a 30-minute hands-on
    -BI tool landscape: Looker, Looker Studio, Metabase. When a company picks each one and what changes for the analyst.
    -Where the analyst fits in: the bridge between dbt models and stakeholder questions. Read the lineage, trust but verify the numbers.
    -Explore the [Data Warehousing fundamentals](/fundamentals/data-warehousing) for the deeper picture

    Step 8: Product and Web Analytics

    -GA4 fundamentals: events, parameters, audiences, the report library, and the BigQuery export that unlocks SQL on raw event data
    -Google Tag Manager essentials: containers, triggers, tags, the dataLayer, and the QA mindset
    -The three product-analytics frames every consumer company uses: funnels, cohorts, retention curves

    Step 9: Experimentation and Business Insight

    -A/B testing end to end: hypothesis, MDE, sample size, randomization, primary vs secondary metrics, guardrails
    -Pick your North Star: how to choose, define, and operationalize a business metric without gaming it
    -Translate analysis into recommendations: pyramid principle, executive summaries, and answering "so what?"
    -Stakeholder communication: managing requests, scoping ambiguous questions, and the polite "no"

    Step 10: Portfolio and Job Search

    -Build a SQL + Power BI portfolio project on a real public dataset. Follow the [Tourism Recovery Dashboard project](/projects/data-analyst-tourism-dashboard) for a scoped, recruiter-ready end-to-end build.
    -Build a Python EDA portfolio project: clean a messy dataset, document the analysis, ship the notebook on GitHub. The [Airbnb EDA project](/projects/data-analyst-airbnb-eda) walks the full cleaning + analysis + writeup loop.
    -Build a Tableau Public story dashboard you would actually present at an interview
    -Tailor your CV and LinkedIn for analyst roles: sectors, quantified outcomes, and how the recruiter screen works
    -Interview prep: walk through SQL live-coding and case studies using the [Interview Prep section](/interview-prep).

    Curriculum Reference

    A free preview of the learning material in this roadmap — the full reference for every section is available when you sign in. Click any task to expand it.

    Step 0: Prerequisites and the Analyst's Role

    Understand what a data analyst does day to day, and how the role differs from data scientist, data engineer, and analytics engineer

    Four titles sit close to each other on the data team. Knowing the differences keeps you from applying to roles that don't match what you actually want.


    Data Analyst

    • Primary job: Answer specific business questions with data.
    • Day to day: SQL queries, dashboards, ad-hoc analysis, stakeholder calls.
    • Tools: SQL, Excel, Power BI / Tableau, GA4, sometimes Python.
    • Output: Reports, dashboards, recommendations.

    Data Scientist

    • Primary job: Build statistical and ML models to predict or classify.
    • Day to day: Notebooks, feature engineering, model training, experiment design.
    • Tools: Python, scikit-learn, pandas, sometimes deep-learning frameworks.
    • Output: Models, papers, A/B test designs.

    Data Engineer

    • Primary job: Build and maintain the pipelines that move data from sources into the warehouse.
    • Day to day: Airflow / dbt-core / Spark, infra, on-call.
    • Tools: Python, SQL, cloud, Docker, orchestrators.
    • Output: Reliable data pipelines, source systems integration.

    Analytics Engineer

    • Primary job: Transform raw data inside the warehouse into clean, modeled tables that analysts can trust.
    • Day to day: dbt models, tests, docs, semantic-layer work.
    • Tools: SQL, dbt, Snowflake / BigQuery, Git.
    • Output: Modeled marts that power dashboards.

    Where to start: Analyst. It has the lowest tooling barrier, the broadest hiring market of the four, and gives you the SQL and business fluency you'll need if you later move into analytics engineering or DS.

    Set up your workstation: code editor (VS Code), terminal, a GitHub account, and a free SQL playground (DBeaver + Postgres, or DB Fiddle in the browser)

    You need four things before Step 1.


    1. Code editor

    Install VS Code. Add these extensions:

    • SQLTools (run SQL inside VS Code)
    • Python (when you reach Step 4)
    • Rainbow CSV (color-coded columns)
    • GitLens (Git history in the editor)

    2. Terminal + Git + GitHub

    Mac and Linux: built-in. Windows: install Git Bash or WSL2. Then git --version should print a version. Create a free GitHub account — you'll push portfolio projects there.

    3. SQL playground

    Two options:

    • In-browser: DB Fiddle — pick PostgreSQL, paste a schema, query. Zero setup.
    • Local: install DBeaver + a local PostgreSQL via Docker (docker run --name pg -e POSTGRES_PASSWORD=postgres -p 5432:5432 -d postgres:16). More realistic for the long run.

    4. A spreadsheet tool

    Excel or Google Sheets. You'll need pivot tables and lookups starting in Step 2.


    Tip: Don't overinvest in tooling. The job is the analysis, not the IDE.

    Learn the data-team mental model: warehouse → modeling → BI → stakeholder. Where the analyst sits and which numbers you owe to whom

    Data flows in one direction. Knowing the stops helps you triage problems and own the right slice.

      Source systems         Warehouse           Modeling layer        BI layer         Stakeholder
      (app DB, CRM,    →    (Snowflake,    →    (dbt models,     →    (Power BI,    →   (PM, CEO,
       GA4, Stripe...)       BigQuery)           views, marts)         Tableau,           sales lead,
                                                                        Looker)            ops...)
             ↑                    ↑                    ↑                    ↑                  ↑
          data engineer    analytics engineer    analytics engineer    data analyst    everyone
    

    What the analyst owns

    • The question. Frame the business problem in a way that data can answer.
    • The numbers. SELECT from already-modeled tables. Run analyses, build dashboards.
    • The interpretation. Translate output back into a recommendation.
    • The trust. When a number on a dashboard is wrong, you take the first call — even if the bug is upstream.

    What the analyst does NOT own (most of the time)

    • Pipeline reliability — that's data engineering.
    • The shape of marts and dimensions — that's analytics engineering (you may write some, but governance sits there).
    • The model accuracy of ML systems — that's data science.

    Rule of thumb: If a stakeholder asks "why is this number wrong?", you investigate first and route the bug to the right team. The analyst is the front door of the data team.

    Get oriented to the data analyst job market: typical sectors that hire (fintech, e-commerce, SaaS, consumer apps, consulting), the seniority ladder, and how to read a job description for what they really mean

    Knowing the market shapes how you spend the next 6 months.


    Sectors that hire analysts

    1. Fintech and payments. Fraud, credit risk, AML/compliance analytics. Heavy SQL + Python.
    2. E-commerce and retail. GA4 + product analytics + experimentation dominant.
    3. Consumer apps and travel. Funnels, cohorts, A/B testing, retention curves.
    4. Industrial and energy. More Excel + Power BI flavored, less Python.
    5. Consulting and Big Four. High volume of junior roles but generally thinner tech depth.
    6. SaaS / developer tools. Product analytics, self-serve funnels, Looker / Looker Studio common.

    Two channels for finding roles

    • In-country onsite / hybrid. Local job boards, LinkedIn filtered to your city, company careers pages.
    • Remote-first companies. Often pay through Employer-of-Record providers (Remote.com, Deel) and expect English as the working language.

    Seniority ladder

    • Junior analyst (0-2 yrs). Reporting, ad-hoc SQL, dashboard maintenance. Often comes in via internship.
    • Mid (2-5 yrs). Owns a business area's analytics end to end, drives experiments, mentors juniors.
    • Senior / Lead (5+ yrs). Sets metrics strategy, partners with execs, hires the team.

    How to read a job description

    The "required" list is usually aspirational. Look for what's in the first three bullets under day-to-day work and the last paragraph under interview process. Those are what they actually test.


    Practical: match your CV's working language to the company's. International remote-first → English. Local-market role → the local language too. Recruiters screen on language fluency first.

    Unlock the learning materials for the remaining 10 sections

    Sign in free to open the curated guides, videos and docs for every task — and track your progress as you go.

    Sign in to continue

    Frequently Asked Questions

    How long does it take to become a data analyst?

    Most people complete this roadmap in 6-9 months part-time (10-15 hours/week) or 3-4 months full-time, working through 47 hands-on tasks across 11 sections from SQL fundamentals to portfolio and interview prep.

    What is the difference between a data analyst and a data engineer?

    A data analyst answers business questions and builds dashboards using SQL, Excel, Power BI, and Tableau, sitting between dbt models and stakeholders. A data engineer builds the pipelines and infrastructure that move and store the data the analyst queries.

    Do I need a degree to become a data analyst?

    No. This roadmap builds a portfolio of three real projects: a SQL plus Power BI dashboard, a Python pandas EDA notebook, and a Tableau Public story dashboard. A recruiter-ready portfolio and quantified outcomes matter more than a formal degree.

    What tools should a data analyst learn?

    This roadmap covers SQL, Excel, Power BI, Tableau, and Python with pandas as the core toolkit, plus statistics, dbt, GA4, and Google Tag Manager. SQL and window functions are the biggest interview filter for analyst roles.

    Do data analysts need to know Python?

    Yes, increasingly. This roadmap teaches Python for data analysis with pandas for cleaning, matplotlib and seaborn for plotting, and a full mini-EDA in Jupyter. SQL and Excel remain the daily essentials, with Python rounding out the skill set.

    Sign up for free courses and get early access to AI-powered grading, quizzes, and curated learning resources for each roadmap step.