Pre-launch · runs on Databricks Free Edition

Master PySpark
without a cloud bill.

A hands-on course that runs entirely on Databricks Free Edition, the free environment from Databricks. 10 units plus a mini-Lakehouse project. The smoothest start before the Associate certification.

What you get

Practice-first. No theory dumps.

Every unit is an executable notebook. You run it, change it, break it, fix it.

📓

14 notebooks in English

Executable, commented, version-controlled.

🆓

Free Databricks env

Sign up at databricks.com and run everything.

๐Ÿ—๏ธ

Mini-Lakehouse project

Bronze → Silver → Gold with Delta, end-to-end.

🤖

AI tutor on WhatsApp

Athena answers your questions and cites the relevant lesson.

🎁

Bonus: Python classes

Compose pipelines with reusable class patterns.

🪜

Bridge to the Associate

Designed as the cleanest warm-up.

Syllabus

10 units + a project. Built in order.

Fundamentals first, then a real mini-Lakehouse you can show in interviews.

00a

What Spark, PySpark and DataFrames really are

The mental model behind the whole course. Drivers, executors, lazy evaluation.

00d

Schemas, StructType, StructField and cast

How to declare schemas, why they matter, casting types the right way.

00e

Fundamental DataFrame methods

select, filter, withColumn, lit, when. The daily 80% of data engineering.

00f

Cheat sheet of essential commands

Reference of patterns you'll come back to a thousand times.

00g

How Spark works under the hood

DAG, stages, tasks, shuffles. The "why" behind performance.

01

Setup and your first DataFrames

Free Edition workspace ready. First end-to-end notebook running.

02

Ingestion: CSV, JSON and schemas

Read real data, enforce schema, handle the messy parts.

03

Transformations, joins and aggregations

Inner, left, semi/anti. groupBy + agg. Where the real work happens.

04

Mini Lakehouse: Delta on Databricks Free

Bronze → Silver → Gold with Delta, end-to-end on Free Edition.

05

Bonus: Python classes and PySpark composition

Refactor the project with class-based patterns you'll use forever.
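
For a rough feel of the Bronze → Silver → Gold flow that unit 04 builds, here is a minimal sketch in plain Python rather than PySpark or Delta. The sample records, the field names, and the helper functions `to_silver` and `to_gold` are hypothetical and are not taken from the course notebooks. They only illustrate the layering idea: keep raw data as it arrived, clean and type it, then aggregate it for consumers.

```python
from collections import defaultdict

# Bronze: raw records kept exactly as ingested (hypothetical sample data).
bronze = [
    {"order_id": "1", "country": "ES", "amount": "10.50"},
    {"order_id": "2", "country": "es", "amount": "4.50"},
    {"order_id": "3", "country": "PT", "amount": "not-a-number"},
    {"order_id": "4", "country": "PT", "amount": "7.00"},
]


def to_silver(rows):
    """Clean and type the raw rows; skip rows whose amount cannot be parsed."""
    silver_rows = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            # Assumption: bad rows are simply skipped in this sketch.
            continue
        silver_rows.append(
            {
                "order_id": int(row["order_id"]),
                "country": row["country"].upper(),  # normalize casing
                "amount": amount,
            }
        )
    return silver_rows


def to_gold(rows):
    """Aggregate cleaned rows into a per-country revenue summary."""
    totals = defaultdict(float)
    for row in rows:
        totals[row["country"]] += row["amount"]
    return dict(totals)


silver = to_silver(bronze)
gold = to_gold(silver)
print(gold)
```

On Databricks each layer would typically be a Delta table and each step a DataFrame transformation; the sketch only shows the shape of the flow.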

Access

Inside the Gold Plan

PySpark on Databricks Free is part of our 6-course family in the Gold Plan, alongside Associate, Professional, GenAI, SDP and DP-750.

$149/year, single payment
Join the Gold Plan →
FAQ

Honest answers.

Is the course really free?

The environment is free: Databricks Free Edition lets you run notebooks with no cloud bill. The course content (notebooks, exercises, AI tutor) is a paid product included in our Gold Plan. We use "Free" in the title because the cost barrier, usually cloud compute, is eliminated here.

Do I need to know Python beforehand?

Basic Python helps but isn't required. The course walks you through Python concepts as they come up.

What hardware do I need?

Any laptop with a browser. All compute runs on Databricks' servers; your machine just renders the notebook UI.

What's the bridge to the Associate exam?

After PySpark Free you'll be comfortable with DataFrames, joins, schemas, and a Lakehouse project: exactly what the Associate course assumes. Most students go straight from one to the other.

When will videos be available in English?

The notebooks are already available in English. English video narration will be released in waves through 2026. Athena (our AI tutor) is already fully bilingual.