
Overview
This skill path prepares you for the Databricks Data Engineer Associate certification, equipping you with the skills to design and implement scalable data solutions in Databricks. You’ll start with foundational concepts, including data ingestion, transformation, and storage in the Databricks environment. Topics cover critical areas like building ETL pipelines, optimizing performance, and ensuring data quality. Through hands-on exercises and projects, you’ll master using Delta Lake, Spark SQL, and Databricks Workflows to construct and maintain efficient data pipelines.
By the end of this path, you’ll be ready to confidently apply your knowledge and take the Databricks Data Engineer Associate certification exam.
Skill path
MasterclassNeed for Pyspark to Build ELT Pipelines
e-commerce
general
approach
data-storage
data-quality
data-wrangling
batch-etl
data-governance
databricks
spark
data-understanding
python
40m
MasterclassIntroduction to Databricks: The Unified Analytics Platform
general
e-commerce
approach
data-understanding
data-storage
data-wrangling
batch-etl
databricks
spark
cloud-management
01h 25m
ScenarioDatabricks Architecture, Cluster & Lakehouse: Practice Questions
e-commerce
approach
data-understanding
data-storage
data-quality
data-wrangling
batch-etl
distributed-processing
databricks
30m
MasterclassData Ingestion using Databricks
general
e-commerce
approach
data-understanding
data-storage
batch-etl
data-wrangling
databricks
50m
MasterclassData Analysis using Pyspark
e-commerce
data-wrangling
databricks
sql
spark
python
data-understanding
batch-etl
data-storage
55m
MasterclassData Analysis using SparkSQL
e-commerce
general
approach
data-storage
data-wrangling
batch-etl
databricks
spark
sql
data-understanding
01h 15m
ScenarioETL Transformations using Pyspark & SQL: Practice Questions
e-commerce
approach
data-understanding
data-quality
data-wrangling
batch-etl
data-storage
databricks
spark
50m
ScenarioDatabricks DE Associate Partial Mock Test - 01
e-commerce
approach
data-storage
data-quality
data-wrangling
batch-etl
cloud-management
code-versioning
distributed-processing
databricks
01h
ScenarioIntroduction to Medallion Architecture
e-commerce
data-storage
batch-etl
approach
databricks
25m
ScenarioIntroduction to Lakehouse & Delta Lake
general
approach
databricks
data-understanding
data-storage
data-wrangling
batch-etl
45m

Building ETL Pipeline using Medallion Architecture
general
e-commerce
approach
data-understanding
data-storage
data-quality
batch-etl
databricks
data-wrangling
data-modelling
quality
sql
python
03h 20m
ScenarioETL & Medallion Architecture: Practice Questions
general
batch-etl
approach
databricks
30m

Delta Lake Operations - 01: Timetravel, Optimize & Vacuum
general
approach
data-understanding
data-storage
data-quality
batch-etl
databricks
data-wrangling
01h
ScenarioDelta Lake Operations - 01: Practice Questions
e-commerce
approach
quality
data-storage
data-quality
data-wrangling
batch-etl
distributed-processing
data-understanding
databricks
spark
01h

Delta Lake Operations - 02: Handle Incremental Data
e-commerce
general
approach
data-understanding
data-wrangling
batch-etl
data-storage
databricks
01h 10m
ScenarioDelta Lake Operations - 02: Practice Questions
e-commerce
approach
data-understanding
data-storage
data-wrangling
batch-etl
databricks
spark
01h
ScenarioDatabricks DE Associate Partial Mock Test - 02
e-commerce
approach
quality
data-understanding
data-storage
data-modelling
data-wrangling
batch-etl
databricks
45m

Working with Structured Streaming Data
general
approach
data-understanding
data-wrangling
stream-etl
data-storage
databricks
spark
01h 40m

Working with Structured Streaming Data using Autoloader
general
data-wrangling
stream-etl
data-storage
databricks
45m
ScenarioStructured Streaming & Autoloader: Practice Questions
e-commerce
approach
data-storage
data-modelling
data-wrangling
stream-etl
data-understanding
databricks
spark
40m
MasterclassIntroduction to Delta Live Tables (DLT) in Databricks
general
approach
data-wrangling
batch-etl
data-quality
databricks
data-storage
data-understanding
02h 35m
ScenarioDelta Live Tables(DLT): Practice Questions
e-commerce
approach
databricks
40m
ScenarioIntroduction to Workflows in Databricks
e-commerce
batch-etl
approach
databricks
50m
ScenarioWorkflows in Databricks: Practice Questions
e-commerce
batch-etl
databricks
approach
20m
MasterclassIntroduction to SQL Warehouse: Compute, Dashboard & Alerts
general
data-understanding
data-visualization
batch-etl
databricks
sql
approach
data-storage
data-modelling
data-wrangling
01h 30m
ScenarioSQL Warehouse - Databricks: Practice Questions
e-commerce
approach
data-storage
data-quality
data-wrangling
batch-etl
databricks
sql
01h
MasterclassIntroduction to Unity Catalog in Databricks
general
batch-etl
data-governance
access-control-security
approach
databricks
data-storage
data-modelling
01h 20m
ScenarioCode Versioning with Github & Databricks
general
approach
code-versioning
batch-etl
databricks
30m
ScenarioUnity Catalog & Code versioning in Databricks: Practice Questions
general
code-versioning
data-governance
access-control-security
databricks
30m
ScenarioDatabricks DE Associate Partial Mock Test - 03
e-commerce
approach
data-quality
data-wrangling
batch-etl
stream-etl
code-versioning
data-understanding
databricks
45m
ScenarioDatabricks DE Associate Full Length Mock Test - 01
e-commerce
approach
data-storage
data-quality
data-wrangling
batch-etl
stream-etl
code-versioning
distributed-processing
data-modelling
databricks
sql
01h 30m
ScenarioDatabricks DE Associate Full Length Mock Test - 02
e-commerce
approach
data-understanding
data-storage
data-wrangling
batch-etl
stream-etl
code-versioning
data-governance
access-control-security
cloud-management
databricks
spark
01h 30m
ScenarioDatabricks DE Associate Full Length Mock Test - 03
e-commerce
approach
data-understanding
data-storage
data-quality
data-wrangling
batch-etl
stream-etl
cloud-management
code-versioning
distributed-processing
databricks
spark
01h 30m
