Databricks Data Engineer Skill Path

An end-to-end, industry-focused skill path to help you gain the essential skills of a Databricks Data Engineer

Pre-Requisites

Data Analyst Skill Path

Key Highlights

✅ Big Data analysis on datasets larger than 7 GB

✅ Aligned to Databricks Data Engineer Associate Certification

✅ SQL and Python optimization for efficient data analysis

Skill Path

Scenario

A Practical Sense to SQL Query Optimization

This module dives deep into writing optimized queries that save an organization compute resources and cost.

e-commerce
sql-optimization
120 Minutes

Masterclass

Advanced Data wrangling and Python code optimization

This module dives deep into writing modular code for challenging data tasks and working with complex data structures such as JSON.

e-commerce
data-ingestion
550 Minutes

Masterclass

Big Data Foundations

This module introduces you to big data storage and processing, and explains why monolithic systems are a poor design for big data.

general
data-storage
data-wrangling
420 Minutes

Masterclass

Significance of Cloud

This module introduces the need for the cloud and how it solves modern-day data problems in the industry.

general
cloud-computing
420 Minutes

Masterclass

Working with Object storage on Cloud

This module gives you a practical sense of data lakes and when and where to use them. It also introduces you to working with a cloud SDK and how to use it to programmatically ingest data from data lakes.

e-commerce
data-lake
480 Minutes

Masterclass

Databases connectivity on Cloud

This module introduces you to various databases on the cloud and how to connect to them. It also dives deep into ingesting data from these databases.

e-commerce
180 Minutes

Masterclass

Need for ETL for Organizations

This module dives deep into the need for ETL pipelines.

general
ETL
120 Minutes

Masterclass

Introduction to Databricks

This module details the need for a unified analytics platform like Databricks and how to use it to tackle Data + AI challenges. We will look into the Databricks architecture and how it can be set up on Azure Databricks. We will also cover the different types of clusters needed for various analytical workloads.

general
databricks
clusters
240 Minutes

Masterclass

Data Analysis using PySpark on Databricks Part 1

This module dives deep into data analysis using PySpark.

e-commerce
databricks
PySpark
420 Minutes

Masterclass

Data Analysis using PySpark on Databricks Part 2

This module dives deep into data analysis using Spark SQL.

e-commerce
databricks
data-wrangling
300 Minutes

Scenario bundle

Data Analysis on Databricks - Practice Set

2401 Minutes

Project

Data Harmonization using Databricks

This is an industry-inspired project to harmonize heterogeneous upstream systems.

insurance
databricks
180 Minutes

Masterclass

Introduction to Delta Lake in Databricks

This module details the need for Delta Lake and its advantages over a data lake and a data warehouse. We will also look into how to create and work with data in Delta Lake.

e-commerce
databricks
delta-lake
270 Minutes

Masterclass

Advanced Delta Lake Operation in Databricks

This module dives deep into designing a Delta Lake for industry use and optimizing the serving layer for efficient queries.

e-commerce
databricks
delta-lake
240 Minutes

Project

Building Batch ETL pipeline in Databricks using Medallion Architecture

This is an industry-inspired project

insurance
databricks
ETL
400 Minutes

Masterclass

Structured Streaming in Databricks

This module gives an in-depth exploration of processing live data streams, building scalable streaming applications, and integrating with various data sources.

e-commerce
structured-streaming
databricks
240 Minutes

Masterclass

Efficient Streaming Pipelines using Auto Loaders in Databricks

This module on Auto Loaders in Databricks dives deep into reducing manual dependencies and increasing the efficiency of a streaming pipeline. It also covers implementing scalable, fault-tolerant streaming pipelines using Auto Loaders.

e-commerce
databricks
auto-loaders
270 Minutes

Masterclass

Declarative ETL using Delta Live Tables in Databricks

This module is all about accelerating the ETL process by building scalable data pipelines using Delta Live Tables in Databricks.

e-commerce
delta-live-tables
databricks
ETL
210 Minutes

Masterclass

Data Quality Tests using Delta Live Tables in Databricks

This module helps you to understand how Delta Live Tables enable automated data quality checks to ensure accurate, reliable data.

e-commerce
delta-live-tables
databricks
data-quality
120 Minutes

Masterclass

ETL Pipeline orchestration using Workflows in Databricks

This module details the need for pipeline orchestration when building data pipelines at scale. It also covers the use of Workflows in Databricks for efficient pipeline orchestration.

e-commerce
databricks
Workflow Orchestration
90 Minutes

Masterclass

Introduction to Data Governance using Unity Catalog in Databricks

e-commerce
data-governance
unity-catalog
databricks
300 Minutes

Masterclass

Cost Optimization with Spark Performance tuning in Databricks

banking
databricks
PySpark
210 Minutes

Scenario bundle

Certification Bundle

This module will make you battle-ready for the Databricks Data Engineer Associate certification.

330 Minutes

Project

Capstone Project

400 Minutes