
Batch Processing through Databricks on Azure

3 modules
27 Hours 23 Minutes

Overview

GlobalMart, an e-commerce startup, faces data inaccuracies, schema inconsistencies, and a lack of stakeholder trust in its data systems. What measures are needed to address and resolve these issues?


GlobalMart is a startup revolutionizing the shopping experience for its customers, both in the retail landscape and the online marketplace. As GlobalMart continues to expand, it increasingly relies on data-driven decision-making.

For GlobalMart to be data-driven, its stakeholders need accurate, up-to-date data. Unfortunately, providing it has become a major challenge and bottleneck. An effort that began as a way to improve operational efficiency and decision-making is now causing a lot of friction between stakeholders.

GlobalMart now faces the following challenges:

  • Data Silos and Absence of a Single Source of Truth
  • Data Inconsistency and Quality Issues
  • Lack of Access Control and Compliance Challenges
  • Complex and Time-Consuming Data Transformation Processes
  • Unclear Data Location and Origin Leading to Redundancy

These issues have led to a lack of trust in the data systems, rendering them effectively useless. In this project, you will implement the following architecture, which addresses all the problems that GlobalMart currently faces in its data systems.

[Image: architecture diagram]

Skill path

Project

Batch Processing through Databricks on Azure - Part 01

e-commerce
data-storage
cloud-management
approach
azure
data-understanding
data-quality
data-wrangling
databricks
spark
sql
04h 30m

Project

Batch Processing through Databricks on Azure - Part 02

e-commerce
data-storage
data-quality
cloud-management
distributed-processing
databricks
spark
sql
code-versioning
google-cloud
data-wrangling
google-cloud-storage
data-modelling
data-understanding
16h 53m

Project

Batch Processing through Databricks on Azure - Part 03

e-commerce
data-storage
data-wrangling
databricks
batch-etl
google-cloud
airflow
06h
