Advertisements

The Complete Databricks Project on Google Cloud (GCP)

Advertisements
Real-World Traffic Analysis Project-Medallion Arch,Autoloader,Structured Streaming,Workflows,Environments,Github,CICD
5
5/5
(13) Ratings
859 students
Created by Saidhul Shaik
Advertisements

What you'll learn

  • Set up Databricks on Google Cloud (GCP) including workspace, Unity Catalog, clusters, and GCS integration.
  • Implement the Medallion Architecture (Bronze, Silver, Gold) using PySpark and Databricks Auto Loader for structured streaming.
  • Build end-to-end ETL pipelines that load raw data, perform transformations, and generate business-ready gold tables for analytics.
  • Automate workflows and orchestrate pipelines in Databricks with parameterized jobs and file arrival triggers.
  • Integrate GitHub with Databricks and manage code promotion across Dev, UAT, and PRD environments using pull requests.
  • Analyze real-world road traffic data to derive insights such as busiest regions, EV adoption trends, and yearly traffic volume patterns.
This course includes:
5.5 total hours on-demand video
1 articles
0 downloadable resources
19 lessons
Full lifetime access
Access on mobile and TV
Certificate of completion
Advertisements

Course content

Requirements

  • Basic knowledge of SQL PySpark and data concepts
  • Enthusiasm to learn hands-on with a real-world project – no prior Databricks experience required!

Description

  • Are you looking to master Databricks on Google Cloud (GCP) with a real-world, end-to-end project? This course is designed to give you hands-on experience with one of the most in-demand skills in data engineering: building scalable data pipelines using Databricks, PySpark, and Medallion Architecture.

  • In this project, we take on the role of a government transport agency analyzing road traffic data. You will learn how to manage road infrastructure datasets, process traffic counts from sensors, and generate insights such as the busiest regions, EV adoption trends, and year-over-year traffic volume.

  • We will start by setting up Databricks on GCP, creating buckets, external locations, and Unity Catalog, and then move step by step through Bronze, Silver, and Gold layers using Auto Loader and Structured Streaming. You’ll gain real-world exposure to data ingestion, transformation, and aggregation pipelines.

  • The course also goes beyond development by covering workflow orchestration, GitHub integration, and CI/CD practices. You will learn how to set up Dev, UAT, and Production environments, manage code using Git branches, and promote pipelines using pull requests – just like in real industry projects.

  • By the end of this course, you will not only have built a portfolio-ready project, but also be equipped with the practical knowledge and interview-ready concepts to crack Data Engineering roles involving Databricks and GCP.

  • Whether you’re a beginner or a working professional, this project-based course ensures you learn by doing – and walk away with confidence in both technical skills and real-world applications.

Who this course is for:

  • Aspiring Data Engineers who want to gain practical, hands-on experience to crack interviews
  • Cloud Professionals (GCP/Azure/AWS) looking to expand their skills into Databricks and Medallion architecture.
  • Students & Beginners in Data Engineering who want a guided, real-world project to add to their portfolio.
Advertisements
D21327C01908EED7A13B
Advertisements
Advertisements
Free Online Courses with Certificates
Logo
Register New Account