Workshop

Data Engineering with Databricks

25th, Aug 2022 | 11:00am – 3:00pm CST

Why Attend This Workshop?

This is an introductory course that serves as an appropriate entry point for anyone with strong SQL knowledge. This course seeks to prepare students to complete the Associate Data Engineering certification exam, and provides the requisite knowledge to take the course Advanced Data Engineering with Databricks.

4 Hours Instructor Lead Training

Things You'll Learn

Upon Completion of the course, students should be able to:

Leverage the Databricks Lakehouse Platform to perform core responsibilities for data pipeline development.
Simplify data ingestion and incremental change propagation using Databricks-native features and syntax, including Delta Live Tables.
Use SQL and Python to write production data pipelines to extract, transform, and load data into tables and views in the Lakehouse.
Orchestrate production pipelines to deliver fresh results for ad-hoc analytics and dashboarding.

Prerequisites

Work or educational experience as a data or IT professional
Experience using SQL to query data from enterprise data stores
Familiarity with basic cloud concepts (virtual machines, object storage, identity management)
Basic familiarity with Python variables, functions, and control flow (preferred)

Workshop Speakers

Mustafa Ali

Mustafa Ali

Data Engineer GCP Practice
at Royal Cyber

Course Topics

Part 1:

  • Databricks Workspace and Services
  • Delta Lake
  • Relational Entities on Databricks

Part 2:

  • ETL With Spark SQL
  • Incremental Data Processing

Part 3:

  • Multi-Hop Architecture
  • Delta Live Tables

Part 4:

  • Task Orchestration with Jobs
  • Running a DBSQL Query
  • Productionalizing Dashboards and Queries in DBSQL (Optional)
25th, Aug 2022 | 11:00am – 3:00pm CST

Data Engineering with Databricks

Register For Workshop

Share this Post

Copyright © 2002-2022 Royal Cyber Inc. All Rights Reserved.