KEMBAR78
Data Engineering With Databricks | PDF | Apache Spark | Computer Data
0% found this document useful (0 votes)
2K views5 pages

Data Engineering With Databricks

This document outlines a training course on data engineering with Databricks. The course goals are to perform data engineering tasks using the Databricks workspace, use Spark to extract, transform, and load data into Delta Lake, define and schedule data pipelines with Delta Live Tables, orchestrate pipelines with Databricks Workflow Jobs, and configure access permissions with Unity Catalog. The six module course covers getting started with the Databricks workspace, transforming data with Spark, managing data with Delta Lake, building pipelines with Delta Live Tables, deploying workloads with Databricks Workflows, and managing data access with Unity Catalog.

Uploaded by

Jaya Bharathi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2K views5 pages

Data Engineering With Databricks

This document outlines a training course on data engineering with Databricks. The course goals are to perform data engineering tasks using the Databricks workspace, use Spark to extract, transform, and load data into Delta Lake, define and schedule data pipelines with Delta Live Tables, orchestrate pipelines with Databricks Workflow Jobs, and configure access permissions with Unity Catalog. The six module course covers getting started with the Databricks workspace, transforming data with Spark, managing data with Delta Lake, building pipelines with Delta Live Tables, deploying workloads with Databricks Workflows, and managing data access with Unity Catalog.

Uploaded by

Jaya Bharathi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 5

Welcome to

Data Engineering
with Databricks

©2022 Databricks Inc. — All rights reserved 1


Course goals
Perform common code development tasks in a data engineering workflow
1
using the Databricks Data Science & Engineering Workspace.

Use Spark to extract data from a variety of sources, apply common cleaning
2 transformations, and manipulate complex data to load into Delta Lake.

Define and schedule data pipelines that incrementally ingest and process data
3
through multiple tables in the lakehouse using Delta Live Tables.

Orchestrate data pipelines with Databricks Workflow Jobs and schedule


4
dashboard updates to keep analytics up-to-date.

Configure permissions in Unity Catalog to ensure that users have proper access
5
to databases for analytics and dashboarding.

©2022 Databricks Inc. — All rights reserved


Course overview
Module 1: Get Started with Databricks Data Science and Engineering Workspace

Module 2: Transform Data with Spark (SQL/PySpark)

Module 3: Manage Data with Delta Lake

Module 4: Build Data Pipelines with Delta Live Tables (SQL/PySpark)

Module 5: Deploy Workloads with Databricks Workflows

Module 6: Manage Data Access for Analytics with Unity Catalog

©2022 Databricks Inc. — All rights reserved 3


Agenda
Module Name Duration

Get Started with Databricks Data Science and Engineering Workspace 1 hour, 20 min

Transform Data with Spark (SQL/PySpark) 2 hours, 50 min

Manage Data with Delta Lake 1 hour, 30 min

Build Data Pipelines with Delta Live Tables (SQL/PySpark) 3 hours

Deploy Workloads with Databricks Workflows 1 hour, 10 min

Manage Data Access for Analytics with Unity Catalog 2 hours

● We will take 10 minute breaks about every hour

4
©2022 Databricks Inc. — All rights reserved
Databricks Policy on recording
Instructor-led Training
We do not permit the recording of any of our instructor-led training
classes, whether for internal purposes or on behalf of our customers
Our prohibition on recordings helps protect the privacy of our instructors
and students taking our courses
If you would like to access recorded training materials, many of our training
materials are pre-recorded and available as free self-paced at
Databricks Academy

https://www.databricks.com/learn/training/home

©2022 Databricks Inc. — All rights reserved 5

You might also like