Online learning from an accredited institution
DP-203: Azure Data Engineer Associate
In this course, the student will learn how to implement and manage data engineering workloads on Microsoft Azure, using Azure services such as Azure Synapse Analytics, Azure Data Lake Storage Gen2, Azure Stream Analytics, Azure Databricks, and others. The course focuses on common data engineering tasks such as orchestrating data transfer and transformation pipelines, working with data files in a data lake, creating and loading relational data warehouses, capturing and aggregating streams of real-time data, and tracking data assets and lineage.
4 Day Instructor-Led
Prerequisites:
Successful students start this course with knowledge of cloud computing and core data concepts and
professional experience with data solutions. Specifically completing:
• AZ-900 – Azure Fundamentals
• DP-900 – Microsoft Azure Data Fundamentals
Course Outline
Module 1: Get Started with Data Engineering on Azure
• Introduction to data engineering on Azure
• Introduction to Azure Data Lake Storage Gen2
Module 2: Experiment with Azure Machine Learning
• Use Azure Synapse serverless SQL pool to query files in a data lake
• Use Azure Synapse serverless SQL pools to transform data in a data lake
• Create a lake database in Azure Synapse Analytics
• Secure data and manage users in Azure Synapse serverless SQL pools
Module 3: Perform Data Engineering with Azure
Synapse Apache Spark Pools
• Analyze data with Apache Spark in Azure Synapse Analytics
• Transform data with Spark in Azure Synapse Analytics
• Use Delta Lake in Azure Synapse Analytics
Module 4: Use Azure Synapse Serverless SQL Pool
to Query Files in a Data Lake
• Understand Azure Synapse serverless SQL pool capabilities and use cases
• Query files using a serverless SQL pool
• Create external database objects
Module 5: Transfer and transform data with Azure Synapse Analytics pipelines
• Build a data pipeline in Azure Synapse Analytics
• Use Spark Notebooks in an Azure Synapse Pipeline
Module 6: Implement a Data Analytics Solution with Azure Synapse Analytics
• Introduction to Azure Synapse Analytics
• Use Azure Synapse serverless SQL pool to query files in a data lake
• Analyze data with Apache Spark in Azure Synapse Analytics
• Use Delta Lake in Azure Synapse Analytics
• Analyze data in a relational data warehouse
• Build a data pipeline in Azure Synapse Analytics