Azure Data Engineer Course Content
- Introduction to Azure Synapse Analytics
- Describe Azure Databricks
- Introduction to Azure Data Lake storage
- Describe Delta Lake architecture
- Work with data streams by using Azure Stream Analytics
- Explore Azure Synapse serverless SQL pools capabilities
- Query data in the lake using Azure Synapse serverless SQL pools
- Create metadata objects in Azure Synapse serverless SQL pools
- Secure data and manage users in Azure Synapse serverless SQL pools
- Describe Azure Databricks
- Read and write data in Azure Databricks
- Work with DataFrames in Azure Databricks
- Work with DataFrames advanced methods in Azure Databricks
- Understand big data engineering with Apache Spark in Azure Synapse Analytics
- Ingest data with Apache Spark notebooks in Azure Synapse Analytics
- Transform data with DataFrames in Apache Spark Pools in Azure Synapse Analytics
- Integrate SQL and Apache Spark pools in Azure Synapse Analytics
- Use data loading best practices in Azure Synapse Analytics
- Petabyte-scale ingestion with Azure Data Factory
- Data integration with Azure Data Factory or Azure Synapse Pipelines
- Code-free transformation at scale with Azure Data Factory or Azure Synapse Pipelines
- Orchestrate data movement and transformation in Azure Data Factory
- Secure a data warehouse in Azure Synapse Analytics
- Configure and manage secrets in Azure Key Vault
- Implement compliance controls for sensitive data
- Design hybrid transactional and analytical processing using Azure Synapse Analytics
- Configure Azure Synapse Link with Azure Cosmos DB
- Query Azure Cosmos DB with Apache Spark pools
- Query Azure Cosmos DB with serverless SQL pools
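Several of the modules above (working with DataFrames in Azure Databricks and transforming data with DataFrames in Apache Spark pools) revolve around the same filter-then-group-then-aggregate pattern. The sketch below illustrates that pattern in plain Python with made-up sample data, so it runs without a Spark cluster; in the actual labs the equivalent would be a Spark DataFrame chain such as `df.filter(...).groupBy(...).agg(...)`.

```python
from collections import defaultdict

# Illustrative sample rows (hypothetical data, not from the course labs).
rows = [
    {"city": "Pune", "sales": 120},
    {"city": "Pune", "sales": 80},
    {"city": "Mumbai", "sales": 200},
    {"city": "Mumbai", "sales": 40},
]

def total_sales_by_city(rows, min_sale=50):
    """Filter out small sales, then sum sales per city.

    Conceptually the same shape as a Spark DataFrame pipeline:
    df.filter(col("sales") >= min_sale).groupBy("city").agg(sum("sales")).
    """
    totals = defaultdict(int)
    for r in rows:
        if r["sales"] >= min_sale:          # filter step
            totals[r["city"]] += r["sales"]  # group + aggregate step
    return dict(totals)

print(total_sales_by_city(rows))  # {'Pune': 200, 'Mumbai': 200}
```

In Spark the same logic is declared rather than looped, which lets the engine distribute the filter and the partial aggregations across executors.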
Azure Data Factory Course Content
- Introduction to cloud computing
- Why Cloud Computing?
- Different cloud computing models: SaaS, PaaS, and IaaS
- Understanding Hybrid Cloud
- How is Azure a leader in the cloud market?
- Importance of data warehousing, ETL, and analytics in the cloud
- What is Microsoft Azure Data Factory?
- Overview of its workflow and different services in it
- Intro to non-relational data stores
- Intro to Cosmos DB, Data Lake Storage and Blob Storage
- Why Cosmos DB?
- Working with Cosmos DB
- Implement a solution that uses Cosmos DB
- Why Data Lake?
- Working with Data Lake
- Implement a solution that uses Data Lake Storage Gen2
- Why Blob Storage?
- Working with Blob Storage
- Implement a solution that uses Blob storage
- Implement data distribution and partitions
- Implement a consistency model in Cosmos DB
- Provision a non-relational data store
- Provide access to data to meet security requirements
- Understanding high availability, disaster recovery, and global distribution
- Implement for high availability, disaster recovery, and global distribution
- Intro to relational database and data stores
- Understand elastic pools in a data store
- Configuring elastic pools
- Configuring geo-replication
- Provide access to data to meet security requirements in relational data stores
- Implement for high availability, disaster recovery, and global distribution in relational data stores
- What is Azure Synapse Analytics and why is it important?
- Implement data distribution and partitions for Azure Synapse Analytics
- Understanding the relevance of PolyBase
- Implement PolyBase
- Understand data masking
- Implement data masking
- Encrypt data at rest and in motion
- What is Azure Databricks?
- What is Azure Data Factory?
- Why Azure Databricks?
- Why Azure Data Factory?
- Develop batch processing solutions by using Data Factory and Azure Databricks
- Understanding the concept behind data ingestion
- What is PolyBase?
- Understanding the workflow of PolyBase
- Ingest data by using PolyBase
- Implement the integration runtime for Data Factory
- Implement Copy Activity within Azure Data Factory
- Create linked services and datasets
- Create pipelines and activities in Data Factory
- Implement Mapping Data Flows in Azure Data Factory
- Create and schedule triggers in Azure Data Factory
- Implement Azure Databricks clusters, notebooks, jobs, and autoscaling
- Ingest data into Azure Databricks
- Creating the end-to-end data lifecycle in Azure Data Factory
- What is Azure Stream Analytics?
- Why Azure Stream Analytics?
- Configure input and output
- Select the appropriate windowing functions
- Implement event processing by using Stream Analytics
- Monitoring in Azure Data Factory
- What is Azure Monitor?
- Why use Azure Monitor?
- Understanding Azure Log Analytics
- Why Azure Log Analytics?
- Implement monitoring in Blob Storage
- Implement monitoring in Data Lake Storage
- Implement monitoring in SQL Database
- Implement monitoring in Azure Synapse Analytics
- Implement monitoring in Cosmos DB
- Configuring alerts in Azure Monitor
- Implement auditing by using Azure Log Analytics
- Monitoring Data Factory pipelines
- Monitoring Azure Databricks
- Monitoring Stream Analytics
- Configuring alerts in Azure Monitor for data processing
- Implement auditing by using Azure Log Analytics for data processing
- Troubleshoot data partitioning bottlenecks
- Optimize Data Lake Storage
- Optimize Stream Analytics
- Optimize Azure Synapse Analytics
- Optimize SQL Database
- Manage the data lifecycle
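The Stream Analytics topics above (selecting the appropriate windowing functions and implementing event processing) center on assigning events to time windows. As a conceptual sketch only, the plain-Python function below shows what a tumbling window does: each event falls into exactly one fixed-size, non-overlapping window. In a real Stream Analytics job this would be expressed in the query language with `TumblingWindow`, not in Python.

```python
from collections import defaultdict

def tumbling_window_counts(events, window_seconds):
    """Count events per tumbling window.

    events: iterable of (timestamp_seconds, payload) tuples (illustrative shape).
    Each event is assigned to the single window that starts at
    floor(timestamp / window_seconds) * window_seconds -- the defining
    property of a tumbling window: fixed size, no overlap, no gaps.
    """
    counts = defaultdict(int)
    for ts, _payload in events:
        window_start = (ts // window_seconds) * window_seconds
        counts[window_start] += 1
    return dict(counts)

# Hypothetical event stream: timestamps 1, 4, 5, 11 with 5-second windows.
events = [(1, "a"), (4, "b"), (5, "c"), (11, "d")]
print(tumbling_window_counts(events, 5))  # {0: 2, 5: 1, 10: 1}
```

Hopping and sliding windows differ precisely here: a hopping window would let one event land in several overlapping windows, which is why choosing the windowing function is its own topic in the list above.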
Azure Databricks Course Content
- Data processing workflows scheduling and management
- Working in SQL
- Generating dashboards and visualizations
- Data ingestion
- Managing security, governance and HA/DR
- Data discovery, annotation and exploration
- Compute management
- Machine learning (ML) modelling and tracking
- ML model serving
- Source control with Git
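The "Machine learning (ML) modelling and tracking" topic above is about recording the parameters and metrics of each training run so runs can be compared later (in Databricks this is typically done with MLflow tracking). As a minimal stdlib-only sketch of that idea, with illustrative class and metric names that are not part of any real API:

```python
import time

class RunTracker:
    """Toy experiment tracker: logs one record per training run.

    Conceptual stand-in for MLflow-style tracking; the names here
    (RunTracker, log_run, best_run) are invented for illustration.
    """
    def __init__(self):
        self.runs = []

    def log_run(self, params, metrics):
        """Record the hyperparameters and resulting metrics of one run."""
        self.runs.append({"time": time.time(), "params": params, "metrics": metrics})

    def best_run(self, metric):
        """Return the logged run with the highest value for the given metric."""
        return max(self.runs, key=lambda r: r["metrics"][metric])

# Hypothetical runs comparing a tree-depth hyperparameter.
tracker = RunTracker()
tracker.log_run({"max_depth": 3}, {"accuracy": 0.81})
tracker.log_run({"max_depth": 5}, {"accuracy": 0.86})
print(tracker.best_run("accuracy")["params"])  # {'max_depth': 5}
```

The model-serving topic in the list is the next step in the same lifecycle: once the best run is identified, its model artifact is registered and exposed behind an endpoint.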