Data
Engineering
Reach Us
+91 77603 09798 3rd cross, Aswath Nagar,
info@edukron.com
+91 77602 89798 Marathahalli Bridge,
Bangalore – 560037
Data Course
Engineering Curriculum
MODULE MODULE Control Flow and Error Handling in
Variables and Data Types in Python Python
01 02
1. Comprehensive curriculum from basics to advanced concepts
Overview of Python Conditional Statements (if, elif, else)
2. Hands-on projects for practical learning.
Installation and Setup Loops (for loop, while loop)
3. Expert-led instruction by industry professionals.
Python Syntax Control Flow (break, continue)
4. Flexible learning: online or offline options. Basic Input and Output Exception Handling
5. Interactive sessions and collaborative projects
Variables and Data Types Error Handling Best Practices
6. Personalised mentorship for individual guidance.
MODULE MODULE
7. Continuous assessments for skill enhancement. Data Structures in Python Functions and Scope in Python
03 04
8. Updated content reflecting industry practices.
Lists Defining Functions
9. Networking opportunities with peers and pros.
Tuples Parameters and Arguments
10. Certification upon completion. Dictionaries Return Statement
Sets Scope of Variables
List Comprehensions Lambda Functions
01 02
MODULE MODULE
Modules and Packages Advanced Topics
07 08
Creating and Using Modules Decorators
Tuples Generators
Dictionaries Regular Expressions
Sets Introduction to Libraries and Frameworks
List Comprehensions Debugging Techniques
MODULE MODULE
File Handling Object-Oriented Programming (OOP)
05 06
MODULE Introduction to Apache Spark and MODULE
PySpark Working with Data in PySpark
09 10
Reading and Writing Files Classes and Objects
File Modes Constructors and Destructors Overview of Apache Spark and PySpark Transformations and Actions
Working with JSON and CSV Files Inheritance Installation and Setup Lazy Evaluation
Exception Handling with Files Polymorphism PySpark Architecture Caching and Persistence
File Handling Best Practices Encapsulation RDDs (Resilient Distributed Datasets) Partitions and Parallelism
DataFrames and Spark SQL Performance Optimization Techniques
MODULE PySpark Transformations and MODULE MODULE Performance Tuning and Cluster MODULE
Machine Learning with PySpark Advanced Data Analytics with Spark
11 Actions 12 15 Management 16
Reading and Writing Data Introduction to MLlib Performance Tuning and Optimization Data Pipelines and ETL (Extract, Transform, Load)
Working with Structured Data (CSV, JSON, Parquet) Feature Engineering Clickstream Analysis
Cluster Management
Dictionaries Model Training and Evaluation Data Partitioning Strategies Fraud Detection
Sets Model Tuning Security and Authentication Recommender Systems
List Comprehensions Deployment and Serving Integration with other Big Data Ecosystem Tools Social Network Analysis
Java Methods and object-
MODULE MODULE Oriented programming MODULE MODULE
Spark Streaming Graph Processing with Spark Introduction to Azure Databricks Data Processing with Databricks
13 14 17 18
Introduction to Spark Streaming Introduction to GraphFrames Overview of Azure Databricks Working with DataFrames
DStreams (Discretized Streams) Creating GraphFrames Creating and Configuring Databricks Workspace ETL (Extract, Transform, Load) Operations
Window Operations Graph Algorithms Databricks Architecture Data Cleaning and Preprocessing
Stateful Streaming Graph Queries Collaborative Notebooks Window Functions and Aggregations
Encapsulation Visualization and Analysis Data Ingestion and Integration Optimization Techniques
MODULE MODULE MODULE
Data Analysis and Visualization Machine Learning with Databricks Introduction to Databases and SQL
19 20 23
Data Visualization Libraries (Matplotlib, Seaborn) Overview of Databases and Database Management Systems
Introduction to MLlib
Exploratory Data Analysis (EDA) Introduction to SQL and its Importance
Feature Engineering
Statistical Analysis Setting Up SQL Environment (e.g., MySQL, PostgreSQL)
Model Training and Evaluation
Interactive Dashboards (Databricks Visualization) Basic SQL Syntax and Statements
Hyperparameter Tuning
SQL Analytics Introduction to Data Types and Constraints
Model Deployment
MODULE
SQL Fundamentals
MODULE Real-time Data Processing with MODULE 24
Advanced Topics in Databricks
21 Structured Streaming 22
Introduction to Structured Streaming Delta Lake and Data Versioning Select Statement and Retrieving Data
Real-time Data Processing Graph Analytics Filtering Data with WHERE Clause
Window Operations Security and Authentication Sorting Data with ORDER BY Clause
Stateful Streaming Best Practices in Databricks Limiting Results with LIMIT and OFFSET Clauses
Integrating with Event Hubs or Kafka Case Studies and Hands-on Projects Using DISTINCT and Aggregate Functions
MODULE Advanced SQL Techniques and MODULE Database Administration and
MODULE Advanced SQL Queries and 27 Performance Optimization 28 Security
25 Subqueries
Working with Views Creating and Managing Users and Permissions
Understanding Joins (INNER JOIN, LEFT JOIN, RIGHT JOIN)
Using Common Table Expressions (CTEs) Backup and Restore Operations
Combining Multiple Tables with JOIN
Analytic Functions and Window Functions Database Maintenance and Optimization
Using Subqueries in SELECT Statements
Handling NULL Values Security Best Practices
Correlated Subqueries
Performance Optimization Techniques Introduction to Database Migration and Cloud Databases
Using EXISTS and NOT EXISTS
MODULE Introduction to Data Visualization MODULE Basic Visualizations and Customiza-
MODULE Data Manipulation and Transaction 29 Principles 30 tions in Tableau
26 Control
Overview of Data Visualization Principles Creating Basic Charts (Bar, Line, Pie)
Inserting Data into Tables
Introduction to Tableau and its Importance Formatting and Customizing Visualizations
Updating Existing Data
Installing and Setting Up Tableau Desktop Adding Filters and Parameters
Deleting Data from Tables
Connecting to Data Sources Using Groups and Sets
Managing Transactions with COMMIT and ROLLBACK
Understanding Tableau Interface and Navigation Introduction to Calculated Fields
Controlling Data Integrity with Constraints
MODULE Introduction to Power BI and Data MODULE Data Loading, Transformation, and
MODULE Advanced Visualizations and MODULE Interactive Dashboard Design in
35 Connection 36 Modeling in Power BI
31 Calculations in Tableau 32 Tableau
Using Dual Axes and Combined Axis Designing Interactive Dashboards Overview of Business Intelligence (BI) Concepts Data Loading and Transformation
Working with Trend Lines and Reference Lines Dashboard Layout and Formatting Introduction to Power BI and its Importance Data Cleansing and Manipulation
Using Maps for Geospatial Analysis Creating Actions and Interactivity Installing and Setting Up Power BI Desktop Creating Relationships between Data Tables
Implementing Advanced Calculations Best Practices for Dashboard Design Connecting to Data Sources Data Modeling and DAX (Data Analysis Expressions)
Incorporating Tableau Prep for Data Preparation Storytelling with Data using Tableau Story Points Understanding Power BI Interface and Navigation Introduction to Power Query Editor
MODULE Basic Visualizations and MODULE Advanced Visualizations and
MODULE Advanced Data Analysis Techniques MODULE Tableau Server and Online 37 Customizations in Power BI 38 Analytics in Power BI
33 in Tableau 34 Administration
Implementing Level of Detail (LOD) Expressions Creating Basic Visualizations (Bar, Line, Pie) Using Calculated Columns and Measures
Publishing to Tableau Server or Tableau Online
Advanced Table Calculations Formatting and Customizing Visualizations Implementing Conditional Formatting
Managing Permissions and Access Control
Forecasting and Trend Analysis Adding Filters and Slicers Working with KPIs (Key Performance Indicators)
Scheduling Data Refreshes
Clustering and Segmentation Introduction to Hierarchies and Drill-down Incorporating Map Visualizations
Collaborating with Tableau Server/Online
Integrating R and Python Scripts Using Custom Visuals from AppSource Utilizing AI Insights (Quick Insights, Q&A)
Introduction to Tableau Mobile and Embedded Analytics
Target Audience
MODULE Advanced Interactive Dashboard The ideal candidates for a data engineering course include
39 Design and Implementation
recent graduates with degrees in computer science,
information technology, or related fields.
Designing Interactive Dashboards
These individuals possess strong analytical and
Dashboard Layout and Formatting programming skills, providing a solid foundation for
Creating Drill-through and Drill-down Reports learning data engineering concepts.
Implementing Cross-filtering and Highlighting They are eager to apply their academic knowledge to
Advanced Interactivity with Bookmarks and Buttons real-world scenarios, focusing on building and managing
scalable data pipelines and infrastructure.
Additionally, professionals with experience in software
development, database management, or IT are prime
MODULE
40
Power BI Service and Collaboration candidates for a data engineering course.
They bring a practical understanding of systems and data
Publishing to Power BI Service management, which allows them to quickly adopt data
engineering practices.
Managing Dashboards and Reports in Power BI Service
These individuals often aim to upskill or transition into roles
Sharing and Collaboration Features
that focus on designing, constructing, and maintaining the
Security and Access Control architectures that enable data analysis and business
Introduction to Power BI Mobile App and Embedded intelligence, thus enhancing their value in data-driven
Analytics industries.
Projects Placement
ARJUN Pooja Karthik
Larsen & Toubro (L&T) Mahindra & Mahindra Tata Motors
15LPA 15LPA 15LPA
Data Pipeline Development Real-time Data Streaming
Processing
Ananya Vikram Deepa
Bharat Heavy Electricals Ashok Leyland Maruti Suzuki
Limited (BHEL)
14.5LPA 14LPA
14.5LPA
4.3LPA 4.3LPA 4.3LPA 4.3LPA
Data Warehousing Data Quality Assessment Data Integration and ETL
Implementation and Improvement (Extract, Transform, Load)
Reviews
"EDUKRON in Marathahalli is undoubtedly the go-to "Attending the Data Engineering course at EDUKRON
destination for mastering Data Engineering. The was a game-changer for me. Bharath's lucid
Varun Shruti instructor, Bharath, brings extensive expertise in Data explanations coupled with real-world examples made
Engineering, Python, and Big Data technologies. With complex concepts digestible. The availability of both
hands-on sessions and a comprehensive curriculum, weekday and weekend batches accommodates
EDUKRON ensures every learner grasps the intricacies diverse schedules. Through real-time projects, I gained
Hero MotoCorp Bosch of data engineering. I highly recommend this institute practical experience that propelled my career forward.
for anyone aspiring to excel in the field of Data Thanks to Bharath and EDUKRON, I now feel confident in
13.5LPA 13.5LPA Engineering." my Data Engineering skills."
Vivek Nisha "I can't recommend EDUKRON enough for anyone
serious about mastering Data Engineering with Python.
"EDUKRON stands out as the premier destination for
learning Data Science, ML, and Deep Learning. Bharath's
Bharath's expertise in Machine Learning and Deep wealth of industry experience and exceptional teaching
Learning is unparalleled. His clear explanations of the skills make learning a breeze. Starting from basic
Escorts Limited mathematical foundations behind algorithms are concepts to advanced topics like NLP and Deep Learn-
TVS Motor Company invaluable. At EDUKRON, you not only learn but also ing, Bharath covers it all. The hands-on approach and
13LPA 12.5LPA evolve into a competent Data Engineer. Trust me,
EDUKRON is where your journey to becoming a Data
real-time projects ensure practical learning. If you
aspire to excel in Data Engineering, look no further than
Scientist begins." EDUKRON."