IITM Pravartak and
AWS Certified Advanced
Professional course in Big
Data and Cloud Analytics
Live Classes with 100% Job
Placement Assistance
About IIT MADRAS & Incubation Cell
The Indian Institute of Technology Madras (IIT Madras) is
recognized globally and holds the distinction of being the
No. 1 engineering university in India. With a faculty of
international fame and a highly motivated, talented
student community, IIT Madras stands true as an
Institute of Eminence. It is a public technical and research
university established by the Government of India. The IITM
Incubation Cell nurtures technology ventures through their
start-up phase by providing all the support
GUVI In A Glance
Grab Ur Vernacular Imprint (GUVI), an IIT Madras
incubated company, is the world's first vernacular ed-tech
learning platform. Introduced by ex-PayPal employees,
GUVI empowers students to master programming skills
in the comfort of their native language. Its mission is
to impart technical skills to all through focussed
pedagogical tools.
About the Co-Founders
Sridevi
Co-Founder at GUVI
20+ years of Technical Expertise & more
Tech Woman Entrepreneur who was selected
for Google Developers' Launchpad Program
Arun Prakash
CEO & Founder at GUVI
20+ years of Technical Expertise & more
Built 7 Products from Scratch, Mentored 1000+
students, Hosted 200+ sessions & 25+ webinars
Bala Murugan
Co-Founder at GUVI
17+ years of experience in the IT industry
Technologist with 9+ years of Entrepreneurial
experience & Member of the Syllabus
Sub-Committee at Anna University
Data Engineering Program
Over 5 months of weekend live online classes, our
Data Engineering Program covers every technology
in depth. Hands-on training with 20+ industry projects,
backed by bi-weekly hackathons, mock interviews, and
unlimited practice sessions on coding practice
platforms like CodeKata, WebKata, and MicroArc, helps
students with no prior tech knowledge skill up
and get ready for Data Engineering job roles. Along with
an industry-oriented curriculum and industry-recognized
certification, we offer 50+ guaranteed interviews with
100% job placement support.
Why GUVI’s
Data Engineering Class?
175% Highest Hike
₹21 Lakhs Highest Salary
40% Hike in Demand
200+ Hiring Partners
360+ Best Industry Experts
100% Job Placement Support
Pre-Bootcamp Phase
Book a Seat with ₹8000 (100% Refundable)
Attend Pre-Bootcamp Session (2 sessions)
Take Assessment
If Selected / Interested: Proceed to Data Engineering Program with
₹1,23,900 (Course Fee) - ₹8000 (Booking Fee) = ₹1,15,900 (Remaining Fee)
If Not Selected / Not Interested: Immediate Refund of ₹8000
Data Engineering
Program
5-Months Weekend Live Online Class
Hands-on Industry Projects + Bi-weekly Hackathons
Technical Mentorship by Industry Experts + Periodic Developer Meetups
Practice on Coding Practice Platforms: CodeKata, WebKata, MicroArc
Mock interviews + Group Discussion Test
Live Cumulative Coding Test + Capstone Project
Proceed to Placement Phase
Placement Phase
Tech-Skilled Data
Engineering Learner
Enters Placement Window
Exclusive Skill-based
Job Notifications
Company-specific Expectations Setting
(Hits Interviews Arranged)
Tech-Guidance +
100% Placement Support
Top skills you’ll learn!
Strong command of Python.
Solid foundation in databases.
Cloud services.
Hands-on experience in Big Data.
Excellent knowledge of data cleaning and data
visualization techniques.
Technologies covered
Python (Primary)
Database
Shell Script
Orchestrator
Cloud Services
Big Data
Data Cleaning
Data Pipelines
Infrastructure as Code
Data Security & Privacy
Capstone Project
Program Curriculum
Module 1 : Python
Students explore Python, a versatile and
beginner-friendly programming language. Python
is known for its readability and wide range of
applications, from web development and data
analysis to artificial intelligence and automation.
It offers a rich ecosystem of libraries and tools,
making it a popular choice for both novice and
experienced programmers.
Why Python?
Python IDE
Hello World Program
Variables & Names
String Basics
List
Tuple
Dictionaries
Conditional Statements
For and While Loops, Try and Except
Numbers and Math Functions
Common Errors in Python
Module 2 : Python (Advanced)
Students will dive into some advanced concepts
like comprehensions, file handling, regular
expressions, object oriented programming,
pickling and many more essential concepts.
Functions, Lambda, Filter and Map
Python Functions as Arguments
List Comprehension
Debugging in Python
Classes and Objects
Inheritance, Polymorphism, Abstraction
Linear and Non-Linear Data Structures
Singly, Doubly, and Circular Linked Lists, Binary Tree
Bubble, Insertion, Merge, Quick, and Heap Sort
File Handling (Text, JSON, CSV)
Iterators
Pickling, Multithreading
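As a small taste of the Module 2 topics, the sketch below shows a lambda, filter, map, a list comprehension, and a function passed as an argument. The marks and the helper name apply_twice are made up for illustration and are not taken from the course material.

```python
# Hypothetical sample data, not from the course material.
marks = [35, 72, 58, 91, 40]

# Lambda: a small anonymous function.
is_pass = lambda m: m >= 40

# filter keeps the items for which the function returns True.
passed = list(filter(is_pass, marks))

# map applies a function to every item.
scaled = list(map(lambda m: m / 100, passed))

# A list comprehension expresses the same filter-then-map in one line.
scaled_again = [m / 100 for m in marks if m >= 40]


# Functions can be passed as arguments like any other value.
def apply_twice(func, value):
    return func(func(value))


print(passed)
print(scaled == scaled_again)
print(apply_twice(lambda x: x + 5, 10))
```

The comprehension and the filter/map pair produce the same list; which one reads better is largely a matter of taste.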
Module 3 : SQL
Students dive into SQL (Structured Query
Language) to acquire the skills needed for
managing and querying relational databases. SQL
enables them to retrieve, update, and manipulate
data, making it a fundamental tool for working with
structured data in various applications.
Sub module
Joins
SQL Outer Join
SQL Left Join
SQL Right Join
SQL Full Join
SQL Cross Join
Integrating Python with SQL
Window functions (rank, dense rank, row number, etc.)
Sub module
Data Types, Variables, and Constants
Conditional Structures (IF, CASE, GOTO, and NULL)
Stored Procedures and Functions
Subqueries
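To illustrate two of the Module 3 topics together (integrating Python with SQL, and window functions), here is a minimal sketch. It assumes Python's built-in sqlite3 module as a stand-in database and needs an SQLite build of 3.25 or newer for window functions. The scores table and its rows are hypothetical.

```python
import sqlite3

# In-memory SQLite database used as a stand-in; table and data are made up.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE scores (name TEXT, score INTEGER)")
conn.executemany(
    "INSERT INTO scores VALUES (?, ?)",
    [("Asha", 90), ("Ravi", 90), ("Meena", 85), ("John", 70)],
)

# RANK leaves gaps after ties, DENSE_RANK does not, ROW_NUMBER is always unique.
query = """
SELECT name,
       score,
       RANK()       OVER (ORDER BY score DESC)       AS rnk,
       DENSE_RANK() OVER (ORDER BY score DESC)       AS dense_rnk,
       ROW_NUMBER() OVER (ORDER BY score DESC, name) AS row_num
FROM scores
ORDER BY score DESC, name
"""
rows = conn.execute(query).fetchall()
for row in rows:
    print(row)
conn.close()
```

The two students tied at 90 share rank 1; the next student gets rank 3 but dense rank 2, which is the difference between the two functions.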
Module 4 : RDBMS
Students explore RDBMS (Relational Database
Management System) to understand the
database technology that organizes data into
structured tables with defined relationships.
MySQL
SQL Keys
Primary Key
Foreign Key
Unique Key
Composite Key
Triggers
Indexes
Transactions
Views
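The sketch below touches several Module 4 topics at once: primary, unique, and foreign keys, an index, a view, and a transaction that rolls back. The module itself lists MySQL; this example swaps in Python's built-in sqlite3 so it runs without a server, and the customers/orders schema is hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite enforces foreign keys only when this pragma is switched on.
conn.execute("PRAGMA foreign_keys = ON")

conn.executescript("""
CREATE TABLE customers (
    id    INTEGER PRIMARY KEY,
    email TEXT UNIQUE
);
CREATE TABLE orders (
    id          INTEGER PRIMARY KEY,
    customer_id INTEGER NOT NULL REFERENCES customers(id),
    amount      INTEGER NOT NULL
);
CREATE INDEX idx_orders_customer ON orders(customer_id);
CREATE VIEW customer_totals AS
    SELECT customer_id, SUM(amount) AS total
    FROM orders
    GROUP BY customer_id;
""")

conn.execute("INSERT INTO customers VALUES (1, 'a@example.com')")
conn.execute("INSERT INTO orders VALUES (1, 1, 500)")
conn.commit()

# Transaction: the second insert violates the foreign key,
# so the first insert in the same block is rolled back as well.
try:
    with conn:  # commits on success, rolls back on an exception
        conn.execute("INSERT INTO orders VALUES (2, 1, 300)")
        conn.execute("INSERT INTO orders VALUES (3, 99, 100)")  # no customer 99
except sqlite3.IntegrityError as exc:
    print("rolled back:", exc)

totals = conn.execute("SELECT * FROM customer_totals").fetchall()
print(totals)
```

Only the committed order of 500 survives, which is the all-or-nothing behaviour a transaction is meant to give.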
Module 5 : MongoDB
Students delve into MongoDB to understand this
popular NoSQL database, which stores data in
flexible, JSON-like documents. They learn how
MongoDB's scalability and speed make it suitable
for handling large volumes of unstructured data.
Sub module
CAP Theorem
Structured and unstructured data
OLTP vs OLAP
Schema vs Schemaless
Dimensional Modelling
Cluster Setup and Monitoring
Insert First Data
CRUD Operations
Insert Many
Update and Update Many
Delete and Delete Many
Sub module
Projection
Intro to Embed Documents
Embed Documents in Action
Adding Arrays
Fetching Data From Structured Data
Schema Types
Types of Data in MongoDB
Relationships Between Data
Aggregation
Sub module
One to One using Embed Method
One to One using Reference Method
One to Many using Embed Method
One to Many using Reference Method
Assessment - MongoDB
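The embed and reference methods listed above can be pictured with plain Python dictionaries shaped like JSON documents. This is only a sketch of the two document layouts; it does not use a MongoDB driver, and the customer and order data are hypothetical.

```python
# Embedded one-to-many: the orders live inside the customer document.
customer_embedded = {
    "_id": 1,
    "name": "Asha",
    "orders": [
        {"item": "keyboard", "qty": 1},
        {"item": "mouse", "qty": 2},
    ],
}

# Referenced one-to-many: orders are separate documents that point back by id.
customers = [{"_id": 1, "name": "Asha"}]
orders = [
    {"_id": 101, "customer_id": 1, "item": "keyboard", "qty": 1},
    {"_id": 102, "customer_id": 1, "item": "mouse", "qty": 2},
]


def orders_for(customer_id, order_docs):
    """Resolve the reference by hand, the way a second query or lookup would."""
    return [o for o in order_docs if o["customer_id"] == customer_id]


embedded_items = [o["item"] for o in customer_embedded["orders"]]
referenced_items = [o["item"] for o in orders_for(1, orders)]
print(embedded_items == referenced_items)
```

Embedding returns everything in one read, while referencing keeps documents small and avoids duplicating data shared by many parents.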
Module 6 : Shell Script
Students explore shell scripting in the Linux
environment, where they learn to write and
execute scripts using the command-line
interface. Shell scripts are text files containing
a series of commands, and students discover
how to use them to automate tasks.
Introduction to Linux
Basic Shell Script Commands
Creating Frameworks
Cron Jobs, Email Alerts
Running Batch Jobs
Module 7 : Git
Students study Git, a distributed version control
system, to learn how it tracks changes in software
code. Git allows collaborative development, enabling
multiple people to work on the same project
simultaneously while managing different versions of
code. It is essential for software development, as it
tracks revisions, facilitates collaboration, and helps
in code management.
Introduction to Git
Git Commands
Cloning a Repository in VS Code
Working on cloning, branches, commit, push,
add, and merge from VS Code
Module 8 : AWS Cloud
Students delve into cloud computing, which
involves delivering various computing services
(such as servers, storage, databases, networking,
software, and analytics) over the internet.
Sub module
Introduction to Cloud
AWS Services overview
Server vs serverless
IAM, roles, policies
EC2, VMs
S3
RDS – MySQL Free Tier Database
Integrating RDS with the Local System and
the Python Environment
Sub module
Lambda
CloudWatch
Integrating All the Above Components and RDS
Monitoring an ETL Pipeline with Step Functions
Glue, Data Crawler, Athena
System Design
Load Balancer and High Availability
Horizontal vs Vertical Scaling
Monolithic vs Microservices
Distributed Messaging Service and AWS SQS
CDN (Content Delivery Network)
Caching, Scalability
AWS API Gateway
Module 9 : Snowflake
Students study Snowflake to grasp modern
cloud-based data warehousing, focusing on its
architecture, data sharing, scalability, and data
analytics applications.
Introduction to Snowflake
Difference Between Data Lake, Data Warehouse,
Delta Lake, and Database
Dimension and Fact Tables
Roles and Users
Data Modeling, Snowpipe
MOLAP and ROLAP
Partitioning and Indexing
Data Marts, Data Cubes, and Caching
Data Masking
Handling JSON Files
Data Loading from S3 and Transformation
Module 10 : Airflow
Students explore Airflow to understand its role
in orchestrating and automating workflows,
scheduling tasks, managing data pipelines, and
monitoring job execution.
What Is Airflow and Why Use It
Airflow UI
Run First DAG
Grid View
Graph View
Landing Times View
Calendar View
Gantt View
Code View
Core Concepts of Airflow
DAGs
Scope
Operators
Control Flow
Task and Task Instance
Database and Executors
ETL/ELT Process Implementation
Monitoring an ETL Pipeline with Airflow
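The core idea behind a DAG (directed acyclic graph) of tasks can be sketched without Airflow itself. The example below uses the standard-library graphlib module (Python 3.9+) to run hypothetical ETL tasks in dependency order. It is not the Airflow API; the task names are assumptions made for illustration.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical ETL tasks: each key runs only after the tasks in its set.
dependencies = {
    "extract": set(),
    "transform": {"extract"},
    "validate": {"transform"},
    "load": {"validate"},
    "notify": {"load"},
}


def run_pipeline(deps):
    """Run tasks in dependency order and return the order used."""
    order = []
    for task in TopologicalSorter(deps).static_order():
        print(f"running {task}")
        order.append(task)
    return order


executed = run_pipeline(dependencies)
```

An orchestrator adds scheduling, retries, and monitoring on top of this ordering, but the dependency graph is the part that decides what may run when.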
Module 11 : Big Data
Students delve into big data to learn about
handling and analyzing vast datasets, using
tools like Hadoop, HDFS, Hive, and Pig for
insights and decision-making.
Sub module
Installing Hive, Installing MySQL Locally,
Running Hive Queries to Integrate the Local
and HDFS File Systems
Installing Pig
Working with Pig Scripts and Integrating
with the Local and HDFS File Systems
Installing HBase and Working with HBase Queries
Installing Cassandra and Working
with Cassandra
Sub module
Installing Sqoop and Flume and Performing
Data Migration
Local RDBMS to HDFS
Local RDBMS to Hive
Local RDBMS to HBase
HDFS to Local RDBMS
Hive to RDBMS
Module 12 : Kafka
Students learn about Kafka, an open-source stream
processing platform used for ingesting,
storing, processing, and distributing real-time data
streams. They explore Kafka's architecture, topics,
producers, consumers, and its role in handling large
volumes of data with low latency.
Introduction to Kafka
Producer, Consumer, Consumer Groups
Topics, Offsets, Partitions, Brokers
ZooKeeper, Replication
Batch vs Real-Time Streaming
Real-Time Streaming Process
Assignment and Task
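How topics, partitions, offsets, and consumer groups fit together can be shown with a toy in-memory model. This is a sketch only: ToyTopic and ToyConsumerGroup are hypothetical classes, not the Kafka client API, and the CRC32 hash used to pick a partition is a stand-in for whatever partitioner a real client uses.

```python
from collections import defaultdict
from zlib import crc32


class ToyTopic:
    """In-memory stand-in for a topic; not the Kafka client API."""

    def __init__(self, num_partitions):
        self.partitions = [[] for _ in range(num_partitions)]

    def produce(self, key, value):
        # Same key -> same partition, so per-key order is preserved.
        p = crc32(key.encode()) % len(self.partitions)
        self.partitions[p].append((key, value))
        return p, len(self.partitions[p]) - 1  # (partition, offset)


class ToyConsumerGroup:
    """Tracks one committed offset per partition for the whole group."""

    def __init__(self, topic):
        self.topic = topic
        self.offsets = defaultdict(int)

    def poll(self, partition):
        log = self.topic.partitions[partition]
        start = self.offsets[partition]
        records = log[start:]
        self.offsets[partition] = len(log)  # commit after reading
        return records


topic = ToyTopic(num_partitions=3)
p1, o1 = topic.produce("user-1", "login")
p2, o2 = topic.produce("user-1", "click")

group = ToyConsumerGroup(topic)
first = group.poll(p1)
second = group.poll(p1)  # nothing new since the last committed offset
print(first, second)
```

Both records for the same key land in one partition at consecutive offsets, and the second poll returns nothing because the group has already committed past them.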
Module 13 : Spark
Students will explore Spark, an open-source,
distributed computing framework that provides
high-speed, in-memory data processing for
big data analytics.
Sub module
Introduction to Apache Spark
Spark Architecture, Hadoop vs Spark
RDDs, DAG, Transformations, Actions
Data Partitioning and Shuffling
DataFrame & Spark SQL
Streaming Data Handling in Spark
Sub module
Spark Batch Data Processing (CSV,
JSON, Parquet files)
AWS Data Management Tools
[AWS EMR, Glue jobs]
Assignment & Assessments
Module 14 : Data Cleaning
Students engage in data cleaning to understand
the process of identifying and correcting errors or
inconsistencies in datasets, ensuring data
accuracy and reliability for analysis and reporting.
Structured vs Unstructured Data using Pandas
Common Data issues and how to clean them
Data Cleaning with Pandas and PySpark
Handling JSON Data
Meaningful Data Transformation (Scaling and
Normalization)
Example: Movies Data Set Cleaning
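Scaling and normalization, listed above, can be sketched in plain Python before moving to Pandas. The example below shows min-max scaling and z-score standardization on a few hypothetical movie runtimes with one missing value. Terminology varies between sources (some call min-max scaling "normalization"), and the function names here are assumptions, not a library API.

```python
from statistics import mean, pstdev


def min_max_scale(values):
    """Rescale values linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]  # constant column: nothing to scale
    return [(v - lo) / (hi - lo) for v in values]


def z_score(values):
    """Centre values on 0 with unit (population) standard deviation."""
    mu, sigma = mean(values), pstdev(values)
    if sigma == 0:
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


# Hypothetical movie runtimes in minutes, with one missing entry.
runtimes = [90, None, 120, 150]
cleaned = [r for r in runtimes if r is not None]  # drop missing values first

scaled = min_max_scale(cleaned)
standardized = z_score(cleaned)
print(scaled)
print(standardized)
```

Dropping the missing value is only one option; filling it with a mean or median is another common choice and changes the scaled results.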
Module 15 : Prometheus
Students study Prometheus to explore its role as
an open-source monitoring and alerting toolkit,
used for collecting and visualizing metrics from
various systems, aiding in performance
optimization and issue detection.
Sub module
Server, Architecture
Installation
Understanding the Prometheus UI
Node Exporters
PromQL (aggregations, functions, operators, data types)
Integrating Python with Prometheus
Counter, Gauge, Summary, Histogram
Recording Rules
Alerting Rules
Alertmanager, Installation of Alertmanager
Grouping, Inhibiting, Throttling,
Silencing Alerts
Sub module
Slack Integration with Prometheus and Alertmanager
PagerDuty Integration with Alertmanager
Blackbox Exporter, Installation
MySQL Exporter
Integrating AWS and Prometheus
AWS CloudWatch and Prometheus
Implementing Grafana Dashboards for Prometheus
Module 16 : Datadog
Metrics
Dashboards
Alerts
Monitors
Tracing
Logs monitoring
Integrations
Module 17 : Docker
Students learn about Docker to understand
containerization technology, which allows them to
package applications and their dependencies into
portable, efficient containers. Docker facilitates easy
deployment, scaling, and management of applications
across various environments.
What Is Docker
Installation of Docker
Docker Images, Containers
Dockerfile
Docker Volumes
Docker Registry
Containerizing an Application with Docker (Hands-on)
Module 18 : Kubernetes
Nodes
Pods
ReplicaSets
Deployments
Namespaces
Ingress
Hear it from our learners
“They are very approachable and friendly when
we ask any doubt or any clarification. Before
joining guvi I have already done a course of data
science in another institution. When comparing
these two institutions, there is a lot of difference
in teaching. I love that the mentor who is
teaching the course is not only a mentor but a
professional too. This is a very unique thing
about guvi. I will rate 5/5 to Guvi.”
Vishally
“GUVI is one of the best platforms to
start a new course and a new career.
Advanced Programming and Master Data
Science is one of the best programs
which are been trained with industry
experts. It has its own software to
practise and a huge number of exercises
to master any topic.”
Tejas Samanthapudi
“Guvi helps me to improve my self-confidence
in coding skills. The zoom classes are totally
comfortable, friendly and easy to learn. It helps
me to understand the basic and the core
concepts and it helped me to build logical
skills. I got great mentors which helped me to
bridge between the academics. I'm very proud.
Thanks to Guvi.”
Gokila
“I have attended several classes of Masters in Data
science course conducted by Guvi. It is really
helpful to gain knowledge as it is different from
other online courses. Here, we have mentors in live
sessions, so we will be more concentrated than
other online courses where we watch pre
recorded videos. Also we are getting weekly tasks
that would make us learn even if there is no class.
I am thankful for all the people in Guvi for building
up such a valuable program for our career.”
Gokul
“Guvi offers a cordial, supportive and friendly
environment to learners. With excellent support
and 24*7 assistance from the mentors guvi
does not leave any stone unturned to improvise
your learning. Thanks for being such an
inspiration to us.”
Gokak Mohd Ishtiyaque
“I always liked coding but I didn't really get a
good platform to learn things as per industrial
requirements. When I was in search I got to know
about Guvi, I really felt trustworthy by their
response. When I joined the Data Science course
the weekend live classes and recorded course
videos has made learning easy to me. Eventually
I started spending more time practicing in
Codekata. I loved the way Guvi took care of
clarifying doubts asap. Thank you!”
Sonia Kola
“Hello folks, if you are thinking of a career transition in
the ‘Data Science’ field then, “GUVI” is the best
platform to get nourished, indulged and protruded in
this upcoming field and also, it doesn’t matter from
which engineering background you are or whether
you are a working fellow. The best thing I found here
is you will always get motivated unknowingly and
become curious to learn more & more from the
tutorial videos conducted by the IITM professors.
GUVI helps me to think about the problem in
multidimensional ways. Thanks to the GUVI team”
Shubham Nehete
“The datascience course is very good,
the concepts are being explained in a
crisp manner. The instructors have good
depth in the subject and solve every
doubt one might have. Thanks to GUVI for
setting a great structured program.”
Diliban Sibi
“The course videos help you to learn the tools by
yourself and you can track the progress. The
mentors are very patient and ensure that
students understand the concept, sometimes
going the extra mile and explaining. Sometimes
the mentors try to teach in your native language, if
needed. The practice platforms are easy to learn
and practice. By completing this data science
course, sure you can become a Data Scientist.”
Sridharan K
“This course is designed being dynamic, interactive
and range of materials to refer. This is very well
structured in such a way that it makes the
participants to perform, discuss, and to participate in
assessments that will help the participants to
maximize the utilization. This program is suitable for all
students, freshers and working professionals. This
course is excellent for those who would like to learn
the basics of program like Python and would like to
broaden their knowledge in Data Science. I enjoyed
seeing videos in GUVI website from experts that also
explains the concepts in a detailed manner.”
Anbazhagan
Instructors
Learn from India’s top Industry Leaders
Dr Sanatan Sukhija
Doctorate - IIT-Ropar
Dr Yayati Gupta
Doctorate - IIT-Ropar
Shabarinath Premlal
Founder of ResPro Labs
Mr Koushik Krishnan
Data Science Analyst at Credit Suisse
Mr Nethaji Nirmal
Co-Founder, Webdojo
Mr Revanth Guthala
Lead Data Analyst, Yulu
Mr Bala Chandar Mr Abhishek
Data Scientist,
Data Scientist, Bosch
US-based client
Our Placements
"I got a 57% Hike,
Thank you GUVI Team"
Sonia Kola
Data Scientist
“Every topic was
covered from scratch”
Rakesh
Python Developer
Program Details
5-Months Weekend Live Online Classes
Please contact our Data Engineering coordinator
Deepak: +91-97360 97320
Total Course Fee ₹1,23,900
Pre-BootCamp Booking Fees -₹8000
Remaining Fee ₹1,15,900
Become a proficient Data Engineer with affordable
installments! Master Data Engineering
at just ₹10893/month
for up to 12 months.
Note: Valid documents are required for the EMI process.
An additional processing fee will be applied. The EMI amount
might vary with vendors.
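The fee figures quoted in this brochure can be checked with a few lines of arithmetic. The sketch below assumes the full 12-month tenure at the quoted instalment and assumes the instalment is applied to the remaining fee. The brochure does not state an interest rate or processing fee, so the difference shown is only arithmetic on the quoted numbers, and actual amounts may vary with vendors.

```python
course_fee = 123_900    # total course fee quoted in the brochure (in rupees)
booking_fee = 8_000     # refundable pre-bootcamp booking fee (in rupees)
emi_per_month = 10_893  # quoted instalment (rupees per month)
emi_months = 12         # assumption: the full "up to 12 months" tenure

remaining_fee = course_fee - booking_fee
emi_total = emi_per_month * emi_months
# Assumption: the instalment plan covers the remaining fee, not the full fee.
emi_extra = emi_total - remaining_fee

print("Remaining fee:", remaining_fee)
print("Total paid over", emi_months, "instalments:", emi_total)
print("Difference vs remaining fee:", emi_extra)
```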
No Eligibility Restrictions!
Open to students and working professionals seeking
opportunities to upskill their Data Engineering
proficiency for faster career growth.
Develop your Data
Engineering skills
&
Unlock a challenging &
rewarding Career
Begin your Skill Development Journey Today!
For further information:
Deepak@guvi.in
+91 9736097320
IITM Research Park - Phase 2, Module #9, 3rd Floor, D Block,
Kanagam Rd, Tharamani, Chennai, Tamil Nadu, India. 600113