KEMBAR78
Roadmap For Jobs | PDF | Statistics | Data Warehouse
0% found this document useful (0 votes)
5 views10 pages

Roadmap For Jobs

The document outlines a comprehensive curriculum for Data Science and Data Engineering, covering topics such as Python, SQL fundamentals, data warehousing, and data analytics. It includes resources like video playlists, online tutorials, and books for further learning. Additionally, it provides helpful links for interview preparation and tips for aspiring data professionals.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as XLSX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
5 views10 pages

Roadmap For Jobs

The document outlines a comprehensive curriculum for Data Science and Data Engineering, covering topics such as Python, SQL fundamentals, data warehousing, and data analytics. It includes resources like video playlists, online tutorials, and books for further learning. Additionally, it provides helpful links for interview preparation and tips for aspiring data professionals.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as XLSX, PDF, TXT or read online on Scribd
You are on page 1/ 10

Data Science Data Engine

Advanced Python SQL Fundame


SELECT, ORDER BY, Inli
WHERE, Filtering with Conditions, BET
Filter with Subqu
Iterators
CASE Statem
Collections
Regular Expressions Joins, UNION, UNION ALL, EX
Decorators
Lambdas Aggregating Results - GROUP BY, G
OOPS - Classes, Inheritance, Abstract Classes, Encapsulation COUNT, COUNT DISTINCT, Filter w
Unit Test and Pytest Frameworks Aggregate Fun
Design Patterns (OOP design principles, the Singleton, Factory, Regex - LIKE, SIMILAR TO, P
Facade, Proxy, Observer, Command, Template Method, and State
Design patterns) Window Functions and Subqueries
Python-SQL and Python-NoSQL database access (relational and DENSE RANK, NTILE, LAG AND
non-relational databases, CRUD, object-relational mapping: ORM)
Best practices, Standardization, and Coding Conventions Date and Time Functions - EXTRA
DATE_ADD and DATE_SUB, DATEDI
Functions in Aggregate S
Window Funct
Video Resources: Video Resources:
https://www.youtube.com/playlist? https://www.youtube.com/playlist?list=P
list=PLqnslRFeH2UqLwzS0AwKDKLrpYBKzLBy2 https://www.youtube.com/watch?v=Ww
https://www.youtube.com/playlist?list=PLzMcBGfZo4- https://www.youtube.com/playlist?list=P
kwmIcMDdXSuy_wSqtU-xDP BOYibnN0QqIPFbMlS01bw8x9g07Ll
https://www.youtube.com/playlist?list=PL7yh-
TELLS1FuqLSjl5bgiQIEH25VEmIc Online Resources:
https://mode.com/sql-tutorial/
Online Resources: https://www.geeksforgeeks.org/30-day
https://roadmap.sh/python advanced-level/
https://www.geeksforgeeks.org/python-programming-language/ https://gorillalogic.com/blog/sql-for-data
https://www.101computing.net/category/python- https://roadmap.sh/sql
intermediate/ https://www.101computing.net/category/python-
advanced/ https://realpython.com/learning-paths/ Books:
SQL for Data Scientists -- A Beginner's
Datasets for Analysis - Renee M Teate

Python Libraries
Database Fundam
NumPy
SciPy
Scikit-learn ER Model, Schema Refinement an
Pandas Transactions, Transaction Managenm
Crash Recovery, CAP Theorem, OTL
Matplotlib Vertical Scaling, Dimens
PyTorch
TensorFlow
Keras
Pandas
Matplotlib
PyTorch
TensorFlow
Keras Video Resources:
https://www.youtube.com/watch?v=4cW
Video Resources:
https://www.youtube.com/watch?v=LHBE6Q9XlzI Online Resources:
https://www.youtube.com/watch?v=V_xro1bcAuA&t=43348s https://www.geeksforgeeks.org/dbms/
https://www.youtube.com/playlist?
list=PLeo1K3hjS3uu7CxAacxVndI4bE_o3BDtO Books:
Database Management Systems - Rag
Online Resources:
https://www.geeksforgeeks.org/python-programming-language/
https://www.machinelearningplus.com/python/101-pandas-
exercises-python/
https://www.machinelearningplus.com/python/101-numpy-
exercises-python/

Data Warehousing and D


Data Warehouse ar
Data Warehouse infr
Data Modelin
Setting up an ETL
Dimensional Modeling: Fac
Slowly Changing Di
Understanding ET
ELT vs. ET
Advanced topics: Columnar storage,
databases, massive parallel processin
Optimizing a data warehouse using
Bitmap index
Practically using and connectin

Video Resources:
https://www.udemy.com/course/data-w
beginners/learn/lecture/17618900#ove
https://www.udemy.com/course/data-w
guide/learn/lecture/32244268#overview

Online Resources:
https://www.geeksforgeeks.org/dbms/

Books:
Database Management Systems - Rag
Data Engineering Data Analytics
SQL Fundamentals Statistics Fundamentals
SELECT, ORDER BY, Inline Calculations
tering with Conditions, BETWEEN, IN, LIKE, IS NULL,
Filter with Subqueries
Descriptive Statistics: Mean, Median, Mode, Variance, Stan
CASE Statement
Deviation
s, UNION, UNION ALL, EXCEPT, INTERSECT Data Types: Categorical, Numerical, Ordinal, Interval, Rat
Data Distribution: Normal, Binomial, Poisson, etc.
g Results - GROUP BY, Group Summary, MIN, MAX, Inferential Statistics: Hypothesis Testing, ANOVA, Chi-squ
COUNT DISTINCT, Filter with HAVING, CASE inside Correlation vs. Causation
Aggregate Function Central Limit Theorem, Conditional Probability and P-Valu
gex - LIKE, SIMILAR TO, POSIX Comparators Significance of Hypothesis Testing, Random Variables, Ba
Theorem, PDF (Probability Distribution Function), CDF
Functions and Subqueries - ROW NUMBER, RANK, (Cumulative Distribution Function), Linear Regression an
E RANK, NTILE, LAG AND LEAD, Window Alias, Ordinary Least Squares (OLS), Gauss-Markov Theorem
Hypothesis Testing and Statistical Significance, Type I & Ty
nd Time Functions - EXTRACT and DATE_PART, Errors, Statistical tests (Student’s t-test, F-test), p-value and
and DATE_SUB, DATEDIFF, TIMESTAMPDIFF, Date limitations
Functions in Aggregate Summaries and
Window Functions
urces: Video Resources:
youtube.com/playlist?list=PL6EDEB03D20332309 https://www.youtube.com/playlist?list=PLqzoL9-
youtube.com/watch?v=Ww71knvhQ-s eJTNBZDG8jaNuhap1C9q6VHyVa
youtube.com/playlist?list=PLgR- https://www.youtube.com/watch?v=LZzq1zSL1bs
qIPFbMlS01bw8x9g07Ll
Online Resources:
ources: https://www.geeksforgeeks.org/statistics/
.com/sql-tutorial/ https://www.khanacademy.org/math/statistics-probability
geeksforgeeks.org/30-days-of-sql-from-basic-to- https://www.udemy.com/course/statistics-for-data-
vel/ science-and-business-analysis/learn/lecture/7592230#overvie
alogic.com/blog/sql-for-data-engineering
map.sh/sql Books:
Practical Statistics for Data Scientists - Peter Bruce, Andrew
Bruce, and Peter Gedeck
a Scientists -- A Beginner's Guide for Building
Analysis - Renee M Teate
Power BI

Database Fundamentals
Data connection: learn the types of data connectors and b
practices, Basic table transformations, Text-specific, numb
del, Schema Refinement and Normal Forms, ACID specific, and date-specific tools, Index and conditional colum
ns, Transaction Managenment, Concurrency Control, Grouping & aggregating data, Pivoting and unpivoting tabl
overy, CAP Theorem, OTLP vs OLAP, Horizontal vs Merging & appending queries, Defining hierarchies
Vertical Scaling, Dimensional Modeling Creating Data Model: Database normalization, Data tables
lookup tables, Schema types (e.g. Star and snowflake), Crea
table relationships, Understanding filter flow, Relationshi
cardinality, Managing and editing relationships, Active vs. ina
relationships, Connecting multiple data tables
DAX: What DAX is and its best practices, Calculated colum
Measures, including implicit vs explicit measures, Filter con
Grouping & aggregating data, Pivoting and unpivoting tabl
Merging & appending queries, Defining hierarchies
Creating Data Model: Database normalization, Data tables
lookup tables, Schema types (e.g. Star and snowflake), Crea
table relationships, Understanding filter flow, Relationshi
urces: cardinality, Managing and editing relationships, Active vs. ina
youtube.com/watch?v=4cWkVbC2bNE relationships, Connecting multiple data tables
DAX: What DAX is and its best practices, Calculated colum
ources: Measures, including implicit vs explicit measures, Filter con
geeksforgeeks.org/dbms/ examples, Step-by-step measure calculation, DAX syntax a
operators, Common DAX function categories
Visualize Data with Reports: Adding simple objects, Inser
anagement Systems - Raghu Ramakrishnan basic charts and visuals, Formatting options, Report filteri
options, Editing report interactions, Drillthrough filters, Rep
bookmarks, Managing and viewing roles (RLS), Parameters
AI Visuals: The Q&A visual, The key influencers visual, T
decomposition tree visual
Advanced DAX: The DAX engines, Scalar functions, Tabl
Filter functions, Calculated Table Joins, Relationship functio
Advanced Time Intelligence
Power BI Service: Power BI Service administration, D
Data Warehousing and Data Pipeplines connections, Reports and dashboards, Sharing and collabora
Data Warehouse architecture Row-level security, Premium Per User (PPU)
Data Warehouse infrastructure
Data Modeling
Setting up an ETL process Video Resources:
Dimensional Modeling: Facts & Dimensions https://www.youtube.com/watch?v=77jIzgvCIYY
Slowly Changing Dimensions https://www.youtube.com/watch?v=e6QD8lP-m6E
Understanding ETL tools https://www.udemy.com/course/15-days-of-power-bi/learn/lec
ELT vs. ETL 29566712#overview https://www.udemy.com/course/70-778
topics: Columnar storage, OLAP Cubes, In-memory analyzing-and-visualizing-data-with-power-bi/learn/lecture/
massive parallel processing & cloud data warehouses 13971936#overview
g a data warehouse using indexes (B-tree indexes &
Bitmap indexes) Online Resources:
tically using and connecting a data warehouse https://learn.microsoft.com/en-us/training/powerplatform/powe
https://learn.microsoft.com/en-us/power-bi/

urces:
udemy.com/course/data-warehouse-fundamentals-for-
arn/lecture/17618900#overview
udemy.com/course/data-warehouse-the-ultimate-
ecture/32244268#overview

ources:
geeksforgeeks.org/dbms/

anagement Systems - Raghu Ramakrishnan


alytics
ndamentals

ian, Mode, Variance, Standard


ion
erical, Ordinal, Interval, Ratio
Binomial, Poisson, etc.
Testing, ANOVA, Chi-square

onal Probability and P-Value,


ng, Random Variables, Bayes
istribution Function), CDF
on), Linear Regression and
), Gauss-Markov Theorem,
l Significance, Type I & Type II
s t-test, F-test), p-value and its
ons

st=PLqzoL9-

=LZzq1zSL1bs

istics/
h/statistics-probability
/course/statistics-for-data-
n/lecture/7592230#overview

sts - Peter Bruce, Andrew

r BI

s of data connectors and best


ations, Text-specific, number-
ndex and conditional columns,
ivoting and unpivoting tables,
Defining hierarchies
normalization, Data tables vs.
. Star and snowflake), Creating
ding filter flow, Relationship
elationships, Active vs. inactive
e data tables
ractices, Calculated columns,
xplicit measures, Filter context
ivoting and unpivoting tables,
Defining hierarchies
normalization, Data tables vs.
. Star and snowflake), Creating
ding filter flow, Relationship
elationships, Active vs. inactive
e data tables
ractices, Calculated columns,
xplicit measures, Filter context
e calculation, DAX syntax and
on categories
dding simple objects, Inserting
atting options, Report filtering
ns, Drillthrough filters, Report
g roles (RLS), Parameters
e key influencers visual, The

nes, Scalar functions, Table &


Joins, Relationship functions,

I Service administration, Data


ards, Sharing and collaboration,
mium Per User (PPU)

=77jIzgvCIYY
=e6QD8lP-m6E
-days-of-power-bi/learn/lecture/
udemy.com/course/70-778-
h-power-bi/learn/lecture/

aining/powerplatform/power-bi
ower-bi/
Company Name Job Title Link to Apply
Last Date to Apply Venkat Satyam
Y - Yes, N - No
Sanket Neeraj
Y - Yes, N - No
HELPFUL RESOURCES
Interview Preparation and Tips
1. Data Science And Data Engineer Interview Practice Guides
DE - https://docs.google.com/spreadsheets/d/1GOO4s1NcxCR8a44F0XnsErz5rYDxNbHAHznu4pJMRkw/edit?pli=1#gid=0
DS - https://docs.google.com/spreadsheets/d/1djhTq4vD72lzuLY2rCMOkkSuNG2rRf_C5PwNMjcIAMk/edit#gid=859146723

2. Cracking The FAANG Interview - How I Cracked The FAANG Interview As A Data Engineer
https://www.youtube.com/watch?v=kAoNrYJk6u8

3. Solving Coding Interview Questions in Python on LeetCode (easy & medium problems) by Keith Galli
https://www.youtube.com/watch?v=qnSF8YaPx78

4. Coding Interview for Data Scientists | Python Questions | Data Science Interview by Emma Ding
https://www.youtube.com/watch?v=hAqg2dlNeUc

5. What I Learned From 100+ Data Engineering Interviews - Interview Tips


https://www.youtube.com/watch?v=bqCXVpRqTpE

6. SQL Interview Tips And Questions For Data Scientists And Data Engineers
https://seattledataguy.substack.com/p/3-sql-interview-tips-and-questions

7. 5 Concepts in Statistics You Should Know | Data Science Interview by Daniel Lee
https://www.youtube.com/watch?v=jwlhScL3uBc

8. Data Science Interview Questions - Data Lemur


https://datalemur.com/questions

9. How to Create Github Portfolio


https://github.com/katiehuangx/How-to-Create-a-GitHub-Portfolio/blob/main/README.md#how-to-create-your-profile

10. Find Mentors for Guidance


https://www.preplaced.in/explore-mentors

11. Data Analyst Interview Prep


https://www.interviewquery.com/p/how-to-prepare-data-analyst-interview
https://www.upgrad.com/blog/data-analyst-interview-questions-and-answer/
https://www.youtube.com/watch?v=9VQAwhp27eU

You might also like