DATASCIENCE WITH AI - Syllabus
Course Duration – 6 Months
Greens Technologies – Velachery
Module 1 - Python Programming Language
Introduction & Setup
Python Installation – Execution- Interpreter vs Compilers- Interactive
Mode and Script Mode Programming-Running Python Files – Working
with python shell- Integrated Development Environments(IDES)
Level - 1 Basics of Python
Print Statements – Sep and End Parameter – f- strings –String
Concatenation – Escape Sequences - Comments
Data Types – Variables – Numbers – Strings - Accessing Values in
Strings - String Special Operators- String Formatting Operator-Triple
Quotes- Built-in String Operations
Operators – Arithmetic Operators – Assignment Operators-Logical
Operators-Bitwise Operators-Python Membership Operators (in, not in)
- Python Identity Operators (is, is not)
Lists - Accessing Values in Lists - Updating Lists -Delete List Elements
-Basic List Operations - Indexing, Slicing, and Matrixes -Built-in List
Functions & Methods
Tuples - Accessing Values in Tuples - Updating Tuples - Delete Tuple
elements-Basic Tuples Operations -Indexing, Slicing, and Matrixes -
Built-in Tuple Functions
Dictionary - Accessing Values in Dictionary - Updating Dictionary -
Delete Dictionary Elements - Properties of Dictionary Keys - Built-in
Dictionary Functions & Methods
Sets – Sets operations – Union – Intersection – Set Methods – Set
Comprehensions
Decision Making and Looping Statements - if statements - if...else
statements - nested if statements- while loop -for loop -nested loops -
Loop Control Statements - break statement- continue statement- pass
statement
Functions - Function Definition - Calling a Function - Pass by
reference vs value - Function Arguments- Required arguments -
Keyword arguments -Default arguments- Variable-length arguments -
The return Statement - Scope of Variables -Global vs Local variables -
Recursions
Level – 2 Intermediate Python
OOPs Concepts - Class and Objects- Constructor –Inheritance-
Polymorphism- Abstraction and Encapsulation
Modules and Packages - Framework vs Packages -Creating modules -
The import Statement - The from...import Statement - The from...import
* Statement - Locating Modules -The PYTHONPATH Variable -
Namespaces and Scoping - The dir( ) Function -The globals() and
locals() Functions -The reload() Function - Packages in Python
Level 3 – Advance Python
Anonymous Function – Lambda – Map – Filter – Reduce – Decorator –
Iterator and Generator
Zip functions – Pickle Module – List Comprehensions – Collections
Module – Regular Expressions – Handling JSON Data
Exception Handling-Exception - The except Clause with No
Exceptions - The except Clause with Multiple Exceptions -The try-
finally Clause
SQL / MySQL – CRUD Operations – Constraints – Primary key –
Foreign Key – Unique – Joins – Aggregate Functions – Group By and
Having Clauses – Indexing – Views –Stored Procedures and Functions –
Triggers- Normalization – Transactions
Connecting SQL Databases with Python – Using Python Pandas to
execute SQL Queries and Manipulate Data
Data Warehousing Introduction – ETL(Extract, Transform, Load)
Processes
Hands – on Projects – Module 1
✓ Develop a student management system and Bank Management
Systems using Object Oriented Programming Concepts
✓ Develop a simple chatbot that can respond to basic queries
✓ Build a simple blog applications where users can create, read,
update, and delete blog posts
✓ Building and Querying databases for data analysis tasks
✓ Analyze the Products/Companies Dataset to find insights about the
Products and Companies
Module 2 – STATS AND PROBABILITY
Data science Introduction - An Introduction--Business Analytics, Data
Analytics, and Data Science - Careers in Data Science Fields
Descriptive Stastistics – Measure of central tendency
(Mean,Median,Mode), Measure of dispersion(Variance, Standard
deviation), Skewness and Kurtosis
Statistics and Probability – Permutations - Combinations -
Intersections Unions and Complements - Independent and Dependent
Events - Addition and Multiplication Rules - Bayes Theorem
Distributions – Uniform Distribution –Binomial Distribution – Poisson
Distribution – Normal Distribution
Statistics – Sampling - Hypothesis Testing - Type 1 and Type 2 Errors
- Correlation Matrix
Regression Analysis – Simple linear Regression – Multiple linear
Regression – Logistic Regression(classification problems)
Introduction to ANOVA – F-Distribution – Chi Square Analysis
Module 3 – Data Analysis Using Python
Introduction to Data Analysis – Python libraries – Data Wrangling and
Preprocessing (Handling missing data) – Data transformation- Handling
categorical data
Introduction to Numpy – Numpy Array – Numpy Indexing and slicing
–Aggregate Functions – Linear Algebra Operations – Numpy Operations
- Masked arrays for handling missing or invalid data
Introduction to Pandas – Data Manipulation with Pandas - Series –
Data Frames – Missing Data – Groupby - Operations – Merging,
Joining and Concatenating
Hands – on Projects – Module 3
✓ Develop a Sentiment Analysis of Customer Product Reviews
Using Pandas for data manipulation and Numpy for Text
Processing
✓ Analyze e-commerce data using Pandas and Numpy
Module 4 – Data Visualization Using Python
Introduction to Matplotlib – Drawing Graph – Histogram –Plotting –
Box Plot etc
Introduction to Seaborn – Distribution – Categorical Plots – Matrix
Plots- Regression Plots – Grids – Style and Colors
Time Series Analysis – Pandas – Date time Index – Time shifts – ETS
Theory – ARIMA Theory
Introduction to Tableau – Data Connection and Preparation – Types of
Visualization – Mapping and Geographic Visualization – Integration
with Python
Hands – on Projects – Module 4
✓ Build a dashboard for exploratory data analysis of a dataset using
matplotlib and seaborn
✓ Analyze movie ratings and review dataset using Matplotlib and
Seaborn to visualize trends in viewer ratings, genre preferences
✓ Develop a dashboard to visualize COVID-19 data in Tableau
Module 5 – Machine Learning
Introduction to Machine Learning – Machine Learning Approach –
How Supervised -Unsupervised Learning and Reinforcement Models
Work – Scikit – Learn – Supervised and Unsupervised Learning Models
– Linear – Logistic Regressions - Decision Trees and ensemble methods
– K-NN Model – K Means Algorithm – Clustering – Model Evaluation
Introduction to Neural Networks and Deep Learning – Tensorflow
Frameworks – setup and installing – Building and Training Neural
Networks – Keras - Convolutional Neural Networks(CNNs) – Recurrent
Neural Networks(RNNs) – Transfer learning Techniques – TensorFlow
Hub – NLP With TensorFlow - Model Deployment and Serving
Hands – on Projects – Module 5
✓ Build a conversational AI Chatbot using NLP Techniques
✓ Develop a personalized recommendation system for
movies/products using content-based filtering techniques
✓ Build a model for medical image analysis using deep learning
techniques
✓ Develop a sentiment analysis model using BERT model
Module 6 – Natural Language Processing With Scikit-Learn
Introduction to NLP – NLP Environment Setup - NLP Applications –
Install and Working With NLP Libraries – Natural language
toolkit(NLTK) – Working with spaCy – Working with TextBlob –
Integration with Scikit-Learn – Building pipelines integrating NLP
Preprocessing with ML Algorithms
Hands – on Projects – Module 6
✓ Create a text classification on stack overflow questions (Using
Scikit-Learn)
✓ Develop a Fake News Detection using TensorFlow
✓ Develop a Spell and Grammar Checking using NLP
Module 7 – Generative AI
Introduction to Generative AI - Generative Adversarial
Networks(GANs) – BERT Model – TTS – Attention
Mechanisms(Transformers) – Sequence Generation with Recurrent
Neural Networks(RNNs)
Hands – on Projects – Module 7
✓ Develop a Image Generation using Variational
Autoencoders(VAEs)
✓ Build a Text Generation Projects using generative AI
Module 8 - Git and GitHub
Introduction to Version Control – Git Basics – Working with GitHub –
Branching and Merging – Collaboration and Code Review – GitHub
Workflow
Project Work
1) Customised Chat Bot (Tools: Python, NLTK, Hugging Face,
Tensor Flow)
2) Image and Caption Generator (Tools :Python, OpenCV,
PyTorch)
3) Predictive Analysis for Sales Forecasting (Tools : Python,
Pandas, Scikit Learn, ARIMA, Prophet)
4) Sentiment Analysis of Customer/Product/Movie Reviews
(Tools : Python, TextBlob, NLTK, Scikit-Learn, PyTorch)
5) Recommendation System for E-Commerce (Tools : Python ,
Pandas, Scikit-Learn, TensorFlow)
Job Opportunities:
✓ Data Scientist
✓ Data Analyst
✓ AI Engineer
✓ Machine Learning Engineer
✓ Data Engineer