KEMBAR78
Null-Data Science Python Syllabus | PDF | Python (Programming Language) | Chi Squared Test
0% found this document useful (0 votes)
16 views10 pages

Null-Data Science Python Syllabus

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
16 views10 pages

Null-Data Science Python Syllabus

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 10

Data Science with Python Syllabus

SECTION I --- Python Basics


Lesson 1: Overview
• Why do we need Python?
• Program structure
Environment Setup
• Python Installation
• Execution Types
o What is an interpreter?
o Interpreters vs Compilers
o Using the Python Interpreter
o Interactive Mode
o Running python files
o Working with Python shell
o Integrated Development Environments (IDES)
o Interactive Mode Programming
o Script Mode Programming
Lesson 2 : Basic Concepts
• Basic Operators
o Types of Operator
o Python Arithmetic Operators
o Python Comparison Operators
o Python Assignment Operators
o Python Bitwise Operators
o Python Logical Operators
o Python Membership Operators (in, not in)
o Python Identity Operators (is, is not)
o Python Operators Precedence
• Data Types
o Variables
o Assigning Values to Variables
o Multiple Assignment
o Python Numbers
o Python Strings
▪ Accessing Values in Strings
▪ String Special Operators
▪ String Formatting Operator
▪ Triple Quotes
▪ Built-in String Operations
o Python Lists
▪ Accessing Values in Lists
▪ Updating Lists
▪ Delete List Elements
▪ Basic List Operations
▪ Indexing, Slicing, and Matrixes
▪ Built-in List Functions & Methods
o Python Tuples
▪ Accessing Values in Tuples
▪ Updating Tuples
▪ Delete Tuple Elements
▪ Basic Tuples Operations
▪ Indexing, Slicing, and Matrixes
▪ No Enclosing Delimiters
▪ Built-in Tuple Functions
o Python Dictionary
▪ Accessing Values in Dictionary
▪ Updating Dictionary
▪ Delete Dictionary Elements
▪ Properties of Dictionary Keys
▪ Built-in Dictionary Functions & Methods
Lesson 3:Loops and Decision Making
o if statements
o if...else statements
o nested if statements
o while loop
o for loop
o nested loops
o Loop Control Statements
o 1) break statement
o 2) continue statement
o 3) pass statement
Lesson 4 :Functions
o Defining a Function
o Syntax
o Calling a Function
o Pass by reference vs value
o Function Arguments
o Required arguments
o Keyword arguments
o Default arguments
o Variable-length arguments
o The return Statement
o Scope of Variables
o Global vs. Local variables
Lesson 5: Basic OOPs Concept
o Creating class in Python
o Documented String
o Private Identifier
o Constructor
o Inheritance
o Polymorphism

Lesson 6 : Python Modules and Packages


o Framework vs Packages
o Folium Introduction
o Why are modules used?
o Creating modules
o The import Statement
o The from...import Statement
o The from...import * Statement
o Locating Modules
o The PYTHONPATH Variable
o Namespaces and Scoping
o The dir( ) Function
o The globals() and locals() Functions
o The reload() Function
o Packages in Python
Lesson 7: Advance Python
• Decorator, Iterator and Generator
• Anonymous Function
o Lambda
o Map
o Filter
o Reduce
SQL and Python
• Overview of SQLite
• Integrating Python with SQLite
• Errors and Exception Handling
o Standard exceptions
o Assertions in Python
o The assert Statement
o What is Exception?
o Handling an exception
o Syntax
o The except Clause with No Exceptions
o The except Clause with Multiple Exceptions
o The try-finally Clause
o Argument of an Exception
o Example with Tkinter Application
Section II -- Statistics and Data Science Overview

Lesson 8 : Data Science Overview

⚫ Data Science Disciplines


◼ Data Science and Business Buzzwords Why are there so many
◼ What is the difference between Analysis and Analytics
◼ An Introduction--Business Analytics, Data Analytics, and Data Science
◼ Data Science Diagram
◼ Introduction -- BI, ML and AI
◼ Careers in Data Science Fields

⚫ Data Overview
◼ What is Data
◼ Measuring Data
◼ Measurement of Central Tendency
◼ Measurements Dispersion
◼ Measurement Quartile
◼ Bi-variate Data and Co-variance
◼ Pearson Correlation Coefficient

⚫ Lesson 9 : Probability
◼ What is Probability
◼ Permutations
◼ Combinations
◼ Intersections Unions and Complements
◼ Independent and Dependent Events
◼ Conditional Probability
◼ Addition and Multiplication Rules
◼ Bayes Theorem

⚫ Lesson 10: Distributions


⚫ Introduction to Distributions
⚫ Uniform Distribution
⚫ Binomial Distribution
⚫ Poisson Distribution
⚫ Normal Distribution

⚫ Lesson 11:Statistics
◼ What is Statistics
◼ Sampling
◼ Central Limit Theorem
◼ Standard Error
◼ Hypothesis Testing
◼ Hypothesis Testing Example Exercise
◼ Type 1 and Type 2 Errors
◼ Students T Distribution
◼ Practical Example Descriptive Statistics Exercise
◼ What are Confidence Intervals
◼ Correlation Matrix

⚫ Lesson 12:Anova
◼ Introduction to ANOVA
◼ Two Way ANOVA Overview
◼ F- Distribution

⚫ Lesson 13:Chi Square Analysis


◼ Chi-Square Analysis
◼ Chi Squared Analysis - Exercise Example
Section III -- Python for Data Analysis
⚫ Lesson 14 : Python: Environment Setup and Essentials
◼ Introduction to Anaconda
◼ Installation of Anaconda Python Distribution – For Windows, Mac OS, and Linux
◼ Jupyter Notebook Installation
◼ Jupyter Notebook Introduction

⚫ Lesson 15:Data Analysis- Numpy


◼ Introduction to Numpy
◼ Numpy Array
◼ Numpy Indexing
◼ Numpy Operations
◼ Broadcasting Numpy Array

⚫ Lesson 16:Data Analysis -- Pandas


◼ Introduction to Pandas
◼ Series
◼ Data Frames
◼ Missing Data
◼ Groupby
◼ Operations
◼ Merging, Joining and concatenating
◼ Missing Data
◼ Data Input and Output

⚫ Lesson 17:Pandas Exercise


◼ Salaries Exercise
◼ Ecommerce Purchases Exercise

⚫ Lesson 18:Numpy Exercise


◼ Solving Linear System
◼ Problem Set

Section IV -- Python for Data Visualization

⚫ Lesson 19:Matplotlib

◼ Introduction
◼ Matplotlib Drawing Graph -- Histogram, Plotting, Box Plot etc
◼ Exercise

⚫ Lesson 20:Seaborn
◼ Introduction
◼ Distribution
◼ Categorical Plots
◼ Matrix Plots
◼ Regression Plots
◼ Grids
◼ Style and Colors
◼ Exercise

⚫ Lesson 21:Data Visualization with Pandas


◼ Pandas Built-in Data Visualization
◼ Pandas Data Visualization Exercise

⚫ Lesson 22:Data Visualization - Geographical Plotting


◼ Introduction to Geographical Plotting
◼ Choropleth Maps - Part 1 - USA
◼ Choropleth Maps - Part 2 - World
◼ Choropleth Exercises
⚫ Capstone Project I
◼ Calls Data Capstone Project
◼ Finance Project

⚫ Lesson 23:Time Series Analysis


◼ Pandas for Time Series
◼ Introduction to Time Series with Pandas
◼ Date time Index
◼ Time Re-sampling
◼ Time Shifts
◼ Pandas Rolling and Expanding
◼ Time Series Analysis
◼ Introduction to Time Series
◼ Time Series Basics
◼ Introduction to Statsmodel
◼ ETS Theory
◼ EWMA Theory
◼ ARIMA Theory
◼ ACF and PACF
◼ ARIMA with Statsmodel

⚫ Capstone Project II
◼ Stock Market Analysis Project

⚫ Lesson 24:Scientific computing with Python (Scipy)


◼ SciPy and its Characteristics
◼ SciPy sub-packages
◼ SciPy sub-packages –Integration
◼ SciPy sub-packages – Optimize
◼ Linear Algebra
◼ SciPy sub-packages – Statistics
◼ SciPy sub-packages – Weave
◼ SciPy sub-packages – I O

⚫ Lesson 25:Data Science with Python Web Scraping

◼ Web Scraping
◼ Common Data/Page Formats on The Web
◼ The Parser
◼ Importance of Objects
◼ Understanding the Tree
◼ Searching the Tree
◼ Navigating options
◼ Modifying the Tree
◼ Parsing Only Part of the Document
◼ Printing and Formatting
◼ Encoding
Section V -- Machine Learning
⚫ Lesson 26: Machine Learning with Python (Scikit–Learn)
◼ Introduction to Machine Learning
◼ Machine Learning Approach
◼ How Supervised and Unsupervised Learning Models Work
◼ Scikit-Learn
◼ Supervised Learning Models – Linear Regression
◼ Supervised Learning Models: Logistic Regression
◼ K Nearest Neighbors (K-NN) Model
◼ K Means Algorithm
◼ SVMs
◼ Unsupervised Learning Models: Clustering
◼ Unsupervised Learning Models: Dimensionality Reduction
◼ Pipeline
◼ Model Persistence
◼ Model Evaluation – Metric Functions
⚫ Lesson 27: Natural Language Processing with Scikit-Learn
◼ NLP Overview
◼ NLP Approach for Text Data
◼ NLP Environment Setup
◼ NLP Sentence analysis
◼ NLP Applications
◼ Major NLP Libraries
◼ Scikit-Learn Approach
◼ Scikit – Learn Approach Built – in Modules
◼ Scikit – Learn Approach Feature Extraction
◼ Bag of Words
◼ Extraction Considerations
◼ Scikit – Learn Approach Model Training
◼ Scikit – Learn Grid Search and Multiple Parameters
◼ Pipeline

⚫ Lesson 28: Python integration with Hadoop, MapReduce and Spark


◼ Need for Integrating Python with Hadoop
◼ Big Data Hadoop Architecture
◼ MapReduce
◼ Cloudera QuickStart VM Set Up
◼ Apache Spark
◼ Resilient Distributed Systems (RDD)
◼ PySpark
◼ Spark Tools
◼ PySpark Integration with Jupyter Notebook

Section VI: Project Works

Project 1-- Board Game Review Prediction -- To perform a Linear Regression Analysis by
predicting the average reviews in a board game
Project 2 -- Credit Card Fraud Detection -- To focus on Anomaly Detection by using
probability densities to detect credit card fraud
Project 3 – Stock Market Clustering – Learn how to use the K-means clustering algorithm to find
related companies by finding correlations among stock market movements over a given time span
Project 4 – Getting Started with Natural Language Processing in Python – This project will focus
on Natural Language Processing (NLP) methodology, such as tokenizing words and sentences, part of
speech identification and tagging, and phrase chunking.

You might also like