INTM571: Data Analytics
Course Outcomes:
• Introduction to Python through installation, basic numerical and string expressions, listing and
formatting.
• Understand conditional logic, branching and iteration.
• Understand dependent libraries usage, basic graphing and statistical analysis using graphs in Python.
• interpret data modelling for regression
• apply Decision Trees in Data handling and modelling
Unit I: Python basic constructs
installing python, basic input and output, variables and assignments, identifiers, objects, numeric types:
floating-point, arithmetic expressions, python expressions, division and modulo, module basics, math
module, representing text, string basics, list and set basics, common data types summary, type
conversions, binary numbers, string formatting
Unit II: Working with Loops and Functions
If-else branches, Equality and relational operators, Boolean operators and expressions, identity
operators, Code blocks and indentation, Conditional expressions Loops, while loops, For loops, Nested
loops, Break and continue, Loop else, User-defined function basics, Returning values from functions,
Reasons for defining functions, Function arguments
Unit III: Exploratory analytics in python: important packages in python, data handling in python,
data cleaning and treatment, performing descriptive statistics in python, using graphs and plots in
python, performing various descriptive statistics in python-central tendency measure, graphical
measures, hypothesis testing
Unit IV: Using Python to Handle Data and Exploratory analytics in Python
Important packages in Python, Data handling in Python, Data cleaning and Treatment, Performing
Descriptive statistics in Python, using graphs and plots in python, performing various descriptive
statistics in Python-central tendency measure, graphical measures, hypothesis testing, Using graphs and
plots in python
Unit V: Performing Regression in Python
Basics of linear regression, working with simple linear regression, Multiple regression, model
building, Non-linear regression, Line estimation, Application of regression and logistic regression
models
Unit VI: Solving Problems using Decision Trees
Principles of Decision trees, Building Decision trees, Cart, C5.0 and CHAID trees, Prediction by
decision trees, Applications of Decision Trees, Model selection and cross validation, Model overfitting
and underfitting, Validation techniques- Holdout validations, Tenfold CV, Bootstrap
Suggested Readings:
Text Book:
• Python for Data Science for Dummies; John Paul Mueller, Luca Massaron; ISBN: 9788126557394
Reference Book
• Core Python Programming, 2ed; Dr.R. Nageswara Rao; ISBN: 9789386052308
• Machine Learning using Python; Manaranjan Pradhan, U Dinesh Kumar; ISBN: 9788126579907
• Machine Learning (in Python and R) For Dummies; John Paul Mueller, Luca Massaron;
ISBN: 9788126563050