KEMBAR78
IE465 Data Analytics Using R and Python | PDF | Python (Programming Language) | Data Analysis
0% found this document useful (1 vote)
495 views2 pages

IE465 Data Analytics Using R and Python

This document outlines the course code IE465 - Data Analytics Using R and Python. The course is a 3 credit course introduced in 2016. The objectives are to learn about being a data scientist and to use R and Python for data analytics tasks like data manipulation, visualization, statistical modeling, and predictive modeling. The syllabus covers topics in R like data structures, statistics, modeling, and reporting as well as Python topics like programming, NumPy, Pandas, and data analysis. Upon completing the course, students will be able to use R and Python to manipulate, visualize, model, and present results from data analysis problems. The course plan outlines the modules, topics, and exam structure.

Uploaded by

Ujjwala J.B
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (1 vote)
495 views2 pages

IE465 Data Analytics Using R and Python

This document outlines the course code IE465 - Data Analytics Using R and Python. The course is a 3 credit course introduced in 2016. The objectives are to learn about being a data scientist and to use R and Python for data analytics tasks like data manipulation, visualization, statistical modeling, and predictive modeling. The syllabus covers topics in R like data structures, statistics, modeling, and reporting as well as Python topics like programming, NumPy, Pandas, and data analysis. Upon completing the course, students will be able to use R and Python to manipulate, visualize, model, and present results from data analysis problems. The course plan outlines the modules, topics, and exam structure.

Uploaded by

Ujjwala J.B
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Course code Course Name L-T-P- Credits Year of

Introduction
DATA ANALYTICS USING R AND
IE465 3-0-0-3 2016
PYTHON
Prerequisite: Nil
Course Objectives
 To learn about what it’s like to be a Data Scientist.
 To Learn R and Python for Data Analytics.
Syllabus
Introduction to R; R and R studio; Basics of R; Advanced Data Structures; Reading Data into R;
Statistical Graphics; R programming; Data Munging; String Manipulation; Basic Statistics; Linear
Models; Predictive Modeling; Time Series Analysis; Clustering; Association Rules; Text Mining;
Sentiment Analysis; Social Network Analysis; Reports and Slideshows; R Package Building.
Introduction to Python; Python Programming; NumPy; Pandas; Data Loading, Storage , File
formats, Data Wrangling; Plotting and Visualization; Data Aggregation and Group Operations;
Time Series Analysis; Financial and Economic Data Applications
Expected Outcome
After Completion of course, the students will be able to use R and Python to:
i. Manipulate and extract information from data
ii. Make informative plots
iii. Construct and apply statistical learning methods for predictive modeling
iv. Properly select, tune, and assess models
v. Reproduce and present results from data analysis
References
1. Dirk Eddelbuettel, “Seamless R and C++ Integration with Rcpp”, Springer
2. James, Witten, Hastie and Tibshirani,“An Introduction to Statistical Learning: with
Applications in R”, free electronic version of this book available at http://www-
bcf.usc.edu/~gareth/ISL/.
3. Jarad Lander, “R for Everyone: Advanced Analytics and Graphics” , Addison
Wesley.
4. Johannes Ledolter, “Data mining and business analytics with R”, John Wiley & Sons.
5. Mark Gardener, “R The Statistical Programming” , Wiley.
6. Peter Wang and Aron Ahmadia, “Fundamentals of Data Analytics in Python”,
Addison Wesley Live Lessons
7. Torgo, Luís, “Data mining with R : learning with case studies”, CRC Press
8. Wes McKinney, “Python for Data Analysis”, O’Reilly.
9. http://www.rdatamining.com/
COURSE PLAN
Contents Hours End Sem.
Module Exam.
marks

Introduction to R; Installation of R and R Studio; Installing and


1
loading R packages
I Basic building blocks in R; Advanced Data Structures in R;
3 10
Reading data into R; Statistical Graphs in R
R Programming 3
Data Munching-Group manipulation, Reshaping; String 3
II Manipulation 20

Basic Statistics; Linear Models 4


FIRST INTERNAL EXAM
Predictive Modeling: Generalized Linear Models; Model
3
III Diagnostics; Regularization and Shrinkage
20
Nonlinear Models; Time Series and Autocorrelation; Multivariate
data exploration and discrimination. 3

Clustering;Association Rules; Text Mining; Sentiment Analysis;


Social Network Analysis; Reports and Slideshows 4
IV
10
R Package Building, Introduction to Rcpp, Data structures, Using
Rcpp in package, Modules, Operators, Functions, Applications. 4

SECOND INTERNAL EXAM


Introduction to Python: Python Libraries, Installation and Setup;
Python Programming: Data Types and Variables, Python input
V
and output, If statements, while loops, for loops, Iterators, Lists, 7 20
Functions , Modules, Object Oriented Programming, Inheritance,
Exception Handling, Using Data Structures.
Basic Analytics with Python; Numerical Analysis with NumPy 2
VI Advanced Analytics with SciPy and sci-kit learn 2 20
Tabular Data Analysis with Pandas; Python Visualization
3
Tools; Financial and Economic Data Applications
END SEMESTER EXAM
End Semester Examination Question Paper Pattern:
Examination duration: 3 hours Maximum Marks: 100
Part A (Modules I and II):
Candidates have to answer any 2 questions from a choice of 3 questions. Each full question
carries a total of 15 marks and can have a maximum of 4 sub questions (a, b, c, d). No two
questions shall be exclusively from a single module. All three questions shall preferably have
components from both modules. Marks for each question/sub question shall be clearly
specified. Total percentage of marks for the two modules put together as specified in the
curriculum shall be adhered to for all combinations of any two questions.
Part B (Modules III and IV):
(Same as for part A marks)
Part C (Modules V and VI):
(Same as for part A, except that each full question carries 20 marks)

Note: If use of tables and charts are permitted for the university examination for this course,
proper direction of the same should be provided on the facing sheet of the question paper.

You might also like