Data Science using SAS & R
A comprehensive, job-oriented training program crafted by experts
Disclaimer: This material is protected under the Copyright Act. © AnalytixLabs, 2011-2016. Unauthorized use and/or duplication of this material or any part of it, including data, in any form without explicit written permission from AnalytixLabs is strictly prohibited. Any violation of this copyright will attract legal action.
About AnalytixLabs
AnalytixLabs is a capability building and training solutions firm led by McKinsey, IIM, ISB and IIT alumni with deep industry experience and a flair for coaching. We focus on helping our clients develop skills in basic and advanced analytics so that they emerge as “Industry Ready” professionals with enhanced career opportunities. AnalytixLabs has also been featured among the top institutes by prestigious publications such as Analytics India Magazine and Higher Education Review since 2013.
Faculty
• Seasoned analytics professionals
• Together we have 30+ years of experience with prestigious firms, like McKinsey, KPMG, Deloitte and AOL
• Regular sessions by industry experts
Content
• World class course structure
• Surpasses industry requirements
• Cater to Standard certifications
• High quality course material and real life case studies
Approach
• 80-20 focus on practical & theory
• Personal attention and individual counselling
• Industry best practices
Bottom line
• Job-oriented training
• Lucrative job prospects in high growth domain
• Support for relevant certifications and diplomas
• Career counseling and planning
• Value for money with high return on investment
Global Data Science and Big Data skill gap
McKinsey Global Institute estimates a shortage of nearly 1.7 million big data professionals by 2018. This includes a shortage of 140,000 to 190,000 workers with deep technical and analytical expertise, and a shortage of 1.5 million managers and analysts equipped to work with and use big data outputs.
Candidates trained by us are working in leading companies across
industries…
Data Science using SAS and R is a comprehensive program with the following modules, weekly assignments and case studies:
Module 1: MS Excel – 6 hours + Practice exercises
• Basic data handling to advanced mathematical and logical functions
Module 2: SAS Base & Advanced – 24 hours + Practice
• Hands-on training on SAS & SQL for data mining & analytics reporting
Module 3: R Foundation – 12 hours + Practice
• Hands-on training on R packages for data mining & analytics reporting
Module 4: Business Analytics using SAS & R – 33 hours
• Basic and advanced analytics techniques with applications
Crafted by a team of experts, the program maintains a balance between theoretical concepts and practical applications.
MS Excel – Basic and Advanced
Total Duration: 6 hours
Introduction to Excel
• Navigating Worksheets
• Formatting and Editing Worksheet Data
• Grouping Data, Subtotals and Data Validation
Working with Formulas and Functions
• Referencing Functions
• Filter, Sorting, Advanced Filter
• Conditional formatting
Creating Charts and Graphics
• Simple charts
• Pie charts
• Dynamic charts
Using Advanced Excel features
• Worksheet protection and security
• Data Validation
• Text to column, use of delimiter, etc.
Logical functions using Excel
• And, if, not, or, true, false
• Concatenate, left, right, mid, len, trim
• Upper, lower, proper, exact, etc.
Analyzing Data with Excel
• Sum, Count, Sumifs, Countifs
• Consolidating Worksheets (Vlookup, Index, Match, Offset)
• Using Pivot Tables
Data Analysis ToolPak
• Data tables
• Goal Seek
SAS – Base and Advanced
(SAS + R Edge program also available separately - Modules 1+2+3)
Total Duration: 24 hours
Introduction - Data importing - Understanding
• Introduction to SAS, GUI
• Concepts of Libraries, PDV, data execution, etc.
• Building blocks of SAS (Data & Proc Steps - Statements & options)
• Debugging SAS Codes
• Importing different types of data & connecting to databases
• Data Understanding (Metadata, variable attributes (format, informat, length, label, etc.))
• SAS Procedures for data import/export/understanding (Proc import/Proc contents/Proc print, etc.)
Data Manipulation
• Data Manipulation steps (Sorting, filtering, duplicates, merging, appending, subsetting, derived variables, sampling, Data type conversions, renaming, formatting, etc.)
• Data manipulation tools (Operators, Functions, Procedures, control structures, Loops, arrays, etc.)
• SAS Functions (Text, numeric, date, utility functions)
• SAS Procedures for data manipulation (Proc sort, Proc format, Proc transpose, etc.)
• SAS Options (System Level, procedure level)
Exploratory Data Analysis & Data visualization
• Introduction to exploratory data analysis
• Descriptive statistics, Frequency Tables and summarization
• Univariate Analysis (Distribution of data & Graphical Analysis)
• Bivariate Analysis (Cross Tabs, Distributions & Relationships, Graphical Analysis)
• SAS Procedures for Data Analysis (Proc freq/Proc means/Proc summary/Proc tabulate/Proc univariate, etc.)
• SAS Procedures for Graphical Analysis (Proc Sgplot, Proc gplot, etc.)
Reporting - Output Exporting
• Introduction to Reporting
• SAS Reporting Procedures (Proc print, Proc Report, Proc Tabulate, etc.)
• Exporting data sets into different formats (Using Proc export)
• Concept of ODS (Output Delivery System)
• ODS System - Exporting output into different formats
Optimizing SAS Codes
• Introduction to Advanced SAS - Proc SQL & Macros
• Understanding select statement (From, where, group by, having, order by, etc.)
• Proc SQL - Data creation/extraction
• Proc SQL - Data Manipulation steps
• Proc SQL - Summarizing Data
• Proc SQL - Concept of sub queries, indexes, etc.
• SAS Macros - Creating/defining macro variables
• SAS Macros - Defining/calling macros
• SAS Macros - Concept of local/global variables
• SAS Macros - Debugging techniques
R – Foundation
(SAS + R Edge program also available separately - Modules 1+2+3)
Total Duration: 12 hours
Introduction - Data Importing/Exporting
• Introduction to R/RStudio - GUI
• Concept of Packages - Useful Packages (Base & other packages) in R
• Data Structure & Data Types (Vectors, Matrices, factors, Data frames, and Lists)
• Importing Data from various sources
• Database Input (Connecting to database)
• Exporting Data to various formats
• Viewing Data (Viewing partial data and full data)
• Variable & Value Labels – Date Values
Data Manipulation
• Data Manipulation steps (Sorting, filtering, duplicates, merging, appending, subsetting, derived variables, sampling, Data type conversions, renaming, formatting, etc.)
• Data manipulation tools (Operators, Functions, Packages, control structures, Loops, arrays, etc.)
• R Built-in Functions (Text, Numeric, Date, utility)
• R User Defined Functions
• R Packages for data manipulation (base, dplyr, plyr, reshape, car, sqldf, etc.)
Data Analysis - Visualization
• Introduction to exploratory data analysis
• Descriptive statistics, Frequency Tables and summarization
• Univariate Analysis (Distribution of data & Graphical Analysis)
• Bivariate Analysis (Cross Tabs, Distributions & Relationships, Graphical Analysis)
• Creating Graphs (Bar/pie/line chart/histogram/boxplot/scatter/density, etc.)
• R Packages for Exploratory Data Analysis (dplyr, plyr, gmodels, car, vcd, Hmisc, psych, doBy, etc.)
• R Packages for Graphical Analysis (base, ggplot2, lattice, etc.)
Business Analytics using SAS and R
(Business Analytics program also available separately - Modules 1+4)
Total Duration: 33 hours
Introduction to Statistics
• Basic Statistics - Measures of Central Tendencies and Variance
• Building blocks - Probability Distributions - Normal distribution - Central Limit Theorem
• Inferential Statistics - Sampling - Concept of Hypothesis Testing
• Statistical Methods - Z/t-tests (One sample, independent, paired), ANOVA, Correlations and Chi-square
Introduction to Predictive Modeling
• Introduction to Predictive Modeling
• Types of Business problems - Mapping of Techniques
• Different Phases of Predictive Modeling
Data Preparation
• Need of Data preparation
• Data Audit Report and its importance
• Consolidation/Aggregation - Outlier treatment - Flat Liners - Missing values - Dummy creation - Variable Reduction
• Variable Reduction Techniques - Factor & PCA Analysis
Segmentation
• Introduction to Segmentation
• Types of Segmentation (Subjective Vs. Objective, Heuristic Vs. Statistical)
• Heuristic Segmentation Techniques (Value Based, RFM Segmentation and Life Stage Segmentation)
• Behavioural Segmentation Techniques (K-Means Cluster Analysis)
• Cluster evaluation and profiling
• Interpretation of results - Implementation on new data
Decision Trees
• Decision Trees - Introduction - Applications
• Types of Decision Tree Algorithms
• CHAID Vs. CART
• Decision Trees - Validation
• Overfitting - Best Practices to avoid
• Implementation of Solution
Linear Regression
• Introduction - Applications
• Assumptions of Linear Regression
• Building Linear Regression Model
• Understanding standard metrics (Variable significance, R-square/Adjusted R-square, Global hypothesis, etc.)
• Validation of Models (Re running Vs. Scoring)
• Standard Business Outputs (Decile Analysis, Error distribution (histogram), Model equation, drivers, etc.)
• Interpretation of Results - Business Validation - Implementation on new data
Logistic Regression
• Introduction - Applications
• Linear Regression Vs. Logistic Regression Vs. Generalized Linear Models
• Building Logistic Regression Model
• Understanding standard model metrics (Concordance, Variable significance, Hosmer-Lemeshow Test, Gini, KS, Misclassification, etc.)
• Validation of Logistic Regression Models (Re running Vs. Scoring)
• Standard Business Outputs (Decile Analysis, ROC Curve, Probability Cut-offs, Lift charts, Model equation, Drivers, etc.)
• Interpretation of Results - Business Validation - Implementation on new data
Time Series Forecasting
• Introduction - Applications
• Time Series Components (Trend, Seasonality, Cyclicity and Level) and Decomposition
• Classification of Techniques (Pattern based - Pattern less)
• Basic Techniques - Averages, Smoothing, etc.
• Advanced Techniques - AR Models, ARIMA, etc.
• Understanding Forecasting Accuracy - MAPE, MAD, MSE, etc.
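The forecasting accuracy measures named in the Time Series Forecasting topics (MAPE, MAD, MSE) can be illustrated with a short sketch. It is written in plain Python for illustration only and is not taken from the course material; the function name and the sample numbers are hypothetical, and MAD is read here as the mean absolute deviation of the forecast errors.

```python
def forecast_errors(actual, forecast):
    """Return MAD, MSE and MAPE (in percent) for two equal-length sequences."""
    if len(actual) != len(forecast) or len(actual) == 0:
        raise ValueError("actual and forecast must be non-empty and equal in length")
    errors = [a - f for a, f in zip(actual, forecast)]
    n = len(errors)
    mad = sum(abs(e) for e in errors) / n   # mean absolute deviation
    mse = sum(e * e for e in errors) / n    # mean squared error
    # MAPE divides by the actual values, so it is undefined if any actual is zero
    mape = 100 * sum(abs(e / a) for e, a in zip(errors, actual)) / n
    return {"MAD": mad, "MSE": mse, "MAPE": mape}

# Hypothetical data, chosen only to make the arithmetic easy to follow
actual = [100, 200, 400]
forecast = [110, 180, 400]
metrics = forecast_errors(actual, forecast)
print(metrics)
```

With these sample values the absolute errors are 10, 20 and 0, so MAD is 10, MSE is 500/3 and MAPE is about 6.67%. MSE penalises large misses more heavily than MAD, while MAPE expresses the error relative to the size of the actual values.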
Course completion and career assistance
Course completion & Certification criteria
• You shall be awarded an AnalytixLabs certificate only after submission and evaluation of the mandatory course project work. The projects will be provided as part of the training.
• There is no pass/fail for these assignments and projects. Our objective is to ensure that trainees get strong hands-on experience so that they are well prepared for job interviews and for performing in their jobs.
• In case the assignments and projects are not up to the mark, trainees are welcome to seek help and support to improve them.
• While a weekly schedule is shared with trainees for regular assignments, candidates get 3 months after course completion to submit their final assignment and projects.
What is included in career assistance?
• After successful course completion, candidates can seek assistance from AnalytixLabs for profile building. A team of seasoned professionals will help you based on your overall education background and work experience. This will be followed by interview preparation along with mock interviews (if required).
• Job referrals are based on the requirements we get from various organizations, HR consultants and the large pool of AnalytixLabs’ ex-students working in various companies.
• No one can truthfully provide a job guarantee, particularly for good quality job profiles in Analytics. However, most of our students do get multiple interview calls and good career options based on the skills they learn during training. For this there will be continuous support from our side for as long as required.
Time and investment
SAS + R + Business Analytics: 80 hours + Practice, INR 30,000 + 15% ST / $1200 (foreign nationals)
SAS + R + Business Analytics (self-paced): 80 hours + Practice, INR 25,000 + 15% ST / $900 (foreign nationals)
R + Business Analytics: 48 hours + Practice, INR 25,000 + 15% ST / $750 (foreign nationals)
Business Analytics: 33 hours + Practice, INR 20,000 + 15% ST / $600 (foreign nationals)
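As a worked illustration of the INR fees above, the sketch below adds the 15% tax to each base fee. It assumes that "ST" stands for service tax, that the tax applies to the full base fee, and that no other charges apply; none of this is stated in the fee list, and the function and variable names are hypothetical.

```python
SERVICE_TAX_PERCENT = 15  # the "15% ST" in the fee list, assumed to be service tax

# Base fees in INR as listed above
fees_inr = {
    "SAS + R + Business Analytics": 30000,
    "SAS + R + Business Analytics (self-paced)": 25000,
    "R + Business Analytics": 25000,
    "Business Analytics": 20000,
}

def total_with_tax(base_inr, tax_percent=SERVICE_TAX_PERCENT):
    """Base fee plus tax in whole rupees; integer arithmetic avoids float rounding."""
    return base_inr + base_inr * tax_percent // 100

totals = {name: total_with_tax(fee) for name, fee in fees_inr.items()}
for name, total in totals.items():
    print(f"{name}: INR {total:,}")
```

Under these assumptions the four programs come to INR 34,500, INR 28,750, INR 28,750 and INR 23,000 respectively.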
Timing: 6 hours per weekend live training (Saturday & Sunday, 3 hours each) + Practice
Training mode: Fully interactive live online class / Classroom (in Gurgaon center only)
(In addition to the above, you will also get access to the recordings for future reference and self study)
Components: Learning Management System access for courseware like class recordings, study material and industry-relevant project work
Certification: Participants will be awarded a certificate on successful completion of the stipulated requirements including an evaluation
Base SAS Global Certification: $180 (optional)
SAS Business Analyst Global Certification: $180 (optional)
We provide training in both ‘fully interactive live online’ and classroom* mode
• Fully interactive live online class with personal attention
• Access to quality training and 24x7 practice sessions available at the comfort of your place
• Saves commuting time and resources in today’s chaotic world
• Ensures best use of time and resources
• Delivered lectures are recorded and can be replayed by individuals as per their needs
• Studies prove that online education beats the conventional classroom
• One of the strongest global trends in education, both in developing and developed countries
*Classroom only available at Gurgaon center
Contact Us
Visit us on: http://www.analytixlabs.in/
For course registration, please visit: http://www.analytixlabs.co.in/course-registration/
For more information, please contact us: http://www.analytixlabs.co.in/contact-us/
Or email: info@analytixlabs.co.in
Call us, we would love to speak with you: (+91) 9555219007
Join us on:
Twitter - http://twitter.com/#!/AnalytixLabs
Facebook - http://www.facebook.com/analytixlabs
LinkedIn - http://www.linkedin.com/in/analytixlabs
Blog - http://www.analytixlabs.co.in/category/blog/