Big Data Analytics

Uploaded by

manisha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views3 pages

Big Data Analytics

Uploaded by

manisha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Handbook of B.Tech. Programmes offered by USICT at Affiliated Institutions of the University.

Big Data Analytics L P C

3 3

Discipline(s) / EAE / OAE Semester Group Sub-group Paper Code

CSE-DS 7 PC PC DS-429T
EAE 7 DS-EAE DS-EAE-4 DS-429T

Marking Scheme:
1. Teachers Continuous Evaluation: 25 marks
2. Term end Theory Examinations: 75 marks
Instructions for paper setter:
1. There should be 9 questions in the term end examinations question paper.
2. The first (1st) question should be compulsory and cover the entire syllabus. This question should be
objective, single line answers or short answer type question of total 15 marks.
3. Apart from question 1 which is compulsory, rest of the paper shall consist of 4 units as per the syllabus.
Every unit shall have two questions covering the corresponding unit of the syllabus. However, the student
shall be asked to attempt only one of the two questions in the unit. Individual questions may contain upto
5 sub-parts / sub-questions. Each Unit shall have a marks weightage of 15.
4. The questions are to be framed keeping in view the learning outcomes of the course / paper. The standard
/ level of the questions to be asked should be at the level of the prescribed textbook.
5. The requirement of (scientific) calculators / log-tables / data – tables may be specified if required.
Course Objectives :
1. Understand the Big Data Platform and its Use cases
2. Provide HDFS Concepts and Interfacing with HDFS
3. Provide hands on Hodoop Eco System
4. Exposure to Data Analytics with R
Course Outcomes (CO)
CO 1 Identify Big Data and its Business Implications
CO 2 List the components of Hadoop and Hadoop Eco-System
CO 3 Develop Big Data Solutions using Hadoop Eco System
CO 4 Manage Job Execution in Hadoop Environment
Course Outcomes (CO) to Programme Outcomes (PO) mapping (scale 1: low, 2: Medium, 3: High)
PO01 PO02 PO03 PO04 PO05 PO06 PO07 PO08 PO09 PO10 PO11 PO12
CO 1 2 - - 2 3 - - - 1 - - -
CO 2 - 2 - - - 2 3 - 1 2 - -
CO 3 - 1 - - - 2 - - - 2 - -
CO 4 - 3 - - - 2 - - - 2 - -

UNIT-I

Introduction to Big Data: Introduction to Big Data, Big Data characteristics, Challenges of Conventional
System, Types of Big Data, Intelligent data analysis, Traditional vs. Big Data business approach, Case Study of
Big Data Solutions.

UNIT-II

Hadoop: History of Hadoop, Hadoop Distributed File System: Physical organization of Compte Nodes,
Components of Hadoop Analyzing the Data with Hadoop, Scaling Out, Hadoop Streaming, Design of HDFS,Java
interfaces to HDFS Basics, Developing a Map Reduce Application, How Map Reduce Works, Anatomy of a Map
Reduce Job run, Failures, Job Scheduling, Shuffle and Sort, Task execution, Map Reduce Types and Formats,
Map Reduce Features, Hadoop environment. Setting up a Hadoop Cluster, Cluster specification, Cluster Setup
and Installation, Hadoop Configuration, security in Hadoop, Administering Hadoop, Monitoring-Maintenance,
Hadoop benchmarks, Hadoop in the cloud

Applicable from Batch Admitted in Academic Session 2021-22 Onwards Page 566
Handbook of B.Tech. Programmes offered by USICT at Affiliated Institutions of the University.

UNIT-III

NoSQL: What is NoSQL? NoSQL business drivers; NoSQL case studies; NoSQL data architecture patterns: Key-
value stores, Graph stores, Column family (Bigtable) stores, Document stores, Variations of NoSQL
architectural patterns; Using NoSQL to manage big data: What is a big data NoSQL solution? Understanding
the types of big data problems; Analyzing big data with a shared-nothing architecture; Choosing distribution
models: master-slave versus peer-to-peer; Four ways that NoSQL systems handle big data problems

UNIT – IV

Frameworks: Applications on Big Data Using Pig and Hive, Data processing operators in Pig, Hive services,
HiveQL, Querying Data in Hive, fundamentals of HBase and ZooKeeper, IBM InfoSphere BigInsights and
Streams. Machine Learning: Introduction, Supervised Learning, Unsupervised Learning, Collaborative Filtering.
Big Data Analytics with BigR

Textbook(s):
1. Jiawei Han, Micheline Kamber, Jian Pei, “Data Mining : Concepts and Techniques”, 3rd edition, MK Publisher
2. Tom White “Hadoop: The Definitive Guide” Third Editon, O’reily Media, 2012.

References:
1. Seema Acharya, Subhasini Chellappan, "Big Data Analytics" Wiley 2015.
2. Michael Berthold, David J. Hand, "Intelligent Data Analysis”, Springer, 2007

Applicable from Batch Admitted in Academic Session 2021-22 Onwards Page 567
Handbook of B.Tech. Programmes offered by USICT at Affiliated Institutions of the University.

Big Data Analytics Lab L P C

2 1

Discipline(s) / EAE / OAE Semester Group Sub-group Paper Code

CSE-DS 7 PC PC DS-429P
EAE 7 DS-EAE DS-EAE-4 DS-429P

Marking Scheme:
1. Teachers Continuous Evaluation: 40 marks
2. Term end Theory Examinations: 60 marks
Instructions:
1. The course objectives and course outcomes are identical to that of (Big Data Analytics) as this is the practical
component of the corresponding theory paper.
2. The practical list shall be notified by the teacher in the first week of the class commencement under
intimation to the office of the Head of Department / Institution in which the paper is being offered from the
list of practicals below. Atleast 10 experiments must be performed by the students, they may be asked to
do more. Atleast 5 experiments must be from the given list.

1. Downloading and installing Hadoop; Understanding different Hadoop modes. Startup scripts, Configuration
files
2. Implement the following file management tasks in Hadoop:
i. Adding files and directories
ii. Retrieving files
iii. Deleting files
Hint: A typical Hadoop workflow creates data files (such as log files) elsewhere and copies them into HDFS
using one of the above command line utilities
3. Implement of Matrix Multiplication with Hadoop Map Reduce
4. Write a Map Reduce program that mines weather data. Hint: Weather sensors collecting data every hour at
many locations across the globe gather a large volume of log data, which is a good candidate for analysis
with Map Reduce, since it is semi structured and record-oriented
5. Run a basic Word Count Map Reduce program to understand Map Reduce Paradigm.
6. Implementation of K-means clustering using Map Reduce.
7. Installation of Hive along with practice examples.
8. Installation of HBase, Installing thrift along with Practice examples
9. Run the Pig Latin Scripts to find Word Count.
10. Run the Pig Latin Scripts to find a max temp for each and every year.

Applicable from Batch Admitted in Academic Session 2021-22 Onwards Page 568

BDA Practical File
No ratings yet
BDA Practical File
61 pages
Blda Pract 2024
No ratings yet
Blda Pract 2024
59 pages
2022-23-BDA-LAB Manual
No ratings yet
2022-23-BDA-LAB Manual
59 pages
Bda Lab Manual - Ise 2025-26
No ratings yet
Bda Lab Manual - Ise 2025-26
58 pages
2022-23-BDA-LAB Manual
No ratings yet
2022-23-BDA-LAB Manual
59 pages
BE AIDS R 20 VII VIII Sem Syllabus - Compressed
No ratings yet
BE AIDS R 20 VII VIII Sem Syllabus - Compressed
55 pages
6th Sem AIDS Syllabus 2022 Scheme
No ratings yet
6th Sem AIDS Syllabus 2022 Scheme
52 pages
Syallaus 6 Final
No ratings yet
Syallaus 6 Final
16 pages
6th Sem DS Syllabus 2022 Scheme
No ratings yet
6th Sem DS Syllabus 2022 Scheme
54 pages
BCS714D Syllabus
No ratings yet
BCS714D Syllabus
3 pages
CC ZG522 Course Handout
No ratings yet
CC ZG522 Course Handout
6 pages
Big Data Analytics - Sem 7 CVMU
No ratings yet
Big Data Analytics - Sem 7 CVMU
4 pages
Lab Manual Big Data Analytics Lab (LC-CSE-410G) : Department of Computer Science and Engineering
No ratings yet
Lab Manual Big Data Analytics Lab (LC-CSE-410G) : Department of Computer Science and Engineering
28 pages
Big Data Analytics Comp Syllabus Sem7
No ratings yet
Big Data Analytics Comp Syllabus Sem7
4 pages
Big Daa R18 Manual
No ratings yet
Big Daa R18 Manual
84 pages
Experiment Pgno
No ratings yet
Experiment Pgno
50 pages
Bca Bigdata Fifth - Sem Approved Syllabus
No ratings yet
Bca Bigdata Fifth - Sem Approved Syllabus
23 pages
Syllabus
No ratings yet
Syllabus
4 pages
2CS702-CPD-Odd 23 24
No ratings yet
2CS702-CPD-Odd 23 24
9 pages
Co Po Mapping Bda With Justiificaton
No ratings yet
Co Po Mapping Bda With Justiificaton
4 pages
Big Data Analytics
No ratings yet
Big Data Analytics
2 pages
Big Data Analytics
No ratings yet
Big Data Analytics
2 pages
CCS334 BDA Syllabus
No ratings yet
CCS334 BDA Syllabus
5 pages
AIADS 7th Sem Syllabus Signed
No ratings yet
AIADS 7th Sem Syllabus Signed
19 pages
7th Cssyll
No ratings yet
7th Cssyll
49 pages
Big Data Analytics Course
No ratings yet
Big Data Analytics Course
4 pages
Syllabus Semester7
No ratings yet
Syllabus Semester7
17 pages
Gujarat Technological University: Sr. No. Content Total Hrs % Weightage 1 13
No ratings yet
Gujarat Technological University: Sr. No. Content Total Hrs % Weightage 1 13
3 pages
Notes
No ratings yet
Notes
11 pages
Ccs334 - Big Data Analytics
75% (4)
Ccs334 - Big Data Analytics
2 pages
Bda Syllb
No ratings yet
Bda Syllb
4 pages
Appendix-74
No ratings yet
Appendix-74
42 pages
BDA Manual
No ratings yet
BDA Manual
56 pages
MCA 3rd Semester Big Data Analytics Syllabus
No ratings yet
MCA 3rd Semester Big Data Analytics Syllabus
15 pages
BDA Manual
No ratings yet
BDA Manual
41 pages
Big Data Analytics
No ratings yet
Big Data Analytics
3 pages
Big Data Analytics Course
No ratings yet
Big Data Analytics Course
19 pages
Bda Lab Manual - Bad601
No ratings yet
Bda Lab Manual - Bad601
38 pages
BDA Syllabus
No ratings yet
BDA Syllabus
4 pages
17cs17 - Vcs314 - Big Data Systems
No ratings yet
17cs17 - Vcs314 - Big Data Systems
5 pages
Big Data Analytics Course Guide
No ratings yet
Big Data Analytics Course Guide
2 pages
B.Tech. CS - CE and CSE Syllabus 3rd Year 2024-25
No ratings yet
B.Tech. CS - CE and CSE Syllabus 3rd Year 2024-25
2 pages
BDA Syllabus Final
No ratings yet
BDA Syllabus Final
3 pages
BDA Journal
No ratings yet
BDA Journal
33 pages
6th Sem - Big Data - IsE
No ratings yet
6th Sem - Big Data - IsE
5 pages
20CT1152
No ratings yet
20CT1152
3 pages
20dce017 Bda Pracfil
No ratings yet
20dce017 Bda Pracfil
41 pages
Institute of Technology: Practical List
No ratings yet
Institute of Technology: Practical List
4 pages
CSE 3002 Big Data Technologies - 7sem
No ratings yet
CSE 3002 Big Data Technologies - 7sem
19 pages
Bda 1
No ratings yet
Bda 1
95 pages
Big Data Analytics Syllabus
No ratings yet
Big Data Analytics Syllabus
2 pages
6th Semester Syllabi
No ratings yet
6th Semester Syllabi
15 pages
Bad601 Lab
No ratings yet
Bad601 Lab
32 pages
DSA Practical Index
No ratings yet
DSA Practical Index
3 pages
Syllabus
No ratings yet
Syllabus
7 pages
Big Data 2024
No ratings yet
Big Data 2024
3 pages
Assignment No.1
No ratings yet
Assignment No.1
1 page
Big Data Analytics Course Guide
No ratings yet
Big Data Analytics Course Guide
2 pages
DWDM - External Imp Q's
No ratings yet
DWDM - External Imp Q's
2 pages
Azure Security Infographic
No ratings yet
Azure Security Infographic
1 page
Datacore1 Nutanix
No ratings yet
Datacore1 Nutanix
18 pages
Practice Guide EWM InboundProcess
100% (5)
Practice Guide EWM InboundProcess
19 pages
The Primaver P6 Users Guide To Excel V2
100% (1)
The Primaver P6 Users Guide To Excel V2
34 pages
5235ac2 Contrat PartDedie WE 12.0
No ratings yet
5235ac2 Contrat PartDedie WE 12.0
7 pages
Big Data and Data Analytics Cloudera.
No ratings yet
Big Data and Data Analytics Cloudera.
3 pages
Slide (1) Introduction
No ratings yet
Slide (1) Introduction
26 pages
MB-910T00: Microsoft Dynamics 365 Fundamentals (CRM) : Course Outline
No ratings yet
MB-910T00: Microsoft Dynamics 365 Fundamentals (CRM) : Course Outline
4 pages
MGMT 382 Quiz 1
No ratings yet
MGMT 382 Quiz 1
2 pages
Improving General Ledger Performance
No ratings yet
Improving General Ledger Performance
3 pages
Intra STO Process 5
No ratings yet
Intra STO Process 5
14 pages
Install Robot Framework on Windows 7
No ratings yet
Install Robot Framework on Windows 7
5 pages
Selenium Recipes in Ruby Sample
No ratings yet
Selenium Recipes in Ruby Sample
39 pages
Chapter 2
No ratings yet
Chapter 2
3 pages
Tricentis Datasheet - NeoLoad Continuous Performance Testing
No ratings yet
Tricentis Datasheet - NeoLoad Continuous Performance Testing
2 pages
Systems Design, Implementation, and Operation
No ratings yet
Systems Design, Implementation, and Operation
14 pages
CIT 503 Database Administration and Management
No ratings yet
CIT 503 Database Administration and Management
5 pages
Payroll System for Companies
0% (1)
Payroll System for Companies
36 pages
OpenNMS - mib2openNMS v1.0
No ratings yet
OpenNMS - mib2openNMS v1.0
4 pages
Bugreport 2022 01 04 15 15 29 Dumpstate - Log 29625
No ratings yet
Bugreport 2022 01 04 15 15 29 Dumpstate - Log 29625
3 pages
Requisition Approval Using AME
100% (2)
Requisition Approval Using AME
29 pages
Director Delivery: IT & Healthcare Expertise
No ratings yet
Director Delivery: IT & Healthcare Expertise
5 pages
Post Gree
No ratings yet
Post Gree
180 pages
Abhishek Resume 0821
No ratings yet
Abhishek Resume 0821
1 page
? Top 10 Java Frameworks For 2025! ?
No ratings yet
? Top 10 Java Frameworks For 2025! ?
13 pages
Software Design for Developers
No ratings yet
Software Design for Developers
39 pages
State Board of Cricket Council - Requirement Document 2
No ratings yet
State Board of Cricket Council - Requirement Document 2
4 pages
AEM Learning for Professionals
No ratings yet
AEM Learning for Professionals
32 pages
Hospital Are The Essential Part of Our Live1
No ratings yet
Hospital Are The Essential Part of Our Live1
15 pages

Big Data Analytics

Uploaded by

Big Data Analytics

Uploaded by

Handbook of B.Tech. Programmes offered by USICT at Affiliated Institutions of the University.

Big Data Analytics L P C

Discipline(s) / EAE / OAE Semester Group Sub-group Paper Code

Big Data Analytics Lab L P C

Discipline(s) / EAE / OAE Semester Group Sub-group Paper Code

You might also like