0% found this document useful (0 votes)

72 views3 pages

Syllabus

Uploaded by

Akshat 31

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

72 views3 pages

Syllabus

Uploaded by

Akshat 31

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Course Code Course Theory Practical Tutorial Theory Practical/ Tutorial Total

Name Oral
ITDO8011 Big Data 03 -- -- 03 -- -- 03
Analytics

Examination Scheme
Theory Marks
Course
Course Code Internal assessment End Term
Name Practical Oral Total
Avg. of 2 Sem. Work
Test1 Test 2
Tests Exam
ITDO8011 Big Data
20 20 20 80 -- -- -- 100
Analytics

Course Objectives:

Sr.No Course Objectives

1 To provide an overview of an exciting growing field of Big Data analytics.
2 To discuss the challenges traditional data mining algorithms face when analyzing Big Data.
3 To introduce the tools required to manage and analyze big data like Hadoop, NoSql MapReduce.
4 To teach the fundamental techniques and principles in achieving big data analytics with scalability and streaming
capability.
5 To introduce to the students several types of big data like social media, web graphs and data streams.
6 To enable students to have skills that will help them to solve complex real-world problems in decision support.

Course Outcomes:
Sr. Course Outcomes Cognitive levels of
No attainment as per
Bloom’s Taxonomy
On successful completion, of course, learner/student will be able to:
1 Explain the motivation for big data systems and identify the main sources of Big Data L1,L2,L3
in the real world.
2 Demonstrate an ability to use frameworks like Hadoop, NOSQL to efficiently store, L1,L2,L3
retrieve and process Big Data for Analytics.
3 Implement several Data Intensive tasks using the Map Reduce Paradigm. L1,L2,L3
4 Apply several newer algorithms for Clustering Classifying and finding associations in L1,L2,L3
Big Data.
5 Design algorithms to analyze Big data like streams, Web Graphs and Social Media L6
data.
6 Design and implement successful Recommendation engines for enterprises. L6

Prerequisite: AI and DS

DETAILED SYLLABUS:

Sr. Module Detailed Content Hours CO Mapping

No.

University of Mumbai, B. E. (Information Technology), Rev 2016 286

0 Prerequisite Data Mining, Data Science 02

I Introduction to Introduction to Big Data, Big Data characteristics, types of 03 CO1

Big Data Big Data, Traditional vs. Big Data business approach, Big
Data Challenges, Examples of Big Data in Real Life, Big
Data Applications
Self-learning Topics: Identification of Big Data applications
and its solutions

II Introduction to What is Hadoop? Core Hadoop Components; Hadoop 06 CO2

Big Data Ecosystem; Working with Apache Spark
Frameworks What is NoSQL? NoSQL data architecture patterns: Key-
value stores, Graph stores, Column family (Bigtable) stores,
Document stores, MongoDB
Self-learning Topics:HDFS vs GFS, MongoDB vs other
NoSQL system, Implementation of Apache Spark

III MapReduce MapReduce: The Map Tasks, Grouping by Key, The Reduce 07 CO3
Paradigm Tasks, Combiners, Details of MapReduce Execution, Coping
With Node Failures. Algorithms Using MapReduce: Matrix-
Vector Multiplication by MapReduce , Relational-Algebra
Operations, Computing Selections by MapReduce,
Computing Projections by MapReduce, Union, Intersection,
and Difference by MapReduce, Computing Natural Join by
MapReduce, Grouping and Aggregation by MapReduce,
Matrix Multiplication, Matrix Multiplication with One
MapReduce Step . Illustrating use of MapReduce with use of
real life databases and applications.
Self-learning Topics:Implementation of MapReduce
algorithms like Word count, Matrix-Vector and Matrix-
Matrix algorithm
IV Mining Big Data The Stream Data Model: A DataStream-Management System, 07 CO4
Streams Examples of Stream Sources, Stream Queries, Issues in
Stream Processing. Sampling Data in a Stream : Sampling
Techniques. Filtering Streams: The Bloom Filter Counting
Distinct Elements in a Stream : The Count-Distinct Problem,
The Flajolet-Martin Algorithm, Combining Estimates, Space
Requirements . Counting Ones in a Window: The Cost of
Exact Counts, The Datar-Gionis-Indyk, Motwani Algorithm,
Query Answering in the DGIM Algorithm.
Self-learning Topics: Streaming services like Apache
Kafka/Amazon Kinesis/Google Cloud DataFlow.
Standard spark streaming library.
Integration with IOT devices to capture real time stream data.

V Big Data Mining Frequent Pattern Mining : Handling Larger Datasets in Main 07 CO5
Algorithms Memory Basic Algorithm of Park, Chen, and Yu. The SON
Algorithm and MapReduce. Clustering Algorithms: CURE
Algorithm. Canopy Clustering, Clustering with MapReduce
Classification Algorithms: Overview SVM classifiers,
Parallel SVM, KNearest Neighbor classifications for Big
Data, One Nearest Neighbour.
Self-learning Topics: Standard libraries included with spark
like graphX, MLlib

University of Mumbai, B. E. (Information Technology), Rev 2016 287

VI Big Data Link Analysis : PageRank Definition, Structure of the web, 07 CO6
Analytics dead ends, Using Page rank in a search engine, Efficient
Applications computation of Page Rank: PageRank Iteration Using
MapReduce, Topic sensitive Page Rank, link Spam, Hubs and
Authorities, HITS Algorithm.
Mining Social- Network Graphs : Social Networks as
Graphs, Types , Clustering of Social Network Graphs, Direct
Discovery of Communities, Counting triangles using Map-
Reduce.
Recommendation Engines: A Model for Recommendation
Systems, Content-Based Recommendations, Collaborative
Filtering
Self-learning Topics: Sample applications like social media
feeds, multiplayer game interactions, retail industry, financial
data analysis. Use case like location data, real-time stock
trades, log monitoring etc

Text Books:

1. Anand Rajaraman and Jeff Ullman ―Mining of Massive Datasets‖, Cambridge University Press.
2. Alex Holmes ―Hadoop in Practice‖, Manning Press, Dreamtech Press.
3. Professional NoSQL Paperback, by Shashank Tiwari, Dreamtech Press
4. Rajkumar Buyya, ,Rodrigo N. Calheiros and Amir Vahid Dastjerdi, ―Big Data Principles and Paradigms‖, Morgan Kaufmann

References Books:

1. Analytics in a Big Data World: The Essential Guide to Data Science and its Applications, Bart Baesens , WILEY Big Data
Series.
2. Big Data Analytics with R and Hadoop by Vignesh Prajapati Paperback, Packt Publishing Limited
3. Hadoop: The Definitive Guide by Tom White, O'Reilly Publications

Online References:

1. https://nptel.ac.in/courses/106/104/106104189/
2. https://nptel.ac.in/courses/106106142/
3. https://nptel.ac.in/courses/106105186/

Assessment:

Internal Assessment (IA) for 20 marks:

 IA will consist of Two Compulsory Internal Assessment Tests. Approximately 40% to 50% of syllabus content
must be covered in First IA Test and remaining 40% to 50% of syllabus content must be covered in Second IA
Test

 Question paper format

 Question Paper will comprise of a total of six questions each carrying 20 marks Q.1 will be compulsory and
should cover maximum contents of the syllabus

 Remaining questions will be mixed in nature (part (a) and part (b) of each question must be from different
modules. For example, if Q.2 has part (a) from Module 3 then part (b) must be from any other Module randomly
selected from all the modules)

A total of four questions need to be answered.

University of Mumbai, B. E. (Information Technology), Rev 2016 288

Information Technology Engineering Syllabus Sem Viii Mumbai University
No ratings yet
Information Technology Engineering Syllabus Sem Viii Mumbai University
60 pages
BDA Syllabus - Sem VII - Mumbai University
No ratings yet
BDA Syllabus - Sem VII - Mumbai University
3 pages
Introduction of Subject
No ratings yet
Introduction of Subject
28 pages
Big Data Analytics Comp Syllabus Sem7
No ratings yet
Big Data Analytics Comp Syllabus Sem7
4 pages
Big Data Analytics Course Guide
No ratings yet
Big Data Analytics Course Guide
2 pages
IOT Analytics - AI361
No ratings yet
IOT Analytics - AI361
3 pages
SEM VII BDA Syllabus Theory
No ratings yet
SEM VII BDA Syllabus Theory
4 pages
COMP9313: Big Data Management
No ratings yet
COMP9313: Big Data Management
79 pages
Syllabus
No ratings yet
Syllabus
7 pages
BDA Syllabus
No ratings yet
BDA Syllabus
3 pages
CS8091 Big Data Analytics
No ratings yet
CS8091 Big Data Analytics
28 pages
MCAD2232 (PRESS) BIG DATA and Its Applications
No ratings yet
MCAD2232 (PRESS) BIG DATA and Its Applications
140 pages
College La Iruthu Come Back Bone Only For
No ratings yet
College La Iruthu Come Back Bone Only For
2 pages
Module - 1
No ratings yet
Module - 1
84 pages
21cs71BDA Question Bank
No ratings yet
21cs71BDA Question Bank
4 pages
Big Data Syllabus
No ratings yet
Big Data Syllabus
6 pages
Big Data Analytics
No ratings yet
Big Data Analytics
2 pages
Big Data SV Publication
No ratings yet
Big Data SV Publication
142 pages
Big Data analyticsNEW SYLLABUS FRAMING
No ratings yet
Big Data analyticsNEW SYLLABUS FRAMING
3 pages
Final Lesson Plan
No ratings yet
Final Lesson Plan
8 pages
CS8091 Bigdata Analytics Lessonplan With Date
No ratings yet
CS8091 Bigdata Analytics Lessonplan With Date
11 pages
Big Data-2
No ratings yet
Big Data-2
3 pages
Course Pack BDA
No ratings yet
Course Pack BDA
6 pages
B.Tech. CS - CE and CSE Syllabus 3rd Year 2024-25
No ratings yet
B.Tech. CS - CE and CSE Syllabus 3rd Year 2024-25
2 pages
BDA Syllabus
No ratings yet
BDA Syllabus
4 pages
Techknowledge Publication: Big Data Analytics
No ratings yet
Techknowledge Publication: Big Data Analytics
156 pages
BCA - 409 Syallabus
No ratings yet
BCA - 409 Syallabus
2 pages
Bca Bigdata Fifth - Sem Approved Syllabus
No ratings yet
Bca Bigdata Fifth - Sem Approved Syllabus
23 pages
Big Data Analytics-Digital Notes
No ratings yet
Big Data Analytics-Digital Notes
86 pages
Big Data - 2 Marks-1
No ratings yet
Big Data - 2 Marks-1
1 page
Data Science and Big Data Analytics
No ratings yet
Data Science and Big Data Analytics
2 pages
22IS61 Big Data Analytics 2025
No ratings yet
22IS61 Big Data Analytics 2025
4 pages
Big Data Analytics
No ratings yet
Big Data Analytics
3 pages
CS8091 Bigdata QB 2022-2023 Final
No ratings yet
CS8091 Bigdata QB 2022-2023 Final
6 pages
10bda Lesson Plan 24-25
No ratings yet
10bda Lesson Plan 24-25
3 pages
CS8091 Syllabus
No ratings yet
CS8091 Syllabus
2 pages
Big Data Analytics Course Outline (Fall 2020) : Dr. Tariq Mahmood 830 Am - 11 Am (Monday) Scope
No ratings yet
Big Data Analytics Course Outline (Fall 2020) : Dr. Tariq Mahmood 830 Am - 11 Am (Monday) Scope
3 pages
Big Data Analytics for B.Tech Students
No ratings yet
Big Data Analytics for B.Tech Students
119 pages
113 Ce 74
No ratings yet
113 Ce 74
4 pages
B.tech.-CSE - IBM 2023-24 Syllabus.
No ratings yet
B.tech.-CSE - IBM 2023-24 Syllabus.
1 page
No SQL Database in Bda
No ratings yet
No SQL Database in Bda
84 pages
BDA Unit 1
No ratings yet
BDA Unit 1
36 pages
r18 - Big Data Analytics - Cse (DS)
0% (1)
r18 - Big Data Analytics - Cse (DS)
1 page
Syllabus Sem 7
No ratings yet
Syllabus Sem 7
10 pages
Big Data & Hadoop Mastery Guide
No ratings yet
Big Data & Hadoop Mastery Guide
2 pages
Bda Ap
No ratings yet
Bda Ap
13 pages
Ds603Pc: Big Data Analytics B.Tech. III Year II Sem. L T P C 3 0 0 3 Course Objectives
No ratings yet
Ds603Pc: Big Data Analytics B.Tech. III Year II Sem. L T P C 3 0 0 3 Course Objectives
1 page
Big Data Analytics - Sem 7 CVMU
No ratings yet
Big Data Analytics - Sem 7 CVMU
4 pages
Bda U1
No ratings yet
Bda U1
80 pages
BDA - Unit-1
No ratings yet
BDA - Unit-1
24 pages
J. B. Institute of Engineering and Technology
No ratings yet
J. B. Institute of Engineering and Technology
1 page
4.7.1 Bda-Mba
No ratings yet
4.7.1 Bda-Mba
2 pages
BD Course Handout (Spring 2024)
No ratings yet
BD Course Handout (Spring 2024)
4 pages
Big Data Analytics for B.Tech Students
No ratings yet
Big Data Analytics for B.Tech Students
134 pages
CSE704 Data Analytics Syllabus Theory
No ratings yet
CSE704 Data Analytics Syllabus Theory
2 pages
Cryptocurrency Price Prediction Report
No ratings yet
Cryptocurrency Price Prediction Report
28 pages
MUSDVBADR 4.0.5 Jan 22
No ratings yet
MUSDVBADR 4.0.5 Jan 22
203 pages
MSME Questionnaire Form
No ratings yet
MSME Questionnaire Form
2 pages
WhatsApp Terms Conditions
No ratings yet
WhatsApp Terms Conditions
5 pages
WhatsApp Privacy Policy
No ratings yet
WhatsApp Privacy Policy
5 pages
Ai DS Ii May
No ratings yet
Ai DS Ii May
1 page
Human Following Robot Report
50% (2)
Human Following Robot Report
21 pages
Lecture 2 Numbering Systems-1
No ratings yet
Lecture 2 Numbering Systems-1
51 pages
Computer Asssignment 6
No ratings yet
Computer Asssignment 6
3 pages
Beginning JSP 2-From Novice To Professional
No ratings yet
Beginning JSP 2-From Novice To Professional
39 pages
2015 IT Risk Assessment Template
No ratings yet
2015 IT Risk Assessment Template
12 pages
OPC UA vs MQTT: Interoperability Challenges
No ratings yet
OPC UA vs MQTT: Interoperability Challenges
7 pages
Green Cloud Computing Term Paper
No ratings yet
Green Cloud Computing Term Paper
5 pages
01 VMS Overview&Concepts
100% (4)
01 VMS Overview&Concepts
32 pages
MS MF RMD Motor CAN Protocol V2.35
No ratings yet
MS MF RMD Motor CAN Protocol V2.35
26 pages
EDME 22.1.0 Installation Guide 02
No ratings yet
EDME 22.1.0 Installation Guide 02
97 pages
Mini Project Report On:: Arduino Based Samrt Notice Board
No ratings yet
Mini Project Report On:: Arduino Based Samrt Notice Board
12 pages
Compiler Building Tutorial: Jack W. Crenshaw
No ratings yet
Compiler Building Tutorial: Jack W. Crenshaw
306 pages
TIB973 Consys 24.4
No ratings yet
TIB973 Consys 24.4
40 pages
Oracle FUsion Application New Oracle Cloud Console
No ratings yet
Oracle FUsion Application New Oracle Cloud Console
5 pages
Sads
No ratings yet
Sads
190 pages
Skyscope Article BTC Seed Finding
No ratings yet
Skyscope Article BTC Seed Finding
6 pages
Assignment of Rstudio PDF
No ratings yet
Assignment of Rstudio PDF
7 pages
Movie Collection Binary File Program
No ratings yet
Movie Collection Binary File Program
16 pages
Ciena 5100 5200 For Service Providers DS
No ratings yet
Ciena 5100 5200 For Service Providers DS
5 pages
Final Exam
No ratings yet
Final Exam
12 pages
Enterprise Networking, Security, and Automation - OSPF Features and Characteristics
No ratings yet
Enterprise Networking, Security, and Automation - OSPF Features and Characteristics
5 pages
XN120 Consolidated Manual
No ratings yet
XN120 Consolidated Manual
200 pages
RS232 Protocol for C210 Inkjet Printer
No ratings yet
RS232 Protocol for C210 Inkjet Printer
25 pages
Computer Architecture - Memory System
100% (1)
Computer Architecture - Memory System
22 pages
NNTN7392 IMPRES BattReader UG
No ratings yet
NNTN7392 IMPRES BattReader UG
63 pages
Unit 4 Societal Impacts
No ratings yet
Unit 4 Societal Impacts
11 pages
Automation and Integration Solutions For Electric Power Systems
No ratings yet
Automation and Integration Solutions For Electric Power Systems
16 pages
Cucumber BDD
No ratings yet
Cucumber BDD
16 pages
Blockchain and Distributed Ledger Technology (DLT)
No ratings yet
Blockchain and Distributed Ledger Technology (DLT)
10 pages
IAL IT Scheme-of-Work U4 011019
No ratings yet
IAL IT Scheme-of-Work U4 011019
35 pages
Review On Cyber Crime and Security
No ratings yet
Review On Cyber Crime and Security
4 pages

Syllabus

Uploaded by

Syllabus

Uploaded by

Course Code Course Theory Practical Tutorial Theory Practical/ Tutorial Total

Sr.No Course Objectives

Sr. Module Detailed Content Hours CO Mapping

University of Mumbai, B. E. (Information Technology), Rev 2016 286

I Introduction to Introduction to Big Data, Big Data characteristics, types of 03 CO1

II Introduction to What is Hadoop? Core Hadoop Components; Hadoop 06 CO2

University of Mumbai, B. E. (Information Technology), Rev 2016 287

Internal Assessment (IA) for 20 marks:

 Question paper format

A total of four questions need to be answered.

University of Mumbai, B. E. (Information Technology), Rev 2016 288

You might also like