SYLLABUS
CHANDIGARH UNIVERSITY, GHARUAN
Big Data Analytics L T P C
Subject Code Total Contact Hours : 45Hours
(CST-301) Common to all Specializations of CSE 3rd 3 0 0 3
Year
Marks-100
Internal-40 External-60
Course Objectives
To have good understanding of BigData concepts and design of HDFS for handling
Big Data.
To learn how to use PIG, HIVE, SCOOP, HBASE, OOZIE and FLUME.
To understand Map Reduce Architecture.
To understand daemons of Hadoop and their functionality.
Unit Course Outcomes
I Be familiar in Handling BigData for storage and retrieval using Hadoop
II Be familiar with the Job Tracker, Task Tracker with Name Node and Data Nodes.
III Familiar with uses of FLUME, OOZIE,HBASE,HIVE and SCOOP
Contents of the Syllabus
UNIT-I [15h]
Introduction to BigData and Hadoop: BigData and its Characterstics, Problems with BigData,
Handling BigData, Difference between Structured, Semi- Structured and Unstructured Data.
Introduction to Hadoop, Scope of Hadoop, Components of Hadoop,
Hadoop Distributed File System: Introduction of HDFS, HDFS Design, HDFS role in Hadoop, Features
of HDFS, Daemons of Hadoop and its functionality- Name Node, Data Node, Secondary Name Node,
Job Tracker, Task Tracker.
UNIT-II [15h]
HDFS Architecture: Concept of Nodes, Racks and Data Center. Basic Configuration for HDFS. Data
Organization- Blocks and Replication. Anatomy of File Write, Anatomy of File Read. Rack Awareness,
Heartbeat Signal. Storing and Reading Data into HDFS.
MapReduce: Introduction of MapReduce, MapReduce Architecture, Data flow in MapReduce- Splits,
Mapper , Partitioner, Sort and shuffle, Combiner, Reducer. Difference between Block and InputSplit.
Role of RecordReader, Basic Configuration of MapReduce, MapReduce life cycle- Driver Code,
Mapper and Reducer. Working of MapReduce.
UNIT-III [15h]
Introduction to PIG, SQOOP and HIVE: Introduction to PIG Data Flow Engine, Uses of PIG, Modes of
Execution in PIG- Local Mode and MapReduce Mode. Introduction to SQOOP, Use of SQOOP,
Introduction to HIVE, HIVE Architecture.
Introduction to HBASE, OOZIE and FLUME: Introduction to HBASE, Basic Fundamentals of HBase.
Introduction to OOZIE, Use of OOZIE, Introduction to Flume, Uses of Flume, Flume Architecture- Flume
Master, Flume Collectors, Flume Agents.
Text Books:
1. Data Analytics by Radha Shankarmani, M. Vijayalakshmi by Technical Publications.
2. Big Data Analytics with R and Hadoop by Vignesh Prajapati.
Reference Books:
1. Hadoop - The Definitive Guide by Tome White, O’REILLY Publications.
2. Hadoop In Practice by Alex Holmes, DreamTech Publications.