0% found this document useful (0 votes)

40 views10 pages

Intro-Databases For Big Data

Uploaded by

Xenos Playground aka Boxman Studios

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

40 views10 pages

Intro-Databases For Big Data

Uploaded by

Xenos Playground aka Boxman Studios

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 10

DATS310 d

Databases for Big Data

DR. RICHA SHARMA

C O M M O N W E A LT H U N I V E R S I T Y

1
Introduction
 Architecture for databases:
 Focuses on storage and organization of information to
allow easy access and modification (insert, update, delete
operation) of data.

 Database design and application development depends a

lot on database architecture!

 Architectural design of Database varies just as network

topology varies.

 Helps in identifying which database design is best suitable

for the problem at hand, i.e. the application to be
developed!
2
Tools/Technologies for Big Data
 Few Examples:
 Apache Hadoop, Spark, Kafka, Hive, Storm

 MongoDB and CouchDB

 Redis, Cassandra and Neo4j

 Druid and Google Big Query

 AWS DynamoDB

 Google Big Query

 Tableau

3
Questions to explore
 Type of database – does the problem at hand requires
relational database, key-value pair database, columnar
database, document-oriented database or graph
database?

 Nature of problem and usage of database – does the

problem require flexibility or does it require parallel
processing?

 Communication interface of database – are we going to

interact with database through an interactive command-like
interface or through the application requiring database
connectivity and programming language interfacing?

4
Questions to explore
 Unique characteristic of database – Any database will support
writing data and reading it back again, but what makes it
unique? Some allow querying on arbitrary fields; some
provide indexing for rapid lookup; some support ad hoc
queries, while queries must be planned for others.
 Performance – How does this database function and at what
cost? How about replication? Is this database tuned for
reading, writing, or some other operation?
 Scalability – Scalability closely related to performance and
point to explore is if the database is geared more for
horizontal scaling (MongoDB, HBase, DynamoDB) or
traditional vertical scaling (Postgres, Neo4J, Redis), or
something in between.
5
RDBMS vs Big Databases

6
Key-Value Pair Database
 Simplest database model, storing data as key-value (KV) pair
just like a hash-table.
 Some KV implementations provide a means of iterating
through the keys, but not all!
 A file system can be considered a key-value store assuming
the file path as the key and the file contents as the value.
 Since this database model doesn’t require complex data
structures for storage, it can be incredibly performant in a
number of scenarios but generally won’t be helpful when we
have complex query and aggregation requirements.
 Example: Redis, DynamoDB, Voldemort, Riak etc.

7
Columnar Database
 Columnar, or column-oriented, databases are so named
because these database store the data from a given column
(in the two-dimensional table sense) together, as opposite to
row-oriented databases (RDBMS).

 These databases make adding columns to table quite

inexpensive, and this is done on a row-by-row basis.

 Each row can have a different set of columns, or none at all,

allowing tables to remain sparse without incurring a storage
cost for null values.

 With respect to structure, columnar is about midway between

relational and key-value. Example: HBase, Cassandra etc.
8
Document Database
 Meant to store documents, considering a document like a
hash, with a unique ID field, and values that may be any of a
variety of types, including more hashes.
 Documents can contain nested structures, and so they exhibit
a high degree of flexibility, allowing for variable domains.
 But, the system imposes few restrictions on incoming data, as
long as it meets the basic requirement of being expressible as
a document.
 Different document databases take different approaches with
respect to indexing, ad hoc querying, replication, consistency,
and other design decisions.
 Example: MongoDB, CouchDB etc.
9
Graph Database
 Less commonly used database styles, but graph databases
are best for working with highly interconnected data.

 A graph database consists of nodes and relationships

between nodes.

 Both nodes and relationships can have properties and key-

value pairs that store data.

 Real strength of graph databases is traversing through the

nodes by following relationships..

 Example: Neo4J, Polyglot etc.

Lecture 6 - NoSQL
No ratings yet
Lecture 6 - NoSQL
43 pages
Chap 4
No ratings yet
Chap 4
18 pages
4.1 Intro Nosql
No ratings yet
4.1 Intro Nosql
43 pages
Unit 2
No ratings yet
Unit 2
65 pages
4.1 Intro Nosql-Converted-133751863122661863
No ratings yet
4.1 Intro Nosql-Converted-133751863122661863
43 pages
Bda CHP 3
No ratings yet
Bda CHP 3
75 pages
Unit 2
No ratings yet
Unit 2
26 pages
BD Unit 4
No ratings yet
BD Unit 4
45 pages
Bcse302l Dbms Module-7 Nosql
No ratings yet
Bcse302l Dbms Module-7 Nosql
30 pages
NoSQL Database Overview Lecture
No ratings yet
NoSQL Database Overview Lecture
22 pages
Advance Database
No ratings yet
Advance Database
5 pages
Lecture 3.1.2
No ratings yet
Lecture 3.1.2
47 pages
No SQL
No ratings yet
No SQL
12 pages
Big Data Unit 3
No ratings yet
Big Data Unit 3
374 pages
Types of NoSQL Databases - GeeksforGeeks
No ratings yet
Types of NoSQL Databases - GeeksforGeeks
9 pages
Unit 5
No ratings yet
Unit 5
36 pages
Module 3 Bigdata Analytics
No ratings yet
Module 3 Bigdata Analytics
19 pages
4.1 Intro Nosql
No ratings yet
4.1 Intro Nosql
43 pages
NoSQL Database
No ratings yet
NoSQL Database
45 pages
MongoDB Slides Until ClassTest
No ratings yet
MongoDB Slides Until ClassTest
221 pages
Unit 6
No ratings yet
Unit 6
143 pages
NoSQL Unit 1 & 2 QnA
No ratings yet
NoSQL Unit 1 & 2 QnA
18 pages
4.1 Intro Nosql
No ratings yet
4.1 Intro Nosql
45 pages
No SQL
No ratings yet
No SQL
32 pages
Unit 3
No ratings yet
Unit 3
7 pages
Unit III (FSWD)
No ratings yet
Unit III (FSWD)
27 pages
Chapter 6b - No SQL
No ratings yet
Chapter 6b - No SQL
27 pages
CH.5 NOSQL Database For Business Applications
No ratings yet
CH.5 NOSQL Database For Business Applications
21 pages
Lecture 1 - NoSQL
No ratings yet
Lecture 1 - NoSQL
31 pages
Cs 620 / Dasc 600 Introduction To Data Science & Analytics: Lecture 6-Nosql
No ratings yet
Cs 620 / Dasc 600 Introduction To Data Science & Analytics: Lecture 6-Nosql
31 pages
Unit 3 Nosql Databases Adt
No ratings yet
Unit 3 Nosql Databases Adt
64 pages
Database Advice Guide
No ratings yet
Database Advice Guide
19 pages
Nosql
No ratings yet
Nosql
10 pages
NoSQL Databases Explained
No ratings yet
NoSQL Databases Explained
8 pages
No SQL
No ratings yet
No SQL
38 pages
BIG Data - Storing Data
No ratings yet
BIG Data - Storing Data
40 pages
Session 8 - NoSQL
No ratings yet
Session 8 - NoSQL
17 pages
Types of Databases
No ratings yet
Types of Databases
9 pages
NOSQL Lecture 1 Notes
No ratings yet
NOSQL Lecture 1 Notes
31 pages
3.2NOSQL Categories
No ratings yet
3.2NOSQL Categories
7 pages
3.2NOSQL Categories
No ratings yet
3.2NOSQL Categories
7 pages
Bda Notes (Unit-2)
No ratings yet
Bda Notes (Unit-2)
26 pages
BDA Module 5 - Part1 (No SQL) 2023
No ratings yet
BDA Module 5 - Part1 (No SQL) 2023
32 pages
D B M S: ATA ASE Anage Me NT Ystem
No ratings yet
D B M S: ATA ASE Anage Me NT Ystem
114 pages
Types of Databases Explained
No ratings yet
Types of Databases Explained
5 pages
Data 1
No ratings yet
Data 1
4 pages
Lecture 1
No ratings yet
Lecture 1
31 pages
DATABASE II, Note
No ratings yet
DATABASE II, Note
16 pages
06 NoSQL
No ratings yet
06 NoSQL
80 pages
Ebook Database Advice Guide
No ratings yet
Ebook Database Advice Guide
19 pages
DBMS (UNIT-6) (Advances in Databases and Big Data)
No ratings yet
DBMS (UNIT-6) (Advances in Databases and Big Data)
103 pages
Seminar Topic Nosql
No ratings yet
Seminar Topic Nosql
73 pages
cp5293 Big Data Analytics Unit 5 PDF
No ratings yet
cp5293 Big Data Analytics Unit 5 PDF
28 pages
BIG Data 2
No ratings yet
BIG Data 2
18 pages
No SQL
No ratings yet
No SQL
32 pages
NoSQL DATABSES
No ratings yet
NoSQL DATABSES
12 pages
NoSQL for Developers and IT Pros
No ratings yet
NoSQL for Developers and IT Pros
3 pages
Unit - 1 Part - 1
No ratings yet
Unit - 1 Part - 1
20 pages
Unit 3 NoSQL
No ratings yet
Unit 3 NoSQL
98 pages
Chapter 08 2
No ratings yet
Chapter 08 2
20 pages
Lhu Comp 200: Chapter 2 (2 C) Application Layer
No ratings yet
Lhu Comp 200: Chapter 2 (2 C) Application Layer
37 pages
Chapter 06
No ratings yet
Chapter 06
46 pages
Chapter 04
No ratings yet
Chapter 04
29 pages
Chapter 14
No ratings yet
Chapter 14
35 pages
Chapter 02
No ratings yet
Chapter 02
45 pages
Review of DB Concepts
No ratings yet
Review of DB Concepts
27 pages
Chapter 3 J v8.0 V04
No ratings yet
Chapter 3 J v8.0 V04
150 pages
SQL Triggers & Functions
No ratings yet
SQL Triggers & Functions
16 pages
Columnar Databases for Data Analysts
No ratings yet
Columnar Databases for Data Analysts
18 pages
Eliot PsychoanalyticInterpretationGroup 1920
No ratings yet
Eliot PsychoanalyticInterpretationGroup 1920
21 pages
SQL Views & Procedures
No ratings yet
SQL Views & Procedures
23 pages
Query Optimization
No ratings yet
Query Optimization
10 pages
CAP Theorem
No ratings yet
CAP Theorem
15 pages
SQL Queries5
No ratings yet
SQL Queries5
20 pages
Deutsch GroupFormation 1973
No ratings yet
Deutsch GroupFormation 1973
20 pages
Chapter 6 Management A Practical Introduction
No ratings yet
Chapter 6 Management A Practical Introduction
6 pages
Review - Normal Forms2
No ratings yet
Review - Normal Forms2
17 pages
SQL Functions
No ratings yet
SQL Functions
18 pages
Examining Maslow's Hierarchy Need Theory in The Social Media Adoption
No ratings yet
Examining Maslow's Hierarchy Need Theory in The Social Media Adoption
11 pages
Quality Indicators For The Care of Older Adults W Disabilities in Longterm Care Wbased On Maslow Hierarchy of Needs
No ratings yet
Quality Indicators For The Care of Older Adults W Disabilities in Longterm Care Wbased On Maslow Hierarchy of Needs
7 pages
A Suggested Modification To Maslow's Need Hierarchy
No ratings yet
A Suggested Modification To Maslow's Need Hierarchy
6 pages
21st Century Boys v01, (2007) (Joufu + Obxist)
100% (1)
21st Century Boys v01, (2007) (Joufu + Obxist)
197 pages
Relativism in Ethics - William Shaw
No ratings yet
Relativism in Ethics - William Shaw
4 pages
86EIGHTY-SIX Vol 10 Light Novel Fragmental Neoteny - Asato Asato
No ratings yet
86EIGHTY-SIX Vol 10 Light Novel Fragmental Neoteny - Asato Asato
289 pages
BLAME! Master Edition v01 (2016) (Digital) (Danke-Empire)
100% (1)
BLAME! Master Edition v01 (2016) (Digital) (Danke-Empire)
396 pages
21st Century Boys v02, (2007) (Obxist)
No ratings yet
21st Century Boys v02, (2007) (Obxist)
205 pages
BLAME! Master Edition v02 (2016) (Digital) (Danke-Empire)
No ratings yet
BLAME! Master Edition v02 (2016) (Digital) (Danke-Empire)
364 pages
Works of Arthur Schopenhauer - Arthur Schopenhauer
100% (2)
Works of Arthur Schopenhauer - Arthur Schopenhauer
2,370 pages
BLAME! Master Edition v03 (2017) (Digital) (Danke-Empire)
100% (1)
BLAME! Master Edition v03 (2017) (Digital) (Danke-Empire)
341 pages
Recent Trends in Database Technology
No ratings yet
Recent Trends in Database Technology
9 pages
(2022) Knowledge Graph - A Giude Tour (21 Pages)
No ratings yet
(2022) Knowledge Graph - A Giude Tour (21 Pages)
21 pages
Database Security for Students
No ratings yet
Database Security for Students
17 pages
AI Solution for Pharma EHR Challenges
No ratings yet
AI Solution for Pharma EHR Challenges
8 pages
Free Oracle 1z0 184 25 Exam Questions by Britt
100% (1)
Free Oracle 1z0 184 25 Exam Questions by Britt
11 pages
BE AIDS R 20 V VI Sem Syllabus Compressed
No ratings yet
BE AIDS R 20 V VI Sem Syllabus Compressed
59 pages
Big Data Report
60% (5)
Big Data Report
20 pages
Ragbuilder Env
No ratings yet
Ragbuilder Env
7 pages
Neo4j - WP Fraud Detection With Graph Databases
No ratings yet
Neo4j - WP Fraud Detection With Graph Databases
12 pages
Neo4j Arangodb Mongodb Comparison
No ratings yet
Neo4j Arangodb Mongodb Comparison
3 pages
Unlocking DBT: Design and Deploy Transformations in Your Cloud Data Warehouse Cameron Cyr Download
No ratings yet
Unlocking DBT: Design and Deploy Transformations in Your Cloud Data Warehouse Cameron Cyr Download
62 pages
Mongodb and Neo4j Practicals
No ratings yet
Mongodb and Neo4j Practicals
12 pages
Report
No ratings yet
Report
86 pages
Ejas 12348
No ratings yet
Ejas 12348
27 pages
Database Admin & MySQL Basics FAQ
No ratings yet
Database Admin & MySQL Basics FAQ
123 pages
Dbms All Units Notes
No ratings yet
Dbms All Units Notes
140 pages
NoSQL for Computer Engineers
No ratings yet
NoSQL for Computer Engineers
37 pages
Magic Quadrant For C 763557 NDX
No ratings yet
Magic Quadrant For C 763557 NDX
51 pages
AWS Database Products Infographic
No ratings yet
AWS Database Products Infographic
1 page
Building Web Applications With Python and Neo4j 1st Edition Sumit Gupta PDF Download
100% (10)
Building Web Applications With Python and Neo4j 1st Edition Sumit Gupta PDF Download
61 pages
Fdsa Unit 1
No ratings yet
Fdsa Unit 1
25 pages
Nosql Technology
No ratings yet
Nosql Technology
8 pages
Data Science Unit-1 B.sc. III Sem. MDC
No ratings yet
Data Science Unit-1 B.sc. III Sem. MDC
10 pages
Exploration of LLM Multi-Agent Application Implementation Based On LangGraph+CrewAI.18241v1
No ratings yet
Exploration of LLM Multi-Agent Application Implementation Based On LangGraph+CrewAI.18241v1
3 pages
Graph Technology Buyers Guide EN A4
No ratings yet
Graph Technology Buyers Guide EN A4
34 pages
Distributed Databases & Security
No ratings yet
Distributed Databases & Security
27 pages
RAG for LLMs: A Comprehensive Survey
No ratings yet
RAG for LLMs: A Comprehensive Survey
26 pages
Neptune PDF
No ratings yet
Neptune PDF
34 pages
Database As A Service
No ratings yet
Database As A Service
61 pages

Intro-Databases For Big Data

Uploaded by

Intro-Databases For Big Data

Uploaded by

DATS310 d

Databases for Big Data

DR. RICHA SHARMA

 Database design and application development depends a

 Architectural design of Database varies just as network

 Helps in identifying which database design is best suitable

 MongoDB and CouchDB

 Redis, Cassandra and Neo4j

 Druid and Google Big Query

 Google Big Query

 Nature of problem and usage of database – does the

 Communication interface of database – are we going to

 These databases make adding columns to table quite

 Each row can have a different set of columns, or none at all,

 With respect to structure, columnar is about midway between

 A graph database consists of nodes and relationships

 Both nodes and relationships can have properties and key-

 Real strength of graph databases is traversing through the

 Example: Neo4J, Polyglot etc.

You might also like