NOSQL

Uploaded by

kaisu0726

NOSQL and Query Optimization

NoSQL:
A NoSQL (originally referring to "non-SQL" or "non-relational") database provides a
mechanism for storage and retrieval of data that is modeled in means other than the
tabular relations used in relational databases. Such databases have existed since
the late 1960s, but the name "NoSQL" was only coined in the early 21st
century, triggered by the needs of Web 2.0 companies. NoSQL databases are
increasingly used in big data and real-time web applications. NoSQL systems are
also sometimes called Not only SQL to emphasize that they may support SQL-like
query languages or sit alongside SQL databases in polyglot-persistent architectures.
Motivations for this approach include simplicity of design, simpler "horizontal"
scaling to clusters of machines (which is a problem for relational databases), finer
control over availability, and limiting the object-relational impedance mismatch. The
data structures used by NoSQL databases (e.g. key–value pair, wide column, graph,
or document) are different from those used by default in relational databases,
making some operations faster in NoSQL. The particular suitability of a given NoSQL
database depends on the problem it must solve. Sometimes the data structures used
by NoSQL databases are also viewed as "more flexible" than relational database
tables.

NoSQL databases are generally classified into four main categories:

1. Document databases: These databases store data as semi-structured
documents, such as JSON or XML, and can be queried using
document-oriented query languages.
2. Key-value stores: These databases store data as key-value pairs, and
are optimized for simple and fast read/write operations.
3. Column-family stores: These databases store data as column families,
which are sets of columns that are treated as a single entity. They are
optimized for fast and efficient querying of large amounts of data.
4. Graph databases: These databases store data as nodes and edges,
and are designed to handle complex relationships between data.

Query optimization
Query optimization is a feature of many relational database management
systems and other databases such as NoSQL and graph databases. The query
optimizer attempts to determine the most efficient way to execute a given query by
considering the possible query plans.[1]
Generally, the query optimizer cannot be accessed directly by users: once queries
are submitted to the database server, and parsed by the parser, they are then
passed to the query optimizer where optimization occurs. [2][3] However, some
database engines allow guiding the query optimizer with hints.
Most query optimizers represent query plans as a tree of "plan nodes". A plan node
encapsulates a single operation that is required to execute the query. The nodes are
arranged as a tree, in which intermediate results flow from the bottom of the tree to
the top. Each node has zero or more child nodes—those are nodes whose output is
fed as input to the parent node. For example, a join node will have two child nodes,
which represent the two join operands, whereas a sort node would have a single
child node (the input to be sorted). The leaves of the tree are nodes which produce
results by scanning the disk, for example by performing an index scan or a
sequential scan.
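The tree of plan nodes described above can be sketched in a few lines of Python. This is only an illustration, not how any particular database implements its plans: the `PlanNode` class, the two in-memory tables, and the nested-loop join are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class PlanNode:
    """One operation in a query plan; children feed their output to it."""
    name: str
    run: Callable[[List[list]], list]  # combines the outputs of the children
    children: List["PlanNode"] = field(default_factory=list)

    def execute(self) -> list:
        # Results flow bottom-up: evaluate the children first, then this node.
        inputs = [child.execute() for child in self.children]
        return self.run(inputs)


# Leaf nodes: sequential scans over two hypothetical in-memory tables.
users = [(2, "bo"), (1, "al")]
orders = [(1, "book"), (2, "pen"), (1, "ink")]
scan_users = PlanNode("SeqScan(users)", lambda _: list(users))
scan_orders = PlanNode("SeqScan(orders)", lambda _: list(orders))


def nested_loop_join(inputs):
    """Join node: pairs each user with the orders that share its id."""
    left, right = inputs
    return [(uid, name, item)
            for uid, name in left
            for oid, item in right
            if uid == oid]


# A join node has two children; a sort node has one.
join = PlanNode("Join", nested_loop_join, [scan_users, scan_orders])
sort = PlanNode("Sort", lambda inputs: sorted(inputs[0]), [join])

result = sort.execute()
```

Calling `execute()` on the root evaluates the scans first, then the join, then the sort, which mirrors the bottom-to-top flow of intermediate results.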

Database Optimizer
A query optimizer chooses an optimal index and access paths to execute the
query. At a very high level, SQL optimizers decide the following before creating the
execution tree:

1. Query rewrite based on heuristics, cost, or both.
2. Index selection.
o Select the optimal index(es) for each table (keyspace in Couchbase N1QL,
collection in MongoDB).
o Depending on the index selected, choose the predicates to push down,
determine whether the query is covered, and decide on the sort and
pagination strategy.
3. Join reordering.
4. Join type.
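As a rough illustration of the "covered" check mentioned in the index selection step, a query can be treated as covered when every field it filters on or returns is present in the chosen index. The function name and field names below are hypothetical, and real optimizers apply additional engine-specific rules.

```python
def is_covered(index_fields, filter_fields, projected_fields):
    """Return True if the index alone holds every field the query touches.

    Simplified rule (an assumption for this sketch): a query is covered when
    all filtered and all returned fields are part of the index.
    """
    needed = set(filter_fields) | set(projected_fields)
    return needed <= set(index_fields)


# Hypothetical compound index on (Brand, Max_Speed).
index = ["Brand", "Max_Speed"]

covered = is_covered(index, ["Max_Speed"], ["Brand"])
not_covered = is_covered(index, ["Max_Speed"], ["Color"])
```

When the check fails, the engine has to fetch the full record after the index lookup, which is why the optimizer weighs coverage when choosing among candidate indexes.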

Implementation
Join ordering

Figure: Two possible query plans for the triangle query R(A, B) ⋈ S(B, C) ⋈ T(A, C).
The first joins S and T first and joins the result with R; the second joins R and S
first and joins the result with T.
The performance of a query plan is determined largely by the order in which the
tables are joined. For example, when joining 3 tables A, B, C of size 10 rows, 10,000
rows, and 1,000,000 rows, respectively, a query plan that joins B and C first can take
several orders-of-magnitude more time to execute than one that joins A and C first.
Most query optimizers determine join order via a dynamic programming algorithm
pioneered by IBM's System R database project. This algorithm works in stages:

1. First, all ways to access each relation in the query are computed.
Every relation in the query can be accessed via a sequential scan. If
there is an index on a relation that can be used to answer
a predicate in the query, an index scan can also be used. For each
relation, the optimizer records the cheapest way to scan the relation,
as well as the cheapest way to scan the relation that produces records
in a particular sorted order.
2. The optimizer then considers combining each pair of relations for
which a join condition exists. For each pair, the optimizer will consider
the available join algorithms implemented by the DBMS. It will
preserve the cheapest way to join each pair of relations, in addition to
the cheapest way to join each pair of relations that produces its output
according to a particular sort order.
3. Then all three-relation query plans are computed, by joining each two-
relation plan produced by the previous phase with the remaining
relations in the query.
The optimizer tracks sort order for two reasons. First, an existing sort order
can avoid a redundant sort operation later on in processing the query. Second, a
particular sort order can speed up a subsequent join because it clusters the data
in a particular way.
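The staged search above can be sketched as a small dynamic program over sets of relations. This is a simplified illustration, not the System R algorithm itself. It considers only left-deep plans, ignores sort orders and join conditions, and uses an assumed cost model (the sum of intermediate result sizes) with an assumed selectivity of 1/1000 for every join. The table sizes are the ones from the example in the text.

```python
from itertools import combinations

# Table sizes from the example in the text.
SIZES = {"A": 10, "B": 10_000, "C": 1_000_000}


def best_join_order(sizes, selectivity_denominator=1000):
    """Return (cost, rows, plan) for the cheapest left-deep join order.

    Assumptions for this sketch: every join keeps 1/selectivity_denominator
    of the cross product, and cost is the sum of intermediate result sizes.
    """
    names = sorted(sizes)
    # Stage 1: the cheapest way to access each single relation.
    best = {frozenset([n]): (0, sizes[n], n) for n in names}
    # Later stages: extend the best plan for each smaller set by one relation.
    for k in range(2, len(names) + 1):
        for subset in combinations(names, k):
            key = frozenset(subset)
            for last in subset:
                cost, rows, plan = best[key - {last}]
                out_rows = rows * sizes[last] // selectivity_denominator
                candidate = (cost + out_rows, out_rows, f"({plan} JOIN {last})")
                if key not in best or candidate[0] < best[key][0]:
                    best[key] = candidate
    return best[frozenset(names)]


cost, rows, plan = best_join_order(SIZES)
```

Under these assumptions the cheapest plan joins the two smaller tables first and brings in C last, for a cost of 100,100. A plan that joins B and C first would cost 10,100,000, roughly a hundred times more.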
Types of NoSQL Database

• Document-based databases
• Key-value stores
• Column-oriented databases
• Graph-based databases

Document-Based Database:

A document-based database is a non-relational database. Instead of storing
data in rows and columns (tables), it stores data as documents, typically in
JSON, BSON, or XML format.
Documents can be stored and retrieved in a form that is much closer to the data
objects used in applications, so less translation is required to use the data
in an application. Particular fields of a document can be indexed for faster
querying.
A collection is a group of documents with similar contents. Documents in the
same collection are not required to share a schema, because document databases
have a flexible schema.
Key features of document databases:
• Flexible schema: Documents in the database have a flexible schema, which
means they need not all follow the same schema.
• Faster creation and maintenance: Creating a document is easy, and
minimal maintenance is required once it is created.
• No foreign keys: There is no enforced relationship between two documents,
so documents can be independent of one another and no foreign key is
required.
• Open formats: Documents are built using open formats such as XML and JSON.
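The flexible-schema idea can be illustrated with plain Python dictionaries standing in for documents. The collection contents below are made up for the example and do not come from any real database.

```python
import json

# A hypothetical "transport" collection: documents need not share a schema.
collection = [
    {"_id": 1, "Brand": "Benz", "Max_Speed": 250, "Color": "Green"},
    {"_id": 2, "Brand": "Audi", "Max_Speed": 220},                       # no Color
    {"_id": 3, "Brand": "Tata", "Features": {"Seats": 5, "ABS": True}},  # nested
]


def fields_in_use(docs):
    """Collect every top-level field name that appears in any document."""
    names = set()
    for doc in docs:
        names.update(doc)
    return sorted(names)


# Documents round-trip through JSON without any table definition.
restored = [json.loads(json.dumps(doc)) for doc in collection]
all_fields = fields_in_use(collection)
```

Each document carries its own field names, so adding a new field to one document requires no change to the others.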

Key-Value Stores:

A key-value store is a non-relational database and the simplest form of a
NoSQL database. Every data element is stored as a key-value pair, and the data
is retrieved by using the unique key allotted to each element. The values can
be simple data types like strings and numbers or complex objects.
A key-value store is like a relational table with only two columns: the key
and the value.
Key features of the key-value store:
• Simplicity.
• Scalability.
• Speed.
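A minimal in-memory sketch shows why the model is simple and fast: every operation is a single lookup by key. The `KeyValueStore` class is a hypothetical illustration built on a Python dictionary, not the API of any real product.

```python
class KeyValueStore:
    """Toy key-value store: one unique key maps to one value."""

    def __init__(self):
        self._data = {}

    def put(self, key, value):
        # Inserting an existing key overwrites the previous value.
        self._data[key] = value

    def get(self, key, default=None):
        return self._data.get(key, default)

    def delete(self, key):
        """Remove the key; return True if it existed."""
        if key in self._data:
            del self._data[key]
            return True
        return False


store = KeyValueStore()
store.put("user:1", "Asha")                           # simple value
store.put("cart:1", {"items": ["pen"], "total": 20})  # complex object
```

Values are opaque to the store, so a string and a nested object are handled in exactly the same way.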

Column Oriented Databases:

A column-oriented database is a non-relational database that stores data in
columns instead of rows. When we want to run analytics on a small number of
columns, we can read those columns directly without consuming memory on
unwanted data.
Columnar databases are designed to read data more efficiently and retrieve it
with greater speed, and they are typically used to store large amounts of data.
Key features of column-oriented databases:
• Scalability.
• Compression.
• Very responsive.
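The difference between the two layouts can be sketched with ordinary Python structures. The sample records are invented, and the sketch assumes every record has the same fields.

```python
# Row layout: each record keeps all of its fields together.
rows = [
    {"id": 1, "city": "Pune", "amount": 120},
    {"id": 2, "city": "Delhi", "amount": 80},
    {"id": 3, "city": "Pune", "amount": 50},
]


def to_columns(records):
    """Pivot row-oriented records into one list per column."""
    columns = {}
    for record in records:
        for name, value in record.items():
            columns.setdefault(name, []).append(value)
    return columns


columns = to_columns(rows)

# An aggregate over one column reads only that column's values.
total = sum(columns["amount"])
```

In the columnar layout the sum never touches `id` or `city`, which is the effect the text describes for analytics over a few columns.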
Graph-Based Databases:

Graph-based databases focus on the relationships between elements. They store
data as nodes, and the connections between the nodes are called links or
relationships.
Key features of graph databases:
• It is easy to identify the relationships between data by following the
links.
• Queries return results in real time.
• The speed depends upon the number of relationships among the database
elements.
• Updating data is also easy, as adding a new node or edge to a graph
database is a straightforward task that does not require significant schema
changes.
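Nodes and relationships can be sketched with an adjacency structure. The names and the helper functions below are hypothetical, and real graph databases use their own storage formats and query languages.

```python
from collections import deque


def add_edge(graph, a, b):
    """Add an undirected relationship; unknown nodes are created on the fly."""
    graph.setdefault(a, set()).add(b)
    graph.setdefault(b, set()).add(a)


def hops_between(graph, start, goal):
    """Breadth-first search: relationships on the shortest path, or None."""
    if start not in graph or goal not in graph:
        return None
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        node, dist = queue.popleft()
        if node == goal:
            return dist
        for neighbour in graph[node]:
            if neighbour not in seen:
                seen.add(neighbour)
                queue.append((neighbour, dist + 1))
    return None


graph = {}
add_edge(graph, "alice", "bob")
add_edge(graph, "bob", "carol")
before = hops_between(graph, "alice", "carol")

# Adding a new relationship needs no schema change.
add_edge(graph, "alice", "carol")
after = hops_between(graph, "alice", "carol")
```

Following links from node to node is the basic query operation, so the work done grows with the number of relationships that have to be traversed.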
Querying in NoSQL:
Suppose we want to get specific results from the transport collection, which
contains the following document.
{
"Brand": "Benz",
"Max_Speed": 250,
"Color": "Green"
}
1. To display the vehicles which have a speed greater than 100.
Query:
> db.transport.find({ Max_Speed: { $gt: 100 } }).pretty()
Output:
{
"Brand": "Benz",
"Max_Speed": 250,
"Color": "Green"
}
2. To display the vehicles which have a speed equal to 250.
Query:
> db.transport.find({ Max_Speed: { $eq: 250 } }).pretty()
Output:
{
"Brand": "Benz",
"Max_Speed": 250,
"Color": "Green"
}
$eq – This operator matches documents in which the field's value is equal to
the specified value. MongoDB provides similar comparison operators: $gt (greater
than), $gte (greater than or equal to), $lt (less than), $lte (less than or
equal to), and $ne (not equal).
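To make the semantics of these comparison operators concrete, the sketch below emulates them over an in-memory list of dictionaries. It is not MongoDB code: the `find` and `matches` helpers and the second sample document are assumptions for the example. It is also simplified, because a document that lacks the queried field never matches here, whereas MongoDB treats some operators such as $ne differently in that case.

```python
import operator

# Map each comparison operator name to the equivalent Python comparison.
OPERATORS = {
    "$eq": operator.eq,
    "$ne": operator.ne,
    "$gt": operator.gt,
    "$gte": operator.ge,
    "$lt": operator.lt,
    "$lte": operator.le,
}


def matches(doc, query):
    """True if the document satisfies every condition in the query."""
    for field_name, condition in query.items():
        if field_name not in doc:
            return False  # simplification: missing fields never match
        for op_name, expected in condition.items():
            if not OPERATORS[op_name](doc[field_name], expected):
                return False
    return True


def find(collection, query):
    return [doc for doc in collection if matches(doc, query)]


transport = [
    {"Brand": "Benz", "Max_Speed": 250, "Color": "Green"},
    {"Brand": "Cart", "Max_Speed": 40, "Color": "Red"},  # hypothetical document
]

fast = find(transport, {"Max_Speed": {"$gt": 100}})
exact = find(transport, {"Max_Speed": {"$eq": 250}})
slow = find(transport, {"Max_Speed": {"$lte": 40}})
```

Field names are case-sensitive, so a query on `Max_speed` would not match a field stored as `Max_Speed`.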
Indexing in MongoDB:
MongoDB uses indexes to make query processing more efficient. Without an
index, MongoDB must scan every document in the collection to find the
documents that match the query. Indexes are special data structures that
store a small part of each document's data (the values of the indexed fields)
so that MongoDB can locate matching documents quickly. The entries in an
index are ordered by the value of the field specified in the index.
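The benefit of keeping index entries ordered by field value can be sketched with a sorted list and binary search. This is an illustration only: the sample documents are invented, and a sorted Python list stands in for the tree structures real databases use.

```python
import bisect

docs = [
    {"name": "Asha", "age": 31},
    {"name": "Ravi", "age": 24},
    {"name": "Mina", "age": 31},
    {"name": "Omar", "age": 45},
]


def build_index(documents, field_name):
    """Sorted list of (value, position) pairs, ordered by field value."""
    return sorted((doc[field_name], pos) for pos, doc in enumerate(documents))


def find_equal(documents, index, value):
    """Binary-search the index instead of scanning every document."""
    start = bisect.bisect_left(index, (value, -1))
    found = []
    for key, pos in index[start:]:
        if key != value:
            break  # entries are ordered, so no later entry can match
        found.append(documents[pos])
    return found


age_index = build_index(docs, "age")
aged_31 = find_equal(docs, age_index, 31)
```

The lookup jumps straight to the first matching entry and stops at the first non-matching one, so it inspects only a small part of the index rather than every document.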
Creating an Index:
MongoDB provides a method called createIndex() that allows a user to create an
index.
Syntax –

db.COLLECTION_NAME.createIndex({KEY:1})
The key is the field on which the index is created, and 1 (or -1) sets the
order of the index entries (ascending or descending).
Example –

db.mycol.createIndex({"age":1})
{
"createdCollectionAutomatically" : false,
"numIndexesBefore" : 1,
"numIndexesAfter" : 2,
"ok" : 1
}
The createIndex() method also has a number of optional parameters.
These include:

• background (Boolean)
• unique (Boolean)
• name (string)
• sparse (Boolean)
• expireAfterSeconds (integer)
• hidden (Boolean)
• storageEngine (Document)

Drop an index:
To drop an index, MongoDB provides the dropIndex() method.
Syntax –

db.NAME_OF_COLLECTION.dropIndex({KEY:1})
The dropIndex() method can delete only one index at a time. To delete (or
drop) multiple indexes from the collection, MongoDB provides the dropIndexes()
method, which accepts an array of index names (MongoDB 4.2 and later). Called
without arguments, it drops every index except the one on _id.
Syntax –

db.NAME_OF_COLLECTION.dropIndexes(["INDEX_NAME_1", "INDEX_NAME_2"])

Get a description of all indexes:
The getIndexes() method in MongoDB returns a description of every index that
exists in the given collection.
Syntax –

db.NAME_OF_COLLECTION.getIndexes()

Ordering data sets in NoSQL databases:

• One way to index and order data sets in NoSQL is to use a clustering order
option (for example, CLUSTERING ORDER BY in Apache Cassandra), which lets you
specify the sort order of the data within a partition.
• Another way is to use a time bucketing technique, which limits the number of
records per partition by grouping them into time intervals.
• You can also use different types of NoSQL databases, such as document,
key-value, column-family or graph, depending on your data model and query
needs.
• You should also consider the size of your data sets and the performance of
your NoSQL system, as they may affect the query speed and efficiency.
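Time bucketing and ordering within a partition can be sketched as follows. The sensor identifier, the one-day bucket size, and the helper names are assumptions for the example, and sorting on read merely emulates what a database with a descending clustering order would maintain on disk.

```python
from datetime import datetime, timezone


def bucket_key(sensor_id, ts, bucket_format="%Y-%m-%d"):
    """Partition key = entity id plus the time bucket (one day by default)."""
    return (sensor_id, ts.strftime(bucket_format))


def store(partitions, sensor_id, ts, value):
    """Append a reading to the partition for its sensor and day."""
    partitions.setdefault(bucket_key(sensor_id, ts), []).append((ts, value))


def read_partition(partitions, key):
    """Return one partition, newest reading first (descending order)."""
    return sorted(partitions.get(key, []), reverse=True)


partitions = {}
utc = timezone.utc
store(partitions, "s1", datetime(2024, 1, 1, 9, 0, tzinfo=utc), 20.5)
store(partitions, "s1", datetime(2024, 1, 1, 18, 0, tzinfo=utc), 22.0)
store(partitions, "s1", datetime(2024, 1, 2, 9, 0, tzinfo=utc), 19.0)

first_day = read_partition(partitions, ("s1", "2024-01-01"))
```

Each day's readings land in their own partition, so no single partition grows without bound, and a query for recent data touches only the newest buckets.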
NOSQL in Cloud

NoSQL Cloud Database Services are cloud-based database services that provide
scalable, high-performance, and cost-effective solutions for storing and retrieving
data. NoSQL (Not Only SQL) databases are designed to handle large volumes of
unstructured, semi-structured, and structured data, and can easily scale
horizontally to accommodate increased data volumes.

Cloud-based NoSQL databases offer several advantages over traditional
on-premise databases. These include:

1. Scalability: Cloud-based NoSQL databases can easily scale horizontally
by adding more servers to the cluster. This allows for seamless scalability
as data volumes increase.
2. High availability: NoSQL cloud databases are designed to be highly
available and can provide reliable uptime and performance, which is
critical for many applications.
3. Reduced cost: Cloud-based NoSQL databases can be more cost-
effective than traditional on-premise databases because they eliminate
the need for expensive hardware and infrastructure. This can be
particularly beneficial for small to medium-sized businesses that do not
have the resources to invest in expensive hardware.
4. Improved performance: Cloud-based NoSQL databases can provide high
performance and low latency, making them well-suited for applications
that require fast and efficient data access.
5. Flexibility: Cloud-based NoSQL databases are designed to handle
unstructured, semi-structured, and structured data, making them a
flexible solution for a wide range of applications.

Some popular NoSQL Cloud Database Services include:

1. Amazon DynamoDB: A fully managed NoSQL database service offered
by Amazon Web Services (AWS) that provides fast and predictable
performance with seamless scalability.
2. Google Cloud Datastore: A NoSQL document database service that is
fully managed and offers automatic scaling, high availability, and low
latency.
3. Microsoft Azure Cosmos DB: A globally distributed, multi-model database
service that provides high availability, low latency, and flexible data
modeling.
4. MongoDB Atlas: A fully managed global cloud database service for
MongoDB that provides automated backups, advanced security, and
easy scalability.

Overall, NoSQL Cloud Database Services provide a flexible, scalable,
and cost-effective solution for storing and retrieving data in the cloud.
They offer several advantages over traditional on-premise databases and
can be an excellent choice for businesses of all sizes that need to store
and manage large volumes of data.
