KEMBAR78
DMS Chapter 5 | PDF | Database Transaction | Databases
0% found this document useful (0 votes)
77 views21 pages

DMS Chapter 5

Uploaded by

juimakhare
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
77 views21 pages

DMS Chapter 5

Uploaded by

juimakhare
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 21

Database Management System

313302
Unit - V Database Administration
5.1 Introduction to database administration: - Types of database users, Create and
delete users, Assign privileges to users
5.2 Transaction: Concept, Properties & States of Transaction
5.3 Database Backup: Types of Failures, Causes of Failure, Database backup
introduction, types of database backups: Physical & Logical
5.4 Data Recovery – Recovery concepts, recovery techniques- roll forward,
Rollback
5.5 Overview of Advanced database concepts: - Data Warehouse, Data lakes, Data
mining, Big data, Mongo DB, DynamoDB,
---------------------------
Introduction To Database Administration:
database users
Database users in DBMS can access the database and retrieve the data from the
database using applications and interfaces provided by the Database Management
System (DBMS).
Database users in DBMS can be categorized based on their interaction with the
databases. According to the tasks performed by the database users on the databases, we
can categorize them into four categories as follows:
 Naive Users
 Application Programmers
 Sophisticated Users
 Specialized Users
Naive Users
These naive users are those users who don’t have any database knowledge. They
depend on pre-developed applications like Bank Management Systems, Library
Management Systems, Hospital Management Systems, and Railway Ticket Booking
Systems (IRCTC) and get the desired result.
Application Programmers
These programmers write the code for an application program that uses the
database. The application programmer can make the application according to user
requirements and control software that runs an entire computer system. The
application program is written in any programming language like C#, .net, JAVA, etc., and
focuses on business, engineering, and science program.
Sophisticated Users
Sophisticated users are the type of database users in DBMS who know DBMS
(DDL & DML commands) and are familiar with the database. Sophisticated users can be
business analysts, engineers, scientists, system analysts, etc.
Sophisticated users can develop or access their database applications according to
the requirements, without actually writing the program code for it. These users are also
known as SQL programmers as they can interact with the database directly using SQL
queries using query processors. Using SQL queries, they can fetch the data from the
database. They can also delete, update or insert new data into the database.
Specialized Users
The special users are responsible for writing specialized database-related
programs and also have the task of creating the actual database as well as implementing
technical controls needed to enforce policies and decisions.
Database Administrator (DBA)
The database administrator is responsible for the overall planning of the
company's data resources, for the design of data and for the day-to-day operational
aspects of data management.
A database administrator is a person responsible for the installation,
con5iguration, upgradation, maintenance and monitoring databases in an organization.
Responsibilities of DBA in enterprises:
 Designing schema.
 Deciding on the storage and access methods.
 Selecting database software and hardware.
 Designing the means of reorganizing databases periodically.
 Designing database searching strategies.
 Designing authorization checks and validation procedures.
 Designing restart and recovery procedures to take care of system crashes.
 Specifying techniques for monitoring database performance.
The operations Database Administration management of database administration deals
with data problems arising on a day-to-day basis. Speci5ically, the responsibilities
include:
 Investigation of errors found in the data.
 Supervision of restart and recovery procedures in the event of a failure.
 Supervision of reorganization of databases.
 Initiation and control of all periodic dumps of data.
Skills required for DBA
 Good communication skills.
 Excellent knowledge of databases architecture and design and RDBMS (Oracle,
SQL Server etc.)
 Knowledge of Structured Query Language (SQL).

Creating a user
Syntax: CREATE USER username IDENTIFIED BY Password;

Once connected as SYSTEM, simply issue the CREATE USER command to generate a new
account.
CREATE USER books_admin IDENTIFIED BY MyPassword;
With our new books_admin account created, we can now begin adding privileges to the
account using the GRANT statement.
you’ll want to ensure the user has privileges to actually connect to the database and
create a session using GRANT CREATE SESSION.
GRANT CREATE SESSION, CONNECT, RESOURCE, TO books_admin;
System Privilege
System privileges are rights and restriction that are implemented on databases to
control which controls which users can access how much data in the database.
Object privileges
Object privileges are rights and restriction to change contents of database objects.

Grant: This command is basically used for giving access or permission to the users for
accessing the database.
Syntax :
GRANT privilege_name
ON object_name
TO {user_name | role_name | PUBLIC}
[WITH GRANT OPTION];

Example:
GRANT SELECT, INSERT
ON employees
TO user1;
REVOKE : This command is to take away or remove or REVOKE privileges/permission
from users.
Syntax:
REVOKE privilege_name
ON object_name
FROM {user_name | role_name | PUBLIC}
[RESTRICT / CASCADE];

Example:
REVOKE SELECT, INSERT
ON employees
FROM user1;.

The "PUBLIC" keyword is used to GRANT or REVOKE privileges for all users in the
database.
RESTRICT: Will revoke privileges of only current grant privileges, not dependent grant
privileges.
CASCADE: Will revoke privileges along with all dependent grant privileges.

Transaction: Concept, Properties & States of Transaction


Transaction:
Simple Query 5ired on DBMS is called SQL Operation.
Collection of multiple operations of one or more SQL statements that combined together
to form a single logical unit of work.

Types of Operations
Read Operation:
Read operation is used to read the value of X from the database and stores it in a buffer
in main memory.
Write Operation:
Write operation is used to write the value back to the database from the buffer.
Transaction Structure and Boundaries
Transaction Structure
The Transaction consists of all SQL operations executed between the begin and end
transaction.
Transaction Boundaries
Transaction must ends either by executing a COMMIT or ROLLBACK command.
Fundamental Properties of Transaction / ACID properties
ACID is an acronym that stands for atomicity, consistency, isolation, and durability
(ACID).
Atomicity
It states that all operations of the transaction take place at once if not, the transaction is
aborted.
There is no midway, i.e., the transaction cannot occur partially. Each transaction is
treated as one unit and either run to completion or is not executed at all.
It means that either all or none of its operations need to be executed.
Consistency
This means that integrity constraints must be maintained so that the database is
consistent before and after the transaction. It refers to the correctness of a database.
Consistency means transaction will never leave your database in a half 5inished
(inconsistent) state.
Isolation
Isolation property ensures that each transaction must remain unaware of other
concurrently executing transactions.
Isolation property keeps multiple transactions separated from each other until they are
completed.
Durability
The results of a transaction that has been successfully committed to the database will
remain unchanged even after database fails.
The durability property guarantees that all the changes made by transaction on the
database will be available permanently, although there is any type of failure after the
transaction completes execution.
Transaction States
Each and every transactions in DBMS, pass through several states during their lifespan.
These states, known as "Transaction States," collectively constitute the "Transaction
Lifecycle."

Active: This is the initial state of every transaction. In this state, the transaction is
being executed. The transaction remains in this state as long as it is executing SQL
statements.
Partially Committed: When a transaction executes its 5inal statement, it is said to be
in a 'partially committed' state. At this point, the transaction has passed the
modi5ication phase but has not yet been committed to the database. If a failure
occurs at this stage, the transaction will roll back.
Failed: If a transaction is in a 'partially committed' state and a problem occurs that
prevents the transaction from committing, it is said to be in a 'failed' state. When a
transaction is in a 'failed' state, it will trigger a rollback operation.
Aborted: If a transaction is rolled back and the database is restored to its state
before the transaction began, the transaction is said to be 'aborted.' After a
transaction is aborted, it can be restarted again, but this depends on the policy of the
transaction management component of the DBMS.
Committed: When a transaction is in a 'partially committed' state and the commit
operation is successful, it is said to be 'committed.' At this point, the transaction has
completed its execution and all of its updates are permanently stored in the
database.
Database System Failure
Like any other real world database systems also suffer from failure a variety of causes :
disk crash, power outage, software error , a 5ire in the machine room even rain . The
result of any failure is loss of information.
Type and Causes of Failure
1. Hardware Failure / System crash
There is a hardware malfunction that causes the loss of the content of volatile storage,
and brings transaction processing to a halt. The content of non-volatile storage remains
intact, and is not corrupted or changed.
2. Software Failure
The database software or the operating system may be corrupted or failed to work
correctly, that may causes the loss of the content of volatile storage, and brings about
database failure.
3. Media failure
A disk block loses its content as a result of either a head crash or failure during a data-
transfer operation. Copies of the data on other disks such as tapes, CDs are used to
recover from the failure.
4. Network Failure
The problem with network interface card can cause network failure. There may be
problem with network connection.
5. Transaction failure
There are two types of errors that may cause a transaction to fail;
(a) Logical error: The transaction can no longer continue with its normal execution
because of some internal condition, such as wrong input values, data not found in
database, data over5low, or resource limit exceeded etc.
(b) System error: The system has entered an undesirable state like deadlock as a result,
transaction cannot continue with its normal execution.
6. Application software error
The problem with software accessing the data from database. This may cause database
failure as data cannot be updated using such application to it.
7. Physical disasters
The problem caused due to 5lood, 5ire, earthquake etc.
Database Backup
Data Backup do mean by storing all our 5iles from our computer in another location. In
this way, if there is ever any loss of data on your primary machine, you still have copy of
same data with us on another machine.
Data Backup is easy to do and can save you great amounts of time as well as ensure that
your data is secure in the case of disaster.
Database Backup Techniques
A database backup is a copy of storage that is stored on a server. Backup is used to
prevent unexpected data loss. If original data gets lost, then with the help of a backup, it
is easy to gain access to the data again.
Full Backup -Physical Backup:
Full database backup is maintained at one recovery site as backup copy of that site.
A full and incremental database back up option targets to make it more feasible to store
several copies of the source data.
At 5irst step a full backup (of all 5iles) is made. After that, any number of updates can be
applied to that as incremental backup.
Differential Backup
A differential backup copies 5iles that have been created or changed since the last full
backup.
Only last modi5ication done in data will be saved as a backup.
It does not make back up of 5iles already backed up on system.
If you are performing a combination of full and differential backups, restoring 5iles and
folders requires that you have the last full as well as the last differential backup.
Partial Backup (Log based Backup or Logical Backup)
We will save log 5ile for all transaction done on database as a backup.
----------------------------------------
Database Recovery Concepts
Database recovery refers to the process of restoring a database to original (Correct)
state as it was before database failure occurs.
Types of Database Recovery / Recovery Phases
1. Forward Database Recovery / Roll Forward (REDO Phase)
2. Backward Database Recovery / Roll Backward (UNDO Phase)
Database Recovery Techniques
a) Log Based Recovery
b) Shadow Paging Recovery
c) Checkpoints
---------------------------------
Log Based Recovery
Log-based recovery is a data recovery technique used in database management systems
(DBMS) that relies on logs to restore a database to a consistent state after a failure. This
method records changes made to the database in a log 5ile, which can be used to redo or
undo operations to recover data.
The log is a sequence of log records, recording all the updated activities in the database.
In stable storage, logs for each transaction are maintained. Any operation which is
performed on the database is recorded on the log. Prior to performing any modi5ication
to the database, an updated log record is created to re5lect that modi5ication. An update
log record represented as: <Ti, Xj, V1, V2> has these 5ields:
1. Transaction identi=ier: Unique Identi5ier of the transaction that performed the
write operation.
2. Data item: Unique identi5ier of the data item written.
3. Old value: Value of data item prior to write.
4. New value: Value of data item after write operation.
Other types of log records are:
1. <Ti start> : It contains information about when a transaction Ti starts.
2. <Ti commit> : It contains information about when a transaction Ti commits.
3. <Ti abort> : It contains information about when a transaction Ti aborts.
Undo and Redo Operations
Because all database modi5ications must be preceded by the creation of a log record, the
system has available both the old value prior to the modi5ication of the data item and
new value that is to be written for data item. This allows system to perform redo and
undo operations as appropriate:
1. Undo: using a log record sets the data item speci5ied in log record to old value.
2. Redo: using a log record sets the data item speci5ied in log record to new value.
The database can be modi=ied using two approaches –
1. Deferred Modi=ication Technique: If the transaction does not modify the
database until it has partially committed, it is said to use deferred modi5ication
technique.
2. Immediate Modi=ication Technique: If database modi5ication occur while the
transaction is still active, it is said to use immediate modi5ication technique.
Recovery using Log records
When the system is crashed, then the system consults the log to 5ind which transactions
need to be undone and which need to be redone.
1. If the log contains the record <Ti, Start> and <Ti, Commit> or <Ti, Commit>, then
the Transaction Ti needs to be redone.
2. If log contains record<Tn, Start> but does not contain the record either <Ti,
commit> or <Ti, abort>, then the Transaction Ti needs to be undone.
------------------------------
Checkpoint – Recovery Related Structures
The Checkpoint is used to declare a point before which the DBMS was in a consistent
state, and all transactions were committed. During transaction execution, such
checkpoints are traced. After execution, transaction log 5iles will be created. Upon
reaching the savepoint/checkpoint, the log 5ile is destroyed by saving its update to the
database. Then a new log is created with upcoming execution operations of the
transaction and it will be updated until the next checkpoint and the process continues.
Why do We Need Checkpoints?
Whenever transaction logs are created in a real-time environment, it eats up lots of
storage space. Also keeping track of every update and its maintenance may increase the
physical space of the system. Eventually, the transaction log 5ile may not be handled as
the size keeps growing. This can be addressed with checkpoints. The methodology
utilized for removing all previous transaction logs and storing them in permanent
storage is called a Checkpoint.
Steps to Use Checkpoints in the Database
1. Write the begin_checkpoint record into a log.
2. Collect checkpoint data in stable storage.
3. Write the end_checkpoint record into a log.
The behavior when the system crashes and recovers when concurrent transactions are
executed is shown below:
Understanding Checkpoints in multiple Transactions
Transactions and operations of the above diagram:

Transaction 1 Transaction 2 Transaction 3 Transaction 4


(T1) (T2) (T3) (T4)

START

START

COMMIT

START

COMMIT

START

FAILURE

For the immediate modi5ication technique, the recovery operations are:


 The recovery system reads the logs backward from the end to the last checkpoint
i.e. from T4 to T1.
 It will keep track of two lists – Undo and Redo.
 Whenever there is a log with instructions <Tn, start>and <Tn, commit> or only
<Tn, commit> then it will put that transaction in Redo List. T2 and T3 contain
<Tn, Start> and <Tn, Commit> whereas Here, T2, and T3 are in the redo list.
 Whenever a log record with no instruction of commit or abort is found, that
transaction is put to Undo List Here, T4 has <Tn, Start> but no <Tn, commit> as it
is an ongoing transaction. T1 will have only <Tn, Start>. T1 and T4 will be put on
the undo list.
All the transactions in the redo list are deleted with their previous logs and then redone
before saving their logs. All the transactions in the undo list are undone and their logs
are deleted.
Obviously, the undo operation does not need to be applied when the deferred
modi5ication technique is being employed.
Advantages of Checkpoints
 A database checkpoint keeps track of change information and enables
incremental database backup.
 A database storage checkpoint can be mounted, allowing regular 5ile system
operations to be performed.
 Database checkpoints can be used for application solutions which include
backup, recovery or database modi5ications.
Disadvantages of Checkpoints
1. Database storage checkpoints can only be used to restore from logical errors (E.g. a
human error).
2. Because all the data blocks are on the same physical device, database storage
checkpoints cannot be used to restore 5iles due to a media failure.
------------------------------------
Shadow Paging
It is not always convenient to maintain logs of all transaction for the purpose of
recovery.
An alternative is to use a system of shadow paging.
This is where the database is divided into pages that may be stored in any order on the
disk.
In order to identify the location of any given page, we use something called a page table.
During the life of a transaction two-page tables are maintained as below,
 Shadow page table
 Current page table.
When a transaction begins both of these page tables point to the same locations
(are identical).
During the lifetime of a transaction the shadow page table doesn't change at all.
However, during the lifetime of a transaction update values etc. may be changed.
For pages updated by the transaction, two versions are kept. The old version is
referenced by the shadow directory and the new version by the current directory.
So, whenever we update a page in the database we always write the updated page to a
new location.
This means that when we update our current page table it re5lect the changes that have
been made by that transaction.

Page Table Shadow Paging


As we can see the shadow page table shows the state of the database just prior to a
transaction, and the current page table shows the state of the database during or after a
transaction has been completed.
Process of recovery
We now have a system whereby if we ever want to undo the actions of a transaction all
we have to do is recover the shadow page table to be the current page table.
As such this method makes the shadow page table particularly important, and so it must
always be stored on stable storage. On disk we store a single pointer location that points
to the address of the shadow page table. This means that to swap the shadow table for
the current page table (committing the data) we just need to update this single pointer
(very unlikely to fail during this very short fast operation). In case of no failure means
while committing transaction just discard the shadow directory. In case of multi-user
environment with concurrent transaction, logs and checkpoints must be incorporated in
shadow paging.
Advantages:
Shadow page method does not require any Undo or Redo algorithm for recovery
purpose.
Recovery using this method will be faster.
No overhead for writing log records.
Disadvantages:
Data fragmentation: The main disadvantage of this technique is the updated Data will
suffer from fragmentation as the data is divided up into pages that may or not be in
linear order for large sets of related hence, complex storage management strategies.
Garbage collection: Garbage will accumulate in the pages on the disk as data is
updated and pages lose any references. For example if i have a page that contains a data
item X that is replaced with a new value then a new page will be created. Once the
shadow page table is updated nothing will reference the old value of X. The operation to
migrate between current and shadow directories must be implemented as an atomic
mode.
Commit Overhead:
If the directory size is large, the overhead of writing shadow directories to disk as
transaction commit is signi5icant.
-----------------------------------------
Overview of Advanced database concepts: - Data Warehouse, Data lakes, Data
mining, Big data, Mongo DB, DynamoDB
Data Warehouse
The term Data Warehouse was coined by Bill Inmon in 1990, which he de5ined in the
following way: "A warehouse is a subject-oriented, integrated, time-variant and non-
volatile collection of data in support of management's decision-making process".
Characteristics / Features of Data Warehouse
A common way of introducing data warehousing is refer to the Characteristics of Data
Warehouse .
i. subject-oriented
ii. integrated
iii. Non-Volatile
iv. time-variant
Subject-Oriented
Data warehouses are designed to help analyse data. For example, to learn more about
for e.g., banking data, a warehouse can be built that concentrates on transactions, loans,
etc.
This warehouse can be used to answer questions like "Which customer has taken
maximum loan amount for last year?"
This ability to de5ine a data warehouse by subject matter, loan in this case, makes the
data warehouse subject oriented.
Integrated
Data is integrated from various heterogeneous data sources to construct a data
warehouse.
The heterogeneous data sources comprise relational databases, 5lat 5iles, external data
5iles, on-line transaction records etc.
After data collection, data is preprocessed and cleaned before applying data integration
techniques.
Non-Volatile
Data Warehouse is integrated Non-volatile means that, once data entered into the
warehouse, it cannot be removed or changed because the purpose of a warehouse is to
analyze the data.
Time Variant
Data warehouse stores all historical data of organization. So for analysis purpose,
analyst can retrieve old data for future predication or to analyze trends or patterns.
But transactional system stores only recent data, so to retrieve old data is not possible.
For example, transactional system hold most recent job or address of employee, where
as warehouse stores all previous jobs or addresses of employee.
Data Warehouses and Data Marts
Data mart is subset of data warehouse of major data subject.
Data mart is created for speci5ic purpose as per business needs.
It is the repository of business data for particular type of data consumers like sales or
operations for the department level.
A data mart is typically established as one dimensional model or star schema which is
composed of a fact table and multi-dimensional table.
Data Warehouse Data Mart
Data Warehouse is enterprise wide and Data mart is department wise
has large storage capacity.

It is used for analysis and application It is application dependent.


independent.
It stores historical and summarised data It stores the data for summarised data of
of enterprise particular major subject
of data warehouse.

It is highly 5lexible. It is restrictive.

Implementation takes months to year. Implementation is done usually in


months

Generally size is from 100 GB to 1 TB. Generally size is less than 100 GB.

----------------------
Data Lake
Data Lake is a Modern Approach to Data Storage. A data lake is like a vast, digital
reservoir where organizations can store enormous amounts of raw data in its original
format. Imagine a huge pool where you can pour in all types of information from text
and images to videos and sensor readings - without having to organize it 5irst.
Key features of data lakes:
Flexibility: Data lakes can hold structured data (like spreadsheets), semi-structured
data (like JSON 5iles), and unstructured data (like social media posts) all in one place.
Scalability: They can grow as needed, allowing businesses to store virtually unlimited
amounts of data.
Cost-effective: Storing data in its raw form is often cheaper than traditional databases.
Data exploration: Scientists and analysts can dive into this "lake" of information to
discover new insights and patterns.
Big Data Compatible: Data Lakes work well with technologies designed to process
huge amounts of information quickly.
----------------------
Data Mining
Data mining is the process of extracting knowledge or insights from large amounts of
data using various statistical and computational techniques. The data can be structured,
semi-structured or unstructured, and can be stored in various forms such as databases,
data warehouses, and data lakes.
The primary goal of data mining is to discover hidden patterns and relationships in the
data that can be used to make informed decisions or predictions. This involves exploring
the data using various techniques such as clustering, classi5ication, regression analysis,
association rule mining, and anomaly detection.
Data mining has a wide range of applications across various industries, including
marketing, 5inance, healthcare, and telecommunications. For example, in marketing,
data mining can be used to identify customer segments and target marketing
campaigns, while in healthcare, it can be used to identify risk factors for diseases and
develop personalized treatment plans.
Data mining is also called Knowledge Discovery in Database (KDD).
The knowledge discovery process includes Data cleaning, Data integration, Data
selection, Data transformation, Data mining, Pattern evaluation, and Knowledge
presentation.
Data Mining Knowledge Discovery in Database
(KDD).
KDD is Data Mining are a 5ield of Data Data Mining is one of the step in KDD
Mining is one of computer science, which process , it applies the appropriate
the step in the KDD helps humans in algorithm based on the goal of the KDD
extracting useful, previously process for identifying patterns from data
undiscovered knowledge from data. It
makes use of tools and theories for the
same.

----------------------------
Big Data
Data which are very large in size is called Big Data. Normally we work on data of size
MB(WordDoc ,Excel) or maximum GB(Movies, Codes) but data in Peta bytes i.e. 10^15
byte size is called Big Data. It is stated that almost 90% of today's data has been
generated in the past 3 years.
3V's of Big Data
1. Velocity: The data is increasing at a very fast rate. It is estimated that the volume
of data will double in every 2 years.
2. Variety: Now a days data are not stored in rows and column. Data is structured
as well as unstructured. Log 5ile, CCTV footage is unstructured data. Data which
can be saved in tables are structured data like the transaction data of the bank.
3. Volume: The amount of data which we deal with is of very large size of Peta
bytes.
-----------------------------
Dynamo DB
Amazon's NOSQL database called DynamoDB in form of table is a collection of various
items and each item is a collection of attributes. In a relational database, a table has a
5ixed schema of tables with primary key and list of its columns with data types.
All tuples are of same schema.
DynamoDB Data model contains,
 Table
 Items
 Attributes
DynamoDB requires only a primary key and does not require to de5ine all of the
attribute names and data types in advance.
Each attribute of DynamoDB in an item is a name-value pair.
Primary Key
In order to create table we must specify primary key column name which identi5ies item
set in table uniquely.
Partition Key
A simple primary key has only one attribute known as the partition key.
DynamoDB uses the partition key as input to hash function for internal use.
Partition Key and Sort Key
A composite primary key is made of two attributes.
The 5irst attribute is the partition key, and the second attribute is the sort key.
All items with the similar partition key are stored together, sorting is done in sorted
order using sort key value.
Example
Consider a class of Students with ID column as primary key ,
StudentClass{ID,…}
{
Id=1011
SubjectName=”DMS”
ISBN=”111-11111101”
Authors=[“Author1”, ”Author2”]
Price=150
PageCount=500
}
{
Id=1015
SubjectName=”CGR”
ISBN=”101-110011101”
Authors=[“Author3”, ”Author4”]
Price=1543
PageCount=700
}
-----------------------------
MongoDB
1. NoSQL
NoSQL means "Not only SQL", NoSQL databases can use SQL-like query concepts.
NoSQL includes all databases that is not a traditional relational database management
system (RDBMS).
NoSQL databases are more specialized for various types of data types.
It is more ef5icient and better performing than RDBMS servers in most instances.
2. MongoDB
MongoDB is an open-source document type database.
MongoDB is implemented in C++.
High performance, high availability and automatic scaling are the important features.
MongoDB has its own ad-hoc query language with rich features set.
3. MongoDB Features
High performance as compared to traditional SQL.
Good support for embedded data models.
Faster query processing using index support from embedded documents and arrays.
Higher availability provided by MongoDB's replication facility.
Automatic failover using multiple replicas set by MongoDB a server which maintains the
same data set by providing redundancy and increasing data availability.
MongoDB provides horizontal scalability.
It also provides automatic shading techniques and distributes data across a cluster of
machines.
4. Collection and Document
Collection is equivalent to a table in RDBMS. A collection is a group of documents which
exists within a single database. Collection is schema less.
A document is a set of key-value pairs. Documents have dynamic schema. RDBMS
terminology with MongoDB terminology.
RDBMS MongoDB
Database Database
Table Collection
Tuple/Row Document
Column Field
Table Join Embedded Documents
Primary Key Prinary Key (Default key id provided by mongodb itself)
5. Document Database
In Mongo DB document is used to store data.
Each document consists of 5ields and values.
Mongo DB documents and JSON objects are similar to each other.
Other documents, arrays etc. can be included in the 5ield values.
6. Sample Mongo DB Document
A Mongo DB document is as follows,
{
movie "Hanuman",
Ticket: 100,
Screen : 02
}
7. CRUD operation in MongoDB
Create or insert a new document
db.collection.insert()
db.collection.insertOne()
db.collection.insertMany()
Read
db.collection.5ind()
Update
db.collection.update()
Delete
Read
db.collection.remove()

-----------------------------

You might also like