Unit 4 Distributed DBMS by ANS

Uploaded by

Peter Parker


Unit 4 - Distributed Database Management System By ABNS

A distributed database management system (DDBMS) is a software system that manages a
distributed database so that it appears to users as if it were all stored in a single location.
Features
• It is used to create, retrieve, update and delete distributed databases.
• It ensures that the data modified at any site is universally updated.
• It is used in application areas where large volumes of data are processed.
• It is designed for heterogeneous database platforms.
Factors Encouraging DDBMS
• Distributed Nature of Organizational Units − Most organizations are divided into multiple
units that are physically distributed, and each unit needs its own set of local data.
• Need for Sharing of Data − The multiple organizational units often need to
communicate with each other and share their data and resources.
• Support for Both OLTP and OLAP − Online Transaction Processing (OLTP) and
Online Analytical Processing (OLAP) often work on common data. Distributed database systems aid
both kinds of processing by providing synchronized data.
• Database Recovery − Replication of data automatically helps in data recovery
if the database at any site is damaged.
• Support for Multiple Application Software
Applications of Distributed Database:
• It is used in corporate management information systems.
• It is used in multimedia applications.
• It is used in military control systems, hotel chains, etc.
• It is also used in manufacturing control systems.
Advantages of Distributed Database System :
1) There is fast data processing as several sites participate in request processing.
2) It has a reduced operating cost.
3) It is easier to expand the system by adding more sites.
Disadvantages of Distributed Database System :
1) The system becomes complex to manage and control.
2) The security issues must be carefully managed.
3) Some standardization is needed for processing in a distributed database
system.

Types of Distributed Databases
Distributed databases can be broadly classified into homogeneous and heterogeneous distributed database
environments.

Homogeneous Distributed Databases

In a homogeneous distributed database, all the sites use identical DBMS and operating
systems. Its properties are −
• The sites use very similar software.
• The sites use identical DBMS or DBMS from the same vendor.
• The database is accessed through a single interface as if it is a single database.
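There are two types of homogeneous distributed database −
• Autonomous − Each database is independent and functions on its own. The databases are integrated
by a controlling application and use message passing to share data updates.
• Non-autonomous − Data is distributed across the homogeneous nodes, and a central or master DBMS
coordinates data updates across the sites.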
Heterogeneous Distributed Databases
In a heterogeneous distributed database, different sites have different operating systems,
DBMS products and data models. Its properties are −

• Different sites use dissimilar schemas and software.

• Query processing is complex due to dissimilar schemas.
• Transaction processing is complex due to dissimilar software.
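Types of Heterogeneous Distributed Databases
• Federated − The heterogeneous database systems are independent in nature and are integrated
together so that they function as a single database system.
• Un-federated − The database systems employ a central coordinating module through which the
databases are accessed.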

Distributed DBMS Architectures

DDBMS architectures are generally developed depending on three parameters −
• Distribution − It states the physical distribution of data across the different sites.
• Autonomy − It indicates the distribution of control of the database system and the
degree to which each constituent DBMS can operate independently.
• Heterogeneity − It refers to the uniformity or dissimilarity of the data models, system
components and databases.

Architectural Models
• Client - Server Architecture for DDBMS
• Peer - to - Peer Architecture for DDBMS
• Multi - DBMS Architecture

Client-server architecture:
This is a two-level architecture where the functionality is divided into servers and clients.
Server does most of the data management work
– query processing
– data management
– Optimization
– Transaction management etc
Client performs
– Application
– User interface
– DBMS Client model
The two different client-server architectures are −
• Single Server Multiple Client
• Multiple Server Multiple Client

Peer-to-Peer Architecture for DDBMS
In these systems, each peer acts both as a client and a server for imparting database services. The peers
share their resources with other peers and coordinate their activities.
This architecture generally has four levels of schemas −
• Local internal schema − The individual internal schema definition at each site.
• Global conceptual schema − Describes the enterprise view of the data.
• Local conceptual schema − Describes the local organization of data at each site.
• External schemas − Support user applications and user access to the database.
Major Components of a Peer-to-Peer System
(i) User processor
(ii) Data processor
User Processor
• User-interface handler
• Checks whether the user query can be processed.
• Translates global queries into local ones.
Data Processor
• Local query optimizer
• Responsible for choosing the best access path
• Local recovery manager

Multi - DBMS Architectures
This is an integrated database system formed by a collection of two or more autonomous
database systems.
Multi-DBMS can be expressed through six levels of schemas −
• Multi-database View Level − Depicts multiple user views, each comprising a subset of
the integrated distributed database.
• Multi-database Conceptual Level − Depicts the integrated multi-database, which comprises
global logical multi-database structure definitions.
• Multi-database Internal Level − Depicts the data distribution across different sites.
• Local database View Level − Depicts the public view of local data.
• Local database Conceptual Level − Depicts local data organization at each site.
• Local database Internal Level − Depicts physical data organization at each site.
There are two design alternatives for multi-DBMS −
• Model with multi-database conceptual level.
• Model without multi-database conceptual level.

Fragmentation
Fragmentation is the task of dividing a table into a set of smaller tables.
The subsets of the table are called fragments.
Fragmentation can be of three types: horizontal, vertical, and hybrid (a combination of horizontal and
vertical). Horizontal fragmentation can further be classified into two techniques: primary horizontal
fragmentation and derived horizontal fragmentation. Fragmentation increases parallelism and limits the
effect of a site failure to the fragments stored at that site. Here, there is only one copy of each fragment
in the system, i.e. no redundant data.
Advantages :
• As the data is stored close to the site of usage, the efficiency of the database system increases.
• It permits a number of transactions to execute concurrently, which increases the level of concurrency.
Disadvantages :
• Access may be very slow when data from different fragments is needed.
• Lack of back-up copies of data at different sites may render the database ineffective in
case of failure of a site.
The three fragmentation techniques are −
• Vertical fragmentation
• Horizontal fragmentation
• Hybrid fragmentation
Horizontal Fragmentation
In horizontal fragmentation, the rows (tuples) of a table are grouped into fragments: the table is divided
into groups of one or more rows, and each group forms one fragment with the same schema as the original
table. It is also called splitting by rows.
Vertical Fragmentation
In vertical fragmentation, the fields or columns of a table are grouped into fragments. Each fragment
should contain the primary key field(s) of the table so that the original table can be reconstructed.
Vertical fragmentation can be used to enforce privacy of data.
Hybrid Fragmentation
The combination of vertical fragmentation of a table followed by further horizontal fragmentation of some
fragments is called mixed or hybrid fragmentation.
For defining this type of fragmentation we use the SELECT and the PROJECT operations of relational
algebra.
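
The three fragmentation techniques can be sketched with two small helper functions that mimic the SELECT and PROJECT operations. This is only an illustrative sketch: the EMPLOYEE table, its column names, its rows and the helper names are all invented for the example and are not part of these notes.

```python
# Hypothetical EMPLOYEE table; columns and rows are invented for illustration.
EMPLOYEE = [
    {"emp_id": 1, "name": "Asha", "city": "Pune", "salary": 50000},
    {"emp_id": 2, "name": "Ravi", "city": "Mumbai", "salary": 62000},
    {"emp_id": 3, "name": "Meena", "city": "Pune", "salary": 58000},
]

def select(rows, predicate):
    """SELECT: keep the rows that satisfy the predicate (a horizontal fragment)."""
    return [row for row in rows if predicate(row)]

def project(rows, columns):
    """PROJECT: keep only the named columns (a vertical fragment)."""
    return [{col: row[col] for col in columns} for row in rows]

# Horizontal fragmentation: split by rows, every fragment has the full schema.
pune = select(EMPLOYEE, lambda r: r["city"] == "Pune")
others = select(EMPLOYEE, lambda r: r["city"] != "Pune")

# Vertical fragmentation: split by columns, every fragment keeps the primary key.
public = project(EMPLOYEE, ["emp_id", "name", "city"])
private = project(EMPLOYEE, ["emp_id", "salary"])

# Hybrid fragmentation: a vertical fragment that is then split horizontally.
pune_public = select(public, lambda r: r["city"] == "Pune")

def union(*fragments):
    """Rebuild a table from horizontal fragments."""
    rows = [row for fragment in fragments for row in fragment]
    return sorted(rows, key=lambda r: r["emp_id"])

def join_on_key(left, right, key):
    """Rebuild a table from two vertical fragments by joining on the primary key."""
    right_by_key = {row[key]: row for row in right}
    return [{**row, **right_by_key[row[key]]} for row in left]

# Both kinds of fragments can be recombined into the original table.
assert union(pune, others) == EMPLOYEE
assert join_on_key(public, private, "emp_id") == EMPLOYEE
```

Keeping `emp_id` in both vertical fragments is what makes the join possible, and the `private` fragment shows how salary data could be kept at a separate site.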

Replication
Data replication is the process of storing separate copies of the database at two or more sites.
It is a popular fault tolerance technique for distributed databases.
Advantages of Data Replication
• Reliability − In case of failure of any site, the database system continues to work since a copy is available
at another site(s).
• Quicker Response − Availability of local copies of data ensures quick query processing and consequently
quick response time.
Disadvantages of Data Replication
• Increased Storage Requirements − Maintaining multiple copies of data is associated with increased
storage costs.
• Increased Cost and Complexity of Data Updating − Each time a data item is updated, the update needs to
be reflected in all the copies of the data at the different sites. This requires complex synchronization
techniques and protocols.
Some commonly used replication techniques are −
• Snapshot replication
• Near-real-time replication
• Pull replication
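
The trade-off between the advantages and disadvantages above can be illustrated with a toy model in which every site holds a full copy and every update is applied to all copies. The class, its method names and the sample data are hypothetical, and the "write to all copies, read from any copy" policy is an assumption made for the sketch, not one of the techniques listed in these notes.

```python
class ReplicatedStore:
    """Toy model: every site holds a full copy of the data (hypothetical class)."""

    def __init__(self, site_names):
        self.copies = {name: {} for name in site_names}
        self.failed = set()

    def fail(self, site):
        """Mark a site as unavailable."""
        self.failed.add(site)

    def write(self, key, value):
        """Reflect the update in every copy; refuse if any site is unreachable."""
        if self.failed:
            raise RuntimeError(f"cannot update all copies, sites down: {sorted(self.failed)}")
        for copy in self.copies.values():
            copy[key] = value

    def read(self, key, prefer):
        """Read from the preferred (local) site, else from any surviving copy."""
        candidates = [prefer] + [s for s in self.copies if s != prefer]
        for site in candidates:
            if site not in self.failed:
                return site, self.copies[site][key]
        raise RuntimeError("no copy available")


store = ReplicatedStore(["Pune", "Mumbai"])
store.write("stock:42", 17)      # the update reaches both copies
store.fail("Pune")               # one site goes down
site, value = store.read("stock:42", prefer="Pune")  # served by the other copy
```

Reads survive the failure of a site, which is the reliability advantage, while `write` shows the extra cost: an update has to reach every copy, so a real system needs a synchronization protocol for sites that are down or slow.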

| Aspect | Replication | Fragmentation |
|---|---|---|
| Definition | Creating multiple copies of the same data across different nodes. | Dividing a database into smaller, manageable pieces (fragments). |
| Purpose | Enhances data availability and reliability. | Improves performance and scalability by distributing data. |
| Data Consistency | Ensures all copies are synchronized (eventual consistency). | Each fragment can be independently updated; consistency may vary. |
| Storage Efficiency | May lead to increased storage requirements due to multiple copies. | Can optimize storage by only storing necessary fragments. |
| Performance Impact | Can improve read performance but may slow down writes due to synchronization. | Can enhance performance by reducing the amount of data scanned. |
| Complexity | More complex due to synchronization mechanisms. | Complexity arises from managing fragments and their distribution. |

Recovery Techniques for Distributed Databases
Backup and Restoration
The most basic recovery technique is the regular backup of the database. DBAs can schedule backups to
run periodically, storing snapshots of the database. In the event of a failure, the latest backup is restored.

Recovery from Disk Failure

A disk failure or hard crash causes a total database loss. To recover from this hard crash, a new disk is
prepared, then the operating system is restored, and finally the database is recovered using the database
backup and transaction log.
Checkpointing
A checkpoint is a point in time at which records are written from the buffers onto the database. In case of a
system crash, the recovery manager does not have to redo the transactions that were committed
before the checkpoint.
The two types of checkpointing techniques are −
• Consistent checkpointing
• Fuzzy checkpointing
Database Replication
Database replication involves copying and maintaining the same set of data across multiple databases. This
ensures redundancy, improves availability, and can enhance performance by distributing workload.
Point-in-Time Recovery
Point-in-Time Recovery (PITR) allows restoring a database to a specific moment in time, rather than just the
latest backup. It involves using transaction logs to roll forward or backward to the desired timestamp.
Transaction Recovery Using UNDO / REDO
UNDO all faulty transactions and transactions that may be affected by the faulty transactions.
REDO all transactions that are not faulty but have been undone due to the faulty transactions.
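
The checkpointing and UNDO / REDO ideas in this section can be combined in a small log-based recovery sketch. It rests on assumptions these notes do not state: the log holds before and after values for every write, all buffered writes (including uncommitted ones) are on disk at a checkpoint, and no transaction overwrites another's uncommitted data. The record layout and the function name are hypothetical.

```python
def recover(disk_state, log):
    """Return the database state after crash recovery.

    Log records (hypothetical layout):
      ("write", txn, key, old_value, new_value)
      ("commit", txn)
      ("checkpoint",)
    """
    db = dict(disk_state)
    committed = {rec[1] for rec in log if rec[0] == "commit"}
    last_checkpoint = max(
        (i for i, rec in enumerate(log) if rec[0] == "checkpoint"), default=-1
    )

    # REDO: replay writes of committed transactions, but only those logged
    # after the last checkpoint; earlier writes are already on disk.
    for rec in log[last_checkpoint + 1:]:
        if rec[0] == "write" and rec[1] in committed:
            db[rec[2]] = rec[4]

    # UNDO: scan the log backwards and restore the old value of every
    # write made by a transaction that never committed.
    for rec in reversed(log):
        if rec[0] == "write" and rec[1] not in committed:
            db[rec[2]] = rec[3]
    return db


log = [
    ("write", "T1", "x", 0, 10),
    ("commit", "T1"),
    ("checkpoint",),
    ("write", "T2", "y", 0, 5),
    ("commit", "T2"),
    ("write", "T3", "x", 10, 99),   # T3 never commits
]
# At the crash, T3's write had reached disk but T2's had not.
recovered = recover({"x": 99, "y": 0}, log)
```

T1 committed before the checkpoint and is not redone, T2 is redone, and T3 is undone, so `recovered` holds the values written by T1 and T2 only.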

CONCURRENCY CONTROL IN DISTRIBUTED DBMS

Concurrency control techniques ensure that multiple transactions can be executed simultaneously while
maintaining the ACID properties of the transactions and serializability in the schedules.
Concurrency Control Mechanisms
Distributed Two-phase Locking Algorithm
The basic principle of distributed two-phase locking is the same as that of the basic two-phase locking protocol.
However, in a distributed system there are sites designated as lock managers. A lock manager controls lock
acquisition requests from transaction monitors.
Two-phase locking approaches can be of three types −
• Centralized two-phase locking
• Primary copy two-phase locking
• Distributed two-phase locking

Distributed Timestamp Concurrency Control
In a distributed system, a site's local physical or logical clock reading cannot be used as a global timestamp,
since it is not globally unique.
So, a timestamp comprises a combination of the site ID and that site's clock reading.
For implementing timestamp ordering algorithms, each site has a scheduler. The scheduler puts each request
into the corresponding queue in increasing timestamp order.
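
A minimal sketch of such timestamps and of a scheduler queue follows. It assumes that timestamps are compared by clock reading first, with the site ID used only to break ties, and it keeps a single queue per scheduler; both choices, and all names in the code, are assumptions made for illustration.

```python
import heapq
from dataclasses import dataclass


@dataclass(order=True, frozen=True)
class Timestamp:
    """Global timestamp: a site's clock reading combined with its site ID."""
    clock: int      # compared first
    site_id: int    # breaks ties between sites with the same clock reading


class Scheduler:
    """Keeps pending requests in increasing timestamp order (hypothetical class)."""

    def __init__(self):
        self._queue = []

    def submit(self, timestamp, request):
        heapq.heappush(self._queue, (timestamp, request))

    def next_request(self):
        return heapq.heappop(self._queue)


scheduler = Scheduler()
scheduler.submit(Timestamp(clock=7, site_id=2), "write x")
scheduler.submit(Timestamp(clock=7, site_id=1), "read x")   # same clock, other site
scheduler.submit(Timestamp(clock=5, site_id=2), "read y")

order = [scheduler.next_request()[1] for _ in range(3)]
```

Two sites can produce the same clock reading (7 here), but the pair of clock reading and site ID is still unique, so the requests have a well-defined order.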
Conflict Graphs
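A conflict graph is created for the classes to which active transactions belong. It contains a set of vertical,
horizontal, and diagonal edges: a vertical edge denotes a conflict within a class, a horizontal edge a
write-write conflict between two classes, and a diagonal edge a write-read or read-write conflict between
two classes.
The conflict graphs are analyzed to ascertain whether two transactions within the same class or across two
different classes can be run in parallel.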
Distributed Optimistic Concurrency Control Algorithm
The distributed optimistic concurrency control algorithm extends the optimistic concurrency control algorithm.
For this extension, two rules are applied −
Rule 1 − A transaction must be validated locally at every site where it executes.
Rule 2 − After a transaction passes the local validation test, it should be globally validated.
Commit Protocols
In a distributed system, the transaction manager should convey the decision to commit to all the servers in the
various sites where the transaction is being executed.

When processing is complete at a site, the transaction there reaches the partially committed state and waits for
the transaction at all other sites to reach the partially committed state.

When the transaction manager receives the message that all the sites are ready to commit, it starts to commit.
In a distributed system, either all sites commit or none of them does. The different distributed commit protocols are −

• One-phase commit
• Two-phase commit
• Three-phase commit
One-Phase Commit
It is the simplest commit protocol. In this commit protocol, there is a controlling site and a number of
slave sites where the transaction is performed.

Two-Phase Commit

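It is the second type of commit protocol in DBMS. It was introduced to reduce the vulnerabilities of the one-phase
commit protocol. There are two phases in the two-phase commit protocol −

• Prepare Phase − The controlling site asks every slave site whether it is ready to commit, and each slave
site replies with its vote.
• Commit/Abort Phase − If every site voted ready, the controlling site instructs all sites to commit;
otherwise it instructs all sites to abort.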
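
The two phases can be sketched as a coordinator function that collects votes and then broadcasts one decision. This is a simplified illustration only: the class and function names are hypothetical, and message passing, timeouts, logging and site failures, which a real protocol must handle, are left out.

```python
class Participant:
    """A slave site taking part in the transaction (hypothetical class)."""

    def __init__(self, name, can_commit=True):
        self.name = name
        self.can_commit = can_commit
        self.state = "active"

    def prepare(self):
        """Phase 1: vote on whether this site is ready to commit."""
        self.state = "ready" if self.can_commit else "not ready"
        return self.can_commit

    def commit(self):
        self.state = "committed"

    def abort(self):
        self.state = "aborted"


def two_phase_commit(participants):
    """Run both phases at the controlling site and return the global decision."""
    # Prepare phase: ask every site for its vote.
    votes = [site.prepare() for site in participants]

    # Commit/abort phase: commit only if every site voted ready.
    decision = "commit" if all(votes) else "abort"
    for site in participants:
        if decision == "commit":
            site.commit()
        else:
            site.abort()
    return decision


all_ready = [Participant("A"), Participant("B")]
one_refuses = [Participant("A"), Participant("B", can_commit=False)]
first = two_phase_commit(all_ready)
second = two_phase_commit(one_refuses)
```

In the second run, site A is ready but still ends up aborted, which is the all-or-nothing behaviour the commit protocols are meant to guarantee.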

Three Phase Commit Protocol

It is the third type of commit protocol in DBMS. It was introduced to address the issue of blocking. In this commit
protocol, there are three phases −

• Prepare phase
• Prepare-to-commit phase
• Commit/abort phase

Advantages of Commit Protocol in DBMS

• It helps to ensure that the integrity of the data is maintained throughout the database.
• It also helps to maintain atomicity, which means that either all the operations in a transaction are
completed successfully or none of them is done at all.
• The commit protocol provides mechanisms for system recovery in the case of system failures.
Two-Phase Locking Protocol
Every transaction locks and unlocks data items in two distinct phases.

Growing phase − All locks are acquired in this phase and no lock is released. When the transaction has acquired
all the locks it needs, it has reached its lock point and the second phase (shrinking phase) starts.

Shrinking phase − No new locks are acquired in this phase. The changes to data items are stored and then the
locks are released.

The following example shows how locking and unlocking work with 2PL. The step numbers refer to the positions of
the lock and unlock operations in an example schedule of two transactions.

Transaction T1:
o Growing phase: from step 1-3
o Shrinking phase: from step 5-7
o Lock point: at 3

Transaction T2:
o Growing phase: from step 2-6
o Shrinking phase: from step 8-9
o Lock point: at 6

Two-phase locking is of two types −

Strict two-phase locking protocol

A transaction can release a shared lock after the lock point, but it cannot release any exclusive lock until the
transaction commits.

Rigorous two-phase locking protocol

A transaction cannot release any lock either shared or exclusive until it commits.
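
The basic two-phase rule and the lock point can be checked mechanically. The sketch below takes the lock and unlock steps of one transaction and returns its lock point, or reports a violation. The data items A, B and C and the exact steps are invented; they were only chosen to agree with the phase boundaries listed for T1 and T2 above. The function name is hypothetical, and the check covers basic 2PL only, not the strict or rigorous variants.

```python
def lock_point(operations):
    """Return the step of the last lock acquisition if the transaction obeys 2PL.

    operations: list of (step, action, item), where action is "lock" or "unlock".
    Raises ValueError if a lock is acquired after any lock has been released.
    """
    shrinking = False
    point = None
    for step, action, item in operations:
        if action == "lock":
            if shrinking:
                raise ValueError(
                    f"lock on {item} at step {step} after an unlock violates 2PL"
                )
            point = step            # still in the growing phase
        elif action == "unlock":
            shrinking = True        # the shrinking phase has started
    return point


# Hypothetical operations, consistent with the T1/T2 summary above.
T1 = [(1, "lock", "A"), (3, "lock", "B"), (5, "unlock", "A"), (7, "unlock", "B")]
T2 = [(2, "lock", "C"), (6, "lock", "A"), (8, "unlock", "C"), (9, "unlock", "A")]

t1_point = lock_point(T1)
t2_point = lock_point(T2)
```

A transaction that locks an item after releasing another one, for example lock A, unlock A, lock B, makes `lock_point` raise `ValueError`, because it re-enters the growing phase.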
