Lecture Notes 4
Lecture Notes 4
UNIT 4
TRANSACTION MANAGEMENT
Transaction:
Transaction-The transaction is a set of logically related operation. It contains a group of tasks.
A transaction isItan action
refers toor series of of
execution actions. It isuser
any one performed by in
program a single
dbms.user to
perform operations for accessing the contents of the database.
Example: Suppose an employee of bank transfers Rs 800 from X's account to Y's account.
(Or) tasks:
This small transaction contains several low-level
Open_Account(X) (Or)
Old_Balance = X.balance
New_Balance = Old_Balance - 800
It also referred to as an event that which occur on a database with read/write operation.
X.balance = New_Balance
Close_Account(X)
Y's Account
Open_Account(Y)
Old_Balance = Y.balance
New_Balance = Old_Balance + 800
Y.balance = New_Balance
Close_Account(Y)
Operations of Transaction:
Following are the main operations of transaction:
Read(X): Read operation is used to read the value of X from the database and stores it in a
buffer in main memory.
Write(X): Write operation is used to write the value back to the database from the buffer.
Let's take an example to debit transaction from an account which consists of following
operations:
R(X);
X = X - 500; W(X); 1
DATABASE MANAGEMENT SYSTEMS
If the transaction fails after completion of T1 but before completion of T2.( say, after
write(X) but before write(Y)), then amount has been deducted from X but not added to
Y. This results in an inconsistent database state. Therefore, the transaction must be
executed in entirety in order to ensure correctness of database state.
Consistency:
The database must remain in consistence state even after performing any kind of
transaction ensuring correctness of the database.
If we execute a particular transaction in isolation (or) together with other transaction in
multiprogramming environment ,the transaction should give same result in any case.
2
DATABASE MANAGEMENT SYSTEMS
Each transaction, run by itself with no concurrent execution of other transactions, must
preserve the consistency of the database. This property is called consistency and the
DBMS assumes that it holds for each transaction. Ensuring this property of a transaction
is the responsibility of the user.
example:
When executing multiple transactions concurrently & trying to access shared resources the
system should create an order such that the only one transaction can access the shared
resource at the same time & release it after completion of it’s execution for other transaction.
This property ensures that multiple transactions can occur concurrently without leading to
inconsistency of database state. Transactions occur independently without interference.
Changes occurring in a particular transaction will not be visible to any other transaction until
that particular change in that transaction is written to memory or has been committed.
Note: To achieve isolation you should use locking mechanism among shared resources.
example:
Suppose T has been executed till Read (Y)
and then T’’ starts. As a result , interleaving
Let X= 500, Y = 500. of operations takes place due to which T’’
Consider two transactions T and T”. reads correct value of X but incorrect value
of Y and sum computed by
T’’: (X+Y = 50, 000+500=50, 500)
is thus not consistent with the sum at end
of transaction:
T: (X+Y = 50, 000 + 450 = 50, 450).
This results in database inconsistency, due
to a loss of 50 units. Hence, transactions
must take place in isolation and changes
should be visible only after a they have
been made to the main memory.
3
DATABASE MANAGEMENT SYSTEMS
Durability:
This property states that once after the transaction is completed the changes that made should
be permanent & should be recoverable even after system crash/power failure.
This property ensures that once the transaction has completed execution, the updates and
modifications to the database are stored in and written to disk and they persist even is system
failure occurs. These updates now become permanent and are stored in a non-volatile
memory.
Transaction states:
1. start
2. partially committed
3. committed
4. failed
5. aborted or terminate
4
DATABASE MANAGEMENT SYSTEMS
Active - This is the first state of transaction and here the transaction is being executed. For
example, updating or inserting or deleting a record is done here. But it is still not
saved to the database. When we say transaction it will have set of small steps, and
those steps will be executed here.
Partially Committed - This is also an execution phase where last step in the transaction is
executed. But data is still not saved to the database. In example of calculating total
marks, final display the total marks step is executed in this state.
Committed - In this state, all the transactions are permanently saved to the database. This step
is the last step of a transaction, if it executes without fail.
Failed - If a transaction cannot proceed to the execution state because of the failure of the
system or database, then the transaction is said to be in failed state. In the total mark
calculation example, if the database is not able fire a query to fetch the marks, i.e.;
very first step of transaction, then the transaction will fail to execute.
Aborted - If a transaction is failed to execute, then the database recovery system will make sure
that the database is in its previous consistent state. If not, it brings the database to
consistent state by aborting or rolling back the transaction. If the transaction fails in
the middle of the transaction, all the executed transactions are rolled back to it
consistent state before executing the transaction. Once the transaction is aborted it is
either restarted to execute again or fully killed by the DBMS.
Durability and atomicity can be ensured by using Recovery manager which is available by default in
every DBMS.
5
DATABASE MANAGEMENT SYSTEMS
2. The scheme also assumes that the database is simply a file on disk.
3. A pointer called db-pointer is maintained on disk; it points to the current copy of the
database.
4. In the shadow-copy scheme, a transaction that wants to update the database first
creates a complete copy of the database. All updates are done on the new database
copy, leaving the original copy, the shadow copy, untouched. If at any point the
transaction has to be aborted, the system merely deletes the new copy. The old copy of
the database has not been affected.
5. If the transaction completes, it is committed as follows.
6. First, the operating system is asked to make sure that all pages of the new copy of the
database have been written out to disk. (Unix systems use the flush command for this
purpose.)
7. After the operating system has written all the pages to disk, the database system
updates the pointer db-pointer to point to the new copy of the database; the new copy
then becomes the current copy of the database. The old copy of the database is then
deleted.
We now consider how the technique handles transaction and system failures.
First, consider transaction failure. If the transaction fails at any time before db-pointer is
updated, the old contents of the database are not affected. We can abort the trans- action by
just deleting the new copy of the database. Once the transaction has been committed, all the
updates that it performed are in the database pointed to by db- pointer. Thus, either all
updates of the transaction are reflected, or none of the effects are reflected, regardless of
transaction failure.
6
DATABASE MANAGEMENT SYSTEMS
Now consider the issue of system failure. Suppose that the system fails at any time before the
updated db-pointer is written to disk. Then, when the system restarts, it will read db-pointer
and will thus see the original contents of the database, and none of the effects of the
transaction will be visible on the database. Next, suppose that the system fails after db-pointer
has been updated on disk. Before the pointer is updated, all updated pages of the new copy of
the database were written to disk. Again, we assume that, once a file is written to disk, its
contents will not be damaged even if there is a system failure. Therefore, when the system
restarts, it will read db-pointer and will thus see the contents of the database after all the
updates performed by the transaction.
LOGS:
Logs keep track of actions carried out by transactions which can be used for the
recovery of database in case of failure.
Logs files should be stored always on stable storage devices.
When a transaction begins its execution it is recorded in the log as follows
<Tn, start>
7
DATABASE MANAGEMENT SYSTEMS
CONCURRENT EXECUTION:
Executing a set of transactions simultaneously in a pre emptive and time shared method.
TRANSACTION SCHEDULES:
Schedule:
Complete schedule:
Example:
T1 T2
A=1000
Read(A)
A=A+100
Write(A) Read(A)
B=A-100
Write(B)
Commit
Read(B)
Write(B)
Commit
8
DATABASE MANAGEMENT SYSTEMS
SERIALIZABILITY:
1. Conflict serializability.
2. View serializability.
1. Conflict serializability:
Conflict Equivalent: Two schedules are said to be conflict equivalent when one can be
transformed to another by swapping non-conflicting operations.
Conflict Serializable: A schedule is called conflict serializable if it can be transformed into a serial
schedule by swapping non-conflicting operations.
Conflicting operations: Two operations are said to be conflicting if all below conditions are
satisfied:
it refers to two instructions of two different transactions may want to access same data to
perform read/write operation.
If two different transactions are both for read operation, then there is no conflict and
can allowed to execute any order.
If one instruction performing read operation and other instruction performing write
operation there will be conflict hence instruction ordering is important.
If both transactions performing write operation then there will be in conflict so ordering
the transaction can be done.
9
DATABASE MANAGEMENT SYSTEMS
2. View serializability:
This is another type of serializability that can be derived by creating another schedule out of an
existing Schedule.
Two schedules S1 and S2 over the same set of transactions --any transaction that appears in
either S1 or S2 must also appear in the other are view equivalent under these conditions:
1. If Ti reads the initial value of object A in S1, it must also read the initial value of A in S2.
2. If Ti reads a value of A written by Tj in S1, it must also read the value of A written by Tj in S2.
3. For each data object A, the transaction (if any) that performs the final write on A in S1 must
also perform the final write on A in S2.
The above two schedules are view serializable or view equivalence, if the transactions in both
schedules performs the actions in similar manner.
The above two schedules satisfy result view equivalence if the two schedule produces the same
Result after execution.
Ex:
s1:R1(A),W1(A),R2(A),W2(A),R1(B),W1(B),R2(B),W2(B)
10
DATABASE MANAGEMENT SYSTEMS
If you try to the read the value which is not written on to the data base(not
committed) will leads to write-read conflict which is called dirty read operation.
In above example, T1 write operation on data item A is not committed but it is being read by
T2. So reading an uncommitted data will leads to inconsistency in database which is called dirty
read operation.
2. un repeatable reading data operation(RW conflicts):
Reading the same object twice before committing the transaction might yield an inconsistency
Unrepeatable problem means we get different values in different reads. For example in S1 say
T2 read initially x=5 then T1 updated x=1 so now T2 will read x=1 here T2 has read two
different values during consecutive reads This shouldn't have been allowed as T1 has not
committed
11
DATABASE MANAGEMENT SYSTEMS
WW conflicts if one transaction could over write the value of an object A which has
been already modified by other transaction while first transaction still in progress .this kind of
conflict refer to blind write conflict.
Recoverability:
It refers to the process of undoing the changes made to the database in case of any transaction
failure due to system crash or any other reason.
Recoverability Schedule:
1. Irrecoverable schedules.
2. Recoverable schedules with cascade rollback.
3. Cascade less recoverability.
1. Irrecoverable schedules: schedules which can't be recovered
If transaction T2 read the value updated by Transaction T1 followed by write operation commit
then this schedule is called Irrecoverable Schedule. If transaction1 failed before committing
Example:
12
DATABASE MANAGEMENT SYSTEMS
IF transaction T2 reading a value updated by T1 & commit of T2 is delay till the commit of T1, it
is called recoverable schedule with cascade roll back.
3. Cascade less recoverability:
It refers to if T2 read value updated by T1 only after T1 is committed.
Example:
13
DATABASE MANAGEMENT SYSTEMS
Implementation of Isolation:
When more than one instruction of several transaction are being executed concurrently
by using some sharable resources , the execution of instruction of one transaction
should not interrupted the execution of instruction of anther transaction.
1. Access to sharable resources should be order by using some locking mechanism:
Where one transaction locks the sharable resource before starting it’s execution &
release the lock to other transaction after completion of it’s execution.
2. Locking protocols:
Locking mechanism can be implemented by using locking protocols which defined
set of standard rule based on which transaction access, sharable resources.
1. Commit.
2. Save point.
3. Roll back.
explain about usage of above 3 commands with syntaxes.
It is a directed graph which consist of nodes G(V,E) where nodes(v) represents set of
transaction &E represents set of edges {E1,E2,….En}.
The graph contains one node for each transaction Ti. Each edge Ei is of the form
TjTk Where Tj is starting node of edge j&Tk is ending node of edge k.
An edge is constructed between nodes if one of the operation in transaction Tj
appear in the schedule before some conflicting operation in transaction Tk.
Algorithm:
1. Create a node T n the graph for each participating transaction in the schedule.
2. Draw edges from one transaction to anther transaction when satisfy anyone of the following
condition.
Condition 1:
If T1 execute write operation i.e. write(x) followed by T2 execute read operation i.e.
read(x).
Condition 2:
When T1 executes read(x) followed by T2 execute write(x).
Condition 3:
When T1 execute write(x) followed by T2 execute write(x).
3. The given schedule is serializable if there are no cycles in the precedence graph.
14
DATABASE MANAGEMENT SYSTEMS
T1 T2 T3 T4
Read(x)
Read(x)
Write(x)
Read(y)
Read(y)
Write(x)
Read(w)
Write(y)
Read(w)
Read(z)
Write(w)
Read(z)
Write(z)
T1 T2
T3 T4
15
DATABASE MANAGEMENT SYSTEMS
Sol:
S21:R2(X), R1(X),R2(Y),W2(Y),R1(Y),W1(Y)
S22:R2(X),R2(Y),R1(X),W2(Y),R1(Y),W1(Y)
S23:R2(X),R2(Y),W2(Y),R1(X),R1(Y),W1(Y)
The schedule S2 derives 3 more schedules (s21,s22,s23) which is called conflict equivalence
16
DATABASE MANAGEMENT SYSTEMS
Concurrency Control:
Types of Locks:
1. Binary locks
2. Shared /exclusive locks
Binary Locks − A lock on a data item can be in two states; it is either locked or unlocked.
Shared(S)/exclusive(X) − This type of locking mechanism differentiates the locks based
on their uses. If a lock is acquired on a data item to perform a write operation, it is an
exclusive lock. Allowing more than one transaction to write on the same data item
would lead the database into an inconsistent state. Read locks are shared because no
data value is being changed.
Lock Compatibility Matrix controls whether multiple transactions can acquire locks on
the same resource at the same time.
Shared Exclusive
Shared True False
Exclusive False False
If a resource is already locked by another transaction, then a new lock request can be
granted only if the mode of the requested lock is compatible with the mode of the
existing lock.
Any number of transactions can hold shared locks on an item, but if any transaction
holds an exclusive lock on item, no other transaction may hold any lock on the item.
compatible locks held by other transactions have been released. Then the lock is
granted.
17
DATABASE MANAGEMENT SYSTEMS
Lock Granularity :
A database is basically represented as a collection of named data items. The size of the data
item chosen as the unit of protection by a concurrency control program is called GRANULARITY.
Locking can take place at the following level :
Database level.
Table level.
Page level.
Row (Tuple) level.
Attributes (fields) level.
18
DATABASE MANAGEMENT SYSTEMS
Locking protocols:
Simplistic lock-based protocols allow transactions to obtain a lock on every object before a
'write' operation is performed. Transactions may unlock the data item after completing the
‘write’ operation.
1. deadlocks
2. starvation
Pre-claiming protocols evaluate their operations and create a list of data items on which they
need locks. Before initiating an execution, the transaction requests the system for all the locks
it needs beforehand.
If all the locks are granted, the transaction executes and releases all the locks when all its
operations are over. If all the locks are not granted, the transaction rolls back and waits until all
the locks are granted.
19
DATABASE MANAGEMENT SYSTEMS
This locking protocol divides the execution phase of a transaction into three parts.
In the first part, when the transaction starts executing, it seeks permission for the
locks it requires.
The second part is where the transaction acquires all the locks. As soon as the
transaction releases its first lock, the third phase starts.
In third phase, the transaction cannot demand any new locks; it only releases the
acquired locks.
Two-phase locking has two phases, one is growing, where all the locks are being acquired by
the transaction; and the second phase is shrinking, where the locks held by the transaction are
being released.
To claim an exclusive (write) lock, a transaction must first acquire a shared (read) lock and then
upgrade it to an exclusive lock.
20
DATABASE MANAGEMENT SYSTEMS
phase 1: The first phase of Strict-2PL is same as 2PL i.e. when the transaction starts executing, it
seeks permission for the locks it requires.
phase 2:After acquiring all the locks in the first phase, the transaction continues to execute
normally.
phase 3: But in contrast to 2PL, Strict-2PL does not release a lock after using it. Strict-2PL holds
all the locks until the commit point and releases all the locks at a time.
Note: It releases only all exclusive locks but not shared locks after a transaction is committed .
Conservative Two – Phase Locking Protocol is also called as Static Two – Phase Locking
Protocol.
This protocol is almost free from deadlocks as all required items are listed in advanced.
It requires locking of all data items to access before the transaction starts.
21
DATABASE MANAGEMENT SYSTEMS
If a transaction is holding an exclusive lock over an object .It can simply downgrade from
exclusive lock to shared lock after completion of its updation
Similarly a shared lock can be upgraded to exclusive lock on particular data item. when
there is no other transaction is holding exclusive lock on same data item
Strict 2 phase locking protocol can be executed serial/concurrent execution of
transaction
examples for serial and concurrent execution are shown below:
T1 T2 T1 T2
S(A) S(A)
R(A) R(A)
X(A) X(A)
W(A) W(A)
COMMIT commit
X(A) X(A)
W(A) W(A)
COMMIT commit
Serial Concurrent
IMPLEMENTING LOCKS:
Every DBMS maintains a lock manager which maintain two tables called lock table and
transaction table
Lock table consist of information regarding locks on data item holding:
1. No. of transaction holding lock
2. Nature of lock(shared or exclusive)
3. Pointer to the no. of locks requested in queue in given object.
Transaction table:
Transaction table contain list of transactions and their corresponding locks assigned.
22
DATABASE MANAGEMENT SYSTEMS
The most commonly used concurrency protocol is the timestamp based protocol. This
protocol uses either system time or logical counter as a timestamp.
Every transaction has a timestamp associated with it, and the ordering is determined by
the age of the transaction.
This lets the system know when the last ‘read and write’ operation was performed on
the data item.
The TO Protocol ensures serializability among transactions in their conflicting read and
write operations.
The transaction of timestamp (T) is denoted as TS(T).
Data item (X) of read timestamp is denoted by R–timestamp(X).
Data item (X) of write timestamp is denoted by W–timestamp(X).
The below assumptions in Time stamp based ordering protocol are based on THOMAS WRITE
RULE.
23
DATABASE MANAGEMENT SYSTEMS
This rule states if TS(Ti) < W-timestamp(X), then the operation is rejected and Ti is rolled back.
Time-stamp ordering rules can be modified to make the schedule view serializable.
Following are the three basic variants of timestamp-based methods of concurrency control:
24
DATABASE MANAGEMENT SYSTEMS
1. Read phase:
In this phase transaction is executed and all the result will be stored in temporary variables
local to transactions.
2. validation phase:
In this phase the transaction operations are validated without violating the serializability.
3. write phase:
In this phase when a transaction is validated successfully all the values of temporary
variables is updated in the actual data base.
Validation phase:
A transaction is validated based on following time stamp
1. start(ti):
The time at which the transaction ti started it’s execution.
2. validation(ti):
The time at which ti is valid.
3. finish(ti):
The time at which ti finish it’s write operation on the actual data base its execution.
Among two transactions ti&tj, the transactions ti is validated. If it satisfy one of the two
conditions.
1. finish(ti)<start(tj)
2. Start(tj)<finish(ti)<validation(tj)
25
DATABASE MANAGEMENT SYSTEMS
The below example shows the interleaved execution of 3 phases of 2 transactions in which
transaction t14 is validated.
T15
T14
Read(B)
Read(B)
B=B-50
Read(A)
A=A+50
Read(A)
Validate
Display(A+B)
Validate
Write(B)
Write(A)
i. This technique is very efficient when conflicts are rare. The occasional conflicts result in
the transaction roll back.
ii. The rollback involves only the local copy of data, the database is not involved and thus
there will not be any cascading rollbacks.
i. Conflicts are expensive to deal with, since the conflicting transaction must be rolled
back.
ii. Longer transactions are more likely to have conflicts and may be repeatedly rolled
back because of conflicts with short transactions.
26
DATABASE MANAGEMENT SYSTEMS
DEAD LOCKS:
Consider two transaction t1 and t2.if t1 holds lock on data item x and t2 holds lock on data
item y now t1 refers lock over y & t2 request lock over x then deadlock situation occur when
none of the transaction are ready to release locks on x ,y.
The following two techniques can be used for deadlock handling(prevention):
1. wait-die
2. wait-wound
1. wait-die:
In wait die technique the older transaction waited in queue &younger will die.
The older transaction waits for the younger if the younger has accessed the granule first.
The younger transaction is aborted (dies) and restarted if it tries to access a granule
after an older concurrent transaction.
The wait-die based on time stamp of the transaction request for conflicting resources.
For example:
Suppose that transaction T22, T23, T24 have time-stamps 5, 10 and 15 respectively. If T22requests
a data item held by T23 then T22 will wait. If T24 requests a data item held by T23, then T24 will be
rolled back.
For example:
Suppose that Transactions T22, T23, T24 have time-stamps 5, 10 and 15 respectively . If
T22requests a data item held by T23, then data item will be preempted from T23 and
T23 will be rolled back. If T24 requests a data item held by T23, then T24 will wait.
Here the younger transactions are made to wait in queue& older transaction going to abort.
1) Ts (t1) <ts (t2): t2 will be in waiting state &t1 in abort.
2) Ts (t1)>ts (t2):t1 will be in waiting & t2 in abort.
27
DATABASE MANAGEMENT SYSTEMS
This is a simple method available to track if any deadlock situation may arise.
For each transaction entering into the system, a node is created.
When a transaction Ti requests for a lock on an item, say X, which is held by some other
transaction Tj, a directed edge is created from Ti to Tj. If Tj releases item X, the edge
between them is dropped and Ti locks the data item.
The system maintains this wait-for graph for every transaction waiting for some data
items held by others. The system keeps checking if there's any cycle in the graph.
First, do not allow any request for an item, which is already locked by another
transaction. This is not always feasible and may cause starvation, where a transaction
indefinitely waits for a data item and can never acquire it.
The second option is to roll back one of the transactions. It is not always feasible to roll
back the younger transaction, as it may be important than the older one. With the help
of some relative algorithm, a transaction is chosen, which is to be aborted. This
transaction is known as the victim and the process is known as victim selection.
28
DATABASE MANAGEMENT SYSTEMS
CRASH RECOVERY:
Transaction failure
A transaction has to abort when it fails to execute or when it reaches a point from where it
can’t go any further. This is called transaction failure where only a few transactions or
processes are hurt.
Storage Structure:
29
DATABASE MANAGEMENT SYSTEMS
Recovery of data:
When a database is recovered after a failure it should ensure the atomicity property &
following should be done after a crash.
1) we should check the status of all transactions whether they are executed completely
or partially
2) Check for the transaction which are in the middle of execution & should take care of
atomicity property with transaction.
3) We should check whether there are any transactions which can be completed after
recovery.
4) If such transactions are there we should be rollback to previous commit point that
allowed for execution.
5) The recovery of database can be done in 2 ways:
1. By using logs
2. by using shadow paging technique.
30
DATABASE MANAGEMENT SYSTEMS
Logs keep track of actions carried out by transactions which can be used for the recovery of
database in case of failure.
Logs files should be stored always on stable storage devices.
When a transaction begins its execution it is recorded in the log as follows
<Tn, start>
31
DATABASE MANAGEMENT SYSTEMS
Shadow Paging:
Execution of Transaction
32
DATABASE MANAGEMENT SYSTEMS
33
DATABASE MANAGEMENT SYSTEMS
While recovering concurrent transaction it is difficult to recover by using lock files so along with
lock files check points are considered for the recovery of concurrent transaction.
Check point:
It is a point at a time where all transaction are committed & the database in consistence state.
While recovering start from the end transaction till it reaches any check point.
During this process categorized each transaction into UNDO/REDO list.
All the transactions in UNDO list should not be saved.
All the transaction in Redo list should saved and rollback then.
<Tn,start>undo list
<Tn, start>
: Redo list
<Tn, commit>
Granularity:
It refers to dividing the database into a hierarchy of data items on which locks can be applied as
a whole or individual data item.
We can divide database hierarchy files into pages and each page consists of record.
34
DATABASE MANAGEMENT SYSTEMS
• Steal: if a frame is dirty and chosen for replacement, the page it contains is
written to disk even if the modifying transaction is still active.
• No-force: Pages in the buffer pool that are modified by a transaction are not
forced to disk when the transaction commits.
Algorithms for Recovery and Isolation Exploiting Semantics, or ARIES is a recovery algorithm
designed to work with a no-force, steal database approach.
1. Analysis
The analysis step identifies the dirty (updated) pages in the buffer, and the set of
transactions active at the time of the crash. The appropriate point in the log where the
REDO operation should start is also determined
2. REDO
The REDO phase actually reapplies updates from the log to the database. Generally, the
REDO operation is applied to only committed transactions. However, in ARIES, this is not
the case. Certain information in the ARIES log will provide the start point for REDO, from
which REDO operations are applied until the end of the log is reached. In addition,
information stored by ARIES and in the data pages will allow ARIES to determine
whether the operation to be redone has actually been applied to the database and
hence need not be reapplied. Thus only the necessary REDO operations are applied
during recovery.
3. UNDO
During the UNDO phase, the log is scanned backwards and the operations of
transactions that were active at the time of the crash are undone in reverse order. The
information needed for ARIES to accomplish its recovery procedure includes the log, the
Transaction Table, and the Dirty Page Table. In addition, check pointing is used. These
two tables are maintained by the transaction manager and written to the log during
check pointing.
35
DATABASE MANAGEMENT SYSTEMS
1. page table
3. pageLSN
4. RedoLSN
5. Transaction Table
6. Checkpoint Log
For efficient recovery, we need Transaction table and Dirty Page table .
The Transaction Table contains an entry for each active transaction, with information such as
the transaction ID, transaction status, and the LSN of the most recent log record for the
transaction.
The Dirty Page Table contains an entry for each dirty page in the buffer, which includes the
page ID and the LSN corresponding to the earliest update to that page.
36
DATABASE MANAGEMENT SYSTEMS
This Checkpoint log file is accessed during recovery to locate the last checkpoint information.
Information from the last checkpoint is first accessed through the special file. The analysis
phase starts at the begin_checkpoint record and proceeds to the end of the log. When the
end_checkpoint record is encountered, the Transaction Table and Dirty Page Table are accessed
(recall that these tables were written in the log during checkpointing). During analysis, the log
records being analyzed may cause modifications to these two tables. For instance, if an end log
record was encountered for a transaction T in the Transaction Table, then the entry for T is
deleted from that table. If some other type of log record is encountered for a transaction T ,
then an entry for T is inserted into the Transaction Table, if not already present, and the last
LSN field is modified. If the log record corresponds to a change for page P, then an entry would
be made for page P (if not present in the table) and the associated LSN field would be modified.
When the analysis phase is complete, the necessary information for REDO and UNDO has been
compiled in the tables.
ARIES starts redoing at a point in the log where it knows (for sure) that previous changes to
dirty pages have already been applied to the database on disk. It can determine this by finding
the smallest LSN, M, of all the dirty pages in the Dirty Page Table, which indicates the log
position where ARIES needs to start the REDO phase. Any changes corresponding to an LSN <
M, for redoable transactions, must have already been propagated to disk or already been
overwritten in the buffer; otherwise, those dirty pages with that LSN would be in the buffer
(and the Dirty Page Table). So, REDO starts at the log record with LSN = M and scans forward to
the end of the log. For each change recorded in the log, the REDO algorithm would verify
whether or not the change has to be reapplied. For example, if a change recorded in the log
pertains to page P that is not in the Dirty Page Table, then this change is already on disk and
does not need to be reapplied. Or, if a change recorded in the log (with LSN = N, say) pertains to
page P and the Dirty Page Table contains an entry for P with LSN greater than N, then the
change is already present. If neither of these two conditions hold, page P is read from disk and
37
DATABASE MANAGEMENT SYSTEMS
the LSN stored on that page, LSN(P), is compared with N. If N < LSN(P), then the change has
been applied and the page does not need to be rewritten to disk.
Once the REDO phase is finished, the database is in the exact state that it was in when the
crash occurred. The set of active transactions—called the undo_set—has been identified in the
Transaction Table during the analysis phase.
Now, the UNDO phase proceeds by scanning backward from the end of the log and undoing the
appropriate actions. A compensating log record is written for each action that is undone. The
UNDO reads backward in the log until every action of the set of trans-actions in the undo_set
has been undone. When this is completed, the recovery process is finished and normal
processing can begin again.
Example:
Consider the recovery example shown in Figure 23.5. There are three transactions: T1, T2, and
T3. T1 updates page C, T2 updates pages B and C, and T3 updates page A.
38
DATABASE MANAGEMENT SYSTEMS
Figure 23.5(a) shows the partial contents of the log, and Figure 23.5(b) shows the contents of
the Transaction Table and Dirty Page Table. Now, suppose that a crash occurs at this point.
Since a checkpoint has occurred, the address of the associated begin_checkpoint record is
retrieved, which is location 4. The analysis phase starts from location 4 until it reaches the
end. The end_checkpoint record would contain the Transaction Table and Dirty Page Table in
Figure 23.5(b), and the analysis phase will further reconstruct these tables. When the analysis
phase encounters log record 6, a new entry for transaction T3 is made in the Transaction Table
and a new entry for page A is made in the Dirty Page Table. After log record 8 is analyzed, the
status of transaction T2 is changed to committed in the Transaction Table. Figure 23.5(c) shows
the two tables after the analysis phase.
For the REDO phase, the smallest LSN in the Dirty Page Table is 1. Hence the REDO will start
at log record 1 and proceed with the REDO of updates. The LSNs {1, 2, 6, 7} corresponding to
the updates for pages C, B, A, and C, respectively, are not less than the LSNs of those pages (as
shown in the Dirty Page Table). So those data pages will be read again and the updates
reapplied from the log (assuming the actual LSNs stored on those data pages are less then the
corresponding log entry). At this point, the REDO phase is finished and the UNDO phase starts.
From the Transaction Table (Figure 23.5(c)), UNDO is applied only to the active transaction T3.
The UNDO phase starts at log entry 6 (the last update for T3) and proceeds backward in the log.
The backward chain of updates for transaction T3 (only log record 6 in this example) is followed
and undone.
39