Concurrency Control Techniques
Advanced Database Systems
Concurrency Control Techniques
 Concurrency Control is the process of managing simultaneous
  operations on the database without having them interfere with one
  another.
 Prevents interference when two or more users are accessing database
  simultaneously and at least one is updating data.
 Although two transactions may be correct in themselves, interleaving
  of operations may produce an incorrect result.
 Three basic concurrency control techniques:
    Locking methods
    Time stamping
    Optimistic
Locking Method
 The locking method is a mechanism for preventing simultaneous access on a
  shared resource for a critical operation
 A LOCK is a mechanism for enforcing limits on access to a resource in an
  environment where there are many threads of execution.
 Transaction uses locks to deny access to other transactions and so prevent
  incorrect updates.
 Lock prevents another transaction from modifying item or even reading it
 Lock (X): If a transaction T1 applies Lock on data item X, then X is locked and it
  is not available to any other transaction.
 Unlock (X): T1 Unlocks X. X is available to other transactions.
Types of Lock
Shared lock
 Another transaction that tries to read the same data is permitted to read, but a
  transaction that tries to update the data will be prevented from doing so until the
  shared lock is released.
 Shared lock is also called read lock, used for reading data items only.
 Shared lock ensure that a record is not in process of being updated during a read-
  only request.
 Shared locks can also be used to prevent any kind of updates of record.
 It is denoted by Lock-S.
 For example, consider a case where initially A=100 and there are two transactions which
  are reading A. If one of transaction wants to update A, in that case other transaction would
  be reading wrong value. However, Shared lock prevents it from updating until it has
  finished reading.
Types of Lock
Exclusive Lock
 Two write operations from two different transactions or a write from T1 and a read
  from T2 are not allowed.
 Multiple transactions do not modify the same data simultaneously.
 Any transaction that requires an exclusive lock must wait if another transaction
  currently owns an exclusive lock or a shared lock against the requested resource.
 For example, consider a case where initially A=100 when a transaction needs to
  deduct 50 from A. We can allow this transaction by placing X lock on it.
  Therefore, when the any other transaction wants to read or write, exclusive lock
  prevent it.
 When these locks are applied, then a transaction must behave in a special
  way. This special behavior of a transaction is referred to as well-formed.
Locking - Basic Rules
 If transaction has a shared lock on an item, it can read but not update
  the item.
 If a transaction has an exclusive lock on an item, it can both read and
  update the item.
 Reads cannot conflict, so more than one transaction can hold shared
  locks simultaneously on same item.
 Exclusive lock gives transaction exclusive access to that item.
 Some systems allow transaction to upgrade a shared lock to an
  exclusive lock, or vice-versa.
Locking methods: problems
 Deadlock
 A deadlock that may result when two (or more) transactions are each waiting for locks
  held by the other to be released.
 Deadlock - possible solutions
 Only one way to break deadlock: abort one or more of the transactions in the deadlock.
 Deadlock should be transparent to user, so DBMS should restart transaction(s).
 Timeout
  The deadlock detection could be done using the technique of TIMEOUT. Every
  transaction will be given a time to wait in case of deadlock. If a transaction waits for the
  predefined period of time in idle mode, the DBMS will assume that deadlock occurred
  and it will abort and restart the transaction.
Time-Stamping methods
Timestamp: a unique identifier created by DBMS that indicates relative starting time of a
transaction.
Can be generated by:
     using system clock at the time transaction started, or
     Incrementing a logical counter every time a new transaction starts.
Time-stamping: A concurrency control protocol that orders transactions in such a way
that older transactions, transactions with smaller time stamps, get priority in the event of
conflict.
 Conflict is resolved by rolling back and restarting transaction.
 Since there is no need to use lock there will be No Deadlock.
 Let's assume there are two transactions T1 and T2. Suppose the transaction T1 has
  entered the system at 007 times and transaction T2 has entered the system at 009 times.
  T1 has the higher priority, so it executes first as it is entered the system first.
Multi-version concurrency control (MVCC)
 Is a database optimization technique that creates duplicate copies of
  records so that data can be safely read and updated at the same time.
 Keep the old values of a data item when the item is updated.
 With MVCC, DBMS reads and writes don’t block each other.
 Advantages
    diminished need for database locks;
    reduced number of database deadlocks.
 Drawbacks
    Concurrent update control methods are difficult to implement.
    The database grows in size and becomes bloated by multiple versions of DBMS
     records.
How does an MVCC database work
 Every database record has a version number.
 Concurrent reads happen against the record with the highest version
  number.
 Write operations operate on copy of the record, not the record itself.
 Users continue to read the older version while the copy is updated.
 After the write operation is successful, the version id is incremented.
 Subsequent concurrent reads use the updated version.
 When a new update occurs, a new version is again created, continuing the
  cycle.
Optimistic Technique
 Locking and assigning and checking timestamp values may be unnecessary
  for some transactions.
 When transaction reaches the level of executing commit, a check is
  performed to determine whether conflict has occurred. If there is a conflict,
  transaction is rolled back and restarted.
 Based on assumption that conflict is rare and more efficient to let
  transactions proceed without delays to ensure serializability.
 At commit, check is made to determine whether conflict has occurred.
 If there is a conflict, transaction must be rolled back and restarted.
 Potentially allows greater concurrency
Optimistic Technique
Three phases:
1. Optimistic Techniques - Read Phase
    Extends from start until immediately before commit.
    Transaction reads values from database and stores them in local variables. Updates are applied to a
     local copy of the data.
2. Optimistic Techniques - Validation Phase
    Follows the read phase.
    For read-only transaction, checks that data read are still current values.
     If no interference, transaction is committed, else aborted and restarted.
    For update transaction, checks transaction leaves database in a consistent state, with serializability maintained.
3. Optimistic Techniques - Write Phase
    Follows successful validation phase for update transactions.
    Updates made to local copy are applied to the database.
Granularity of data items
Granularity is the size of the data items chosen as the unit of protection by a
concurrency control protocol.
It could be:
 The entire database
 A file
 A page (a section of physical disk in which relations are stored)(sometimes also called a
   block)
 A record
 A field value of a record
 The granularity has effect on the performance of the system.
Granularity of data items
 As locking will prevent access to the data, the size of the data
  required to be locked will prevent other transactions from having
  access.
 If the entire database is locked, then consistency will be highly
  maintained but less performance of the system will be witnessed.
 If a single data item is locked; consistency maybe at risk but
  concurrent processing and performance will be enhanced.
 Thus, as one go from the entire database to a single value,
  performance and concurrent processing will be enhanced but
  consistency will be at risk and needs good concurrency control
  mechanism and strategy.
Using Locks for Concurrency Control in Indexes (Reading)