EXERCISE 1
Construct a B+-tree for the following set of values
(2, 3, 5, 7, 11, 17, 19, 23, 29, 31)
Assume that the tree is initially empty and values are inserted in ascending order.
1) Construct B+-trees for the cases where the number m of pointers that will fit a node is as follows:
a. Four
b. Seven
2) Shows the form of the B+-tree after each operation of the sequence:
Insert 9; Insert 10; Insert 8; Insert 6; Insert 1; Insert 4
for the case m=4.
Solution
Point 1)
. m/2 <= p <= m
k k parent node
<k >=k and <k
leaf node
(m1)/2 <= k <= m1
File
B+-tree:
Not a root or a leaf node: m/2 <= p <= m , where p is a pointer.
Leaf node: (m1)/2 <= k <= m1, where k is a key.
Root: p>= 2 if it is not a leaf
0<=k<= (m1) if it is a leaf.
a) m=4 Internal node: 2<=p<=4 Root: p>= 2 if it is not a leaf;
Leaf node: 2<=k<=3 0<=k<= 3 if it is a leaf
B+tree: Insert 2, 3, 5
2| 3 | 5
B+tree: Insert 7 (leaf split)
5 | |
N N
2 | 3 | 5 |7 |
Split a leaf node:
Let K1, , K m be the set of keys in the ascending order (i.e. 2, 3, 5, 7)
Node N: K1, , K m/2 -1, K m/2 (i.e. 2, 3)
Node N: K m/2+1, , K m (i.e. 5, 7)
Insert K m/2 +1 into the parent node (i.e. 5)
B+tree: Insert 11, 17, 19, 23, 29
5 | 11 | 19
2 | 3 | 5 | 7 | 11 | 17 | 19 | 23 | 29
B+tree: Insert 31 (leaf node split + non leaf node split)
11 | |
N N
5| | 19 | 29 |
2|3| 5|7| 11 | 17 | 19 | 23| 29 | 31 |
Split an internal node N:
Let K1, , K m be the set of keys in the ascending order (i.e. 5, 11, 19, 29)
Node N: K1, , K m/2-1 (i.e. 5)
Node N: K m/2+1, , K m (i.e. 19, 29)
Insert K m/2 into the parent node of N (i.e. 11)
Note that since values are inserted in ascending order, leaves have the minimum number
of keys except the last leaf.
b) Seven.
m=7 Intermediate node : 4<=p<=7 Leaf node: 3<=k<=6
Root: p>= 2 if it is not a leaf ;
0<=k<= 6 if it is a leaf
B+tree: Insert 2, 3, 5, 7, 11, 17
2 | 3 | 5 | 7 | 11 | 17
B+tree: Insert 19 (leaf split)
11 | | | | |
2| 3 | 5| 7| | 11 | 17 | 19 | | |
B+tree: Insert 23, 29, 31
11 | | | | |
2| 3 | 5| 7| | 11 | 17 | 19 | 23 | 29 | 31
Point 2) B+-tree 11 | |
5| | 19 | 29 |
2|3| 5|7| 11 | 17 | 19 | 23| 29 | 31
|
B+tree: Insert 9
11 | |
5| | 19 | 29 |
2|3| 5|7|9 11 | 17 | 19 | 23| 29 | 31 |
B+tree: Insert 10 (leaf split)
11 | |
5| 9| 19 | 29 |
2|3| 5|7| 9 | 10 | 11 | 17 | 19 | 23| 29 | 31 |
B+tree: Insert 8
11 | |
5| 9| 19 | 29 |
2|3| 5|7|8 9 | 10 | 11 | 17 | 19 | 23| 29 | 31|
B+tree: Insert 6 (leaf split)
11 | |
5| 7|9 19 | 29 |
2|3| 5|6| 7|8| 9 | 10 | 11 | 17| 19 | 23| 29 | 31|
B+tree: Insert 11 | |
5| 7|9 19 | 29 |
1| 2 | 3 5|6| 7|8| 9 | 10 | 11 | 17 | 19 | 23| 29 | 31 |
B+tree: Insert 4 (leaf node split + non leaf node split)
5 | 11 |
3 | | 7|9| 19 | 29 |
1| 2 | 3|4| 5|6| 7|8| 9 | 10 | 11 | 17| 19 | 23| 29 | 31
| |
EXERCISE 2 (leaf merge, non leaf merge, leaf keys redistribution)
For the following B+-tree (m = 5) show the form of the tree after each of the of operations of the
sequence:
Delete 17; Delete 20; Delete 34
What is the cost in terms of block transfers for each operation?
B+-tree
17 | | |
< >=
5 | 14 | |
24 | 30 | |
2|3| | 5 |7 |8 | 14 | 16 | | 17 | 20 | 22 | 24 | 27 | 29 | 30 | 34 |
|
Solution
m=5 Intermediate node : 3<=p<=5 Leaf node: 2<=k<=4
Root: p>= 2 if it is not a leaf ;
0<=k<= 4 if it is a leaf
Delete k:
Find the leaf node that contains k;
Delete k from the node;
If the node has too few entries
1) merge nodes (if possible)
2) otherwise redistribute keys
B+tree: Delete 17
17 | | |
5 | 14 | |
24 | 30 | |
2|3| | 5 |7 |8 | 14 | 16 | | 20 | 22 | | 24 | 27 | 29 | 30 | 34 | |
Cost = 3 read + 1 write = 4
B+tree: Delete 20 (leaf merge) 17 | | |
5 | 14 | | k
24 | 30 | |
2|3| | 5 |7 |8 | 14 | 16 | | 20 | 22 | | 24 | 27 | 29 | 30 | 34 | |
N N
Leaf merge:
Let N be the predecessor; let N be the successor
Let k be the value between the two nodes N and N in the parent
Append all keys to N
Delete (k, pointer to N) from the parent
Delete N
17 | | |
Q
Q
5 | 14 | |
30 | | |
2|3| | 5 |7 |8 | 14 | 16 | | 22 | 24 | 27 |29 30 | 34 | |
Non leaf merge:
Node Q too few pointers
Let Q predecessor and Q successor
Let K be the value between the two nodes in parent of Q
Append K and all pointers and values of Q to Q
Delete (K, pointer) from the parent
Delete Q K
17 | | |
Q
Q
5 | 14 | 17 | 30
30 | | |
2|3| | 5 |7 |8 | 14 | 16 | | 22 | 24 | 27 |29 30 | 34 | |
| | |
5 | 14 | 17 | 30
2|3| | 5 |7 |8 | 14 | 16 | | 22 | 24 | 27 |29 30 | 34 | |
Root has only one child. Root can be deleted.
5 | 14 | 17 | 30
2|3| | 5 |7 |8 | 14 | 16 | | 22 | 24 | 27 |29 30 | 34 | |
Cost: 5 read + 2 write = 7
Delete 34: keys redistribution (leaf)
K
5 | 14 | 17 | 30
2|3| | 5 |7 |8 | 14 | 16 | | 22 | 24 | 27 | 29 30 | 34 | |
N (Pj,Kj) N
)
The node N has too few values
Let N be the previous or next child of parent of N
Let K the value between N and N in the parent
Entries of N and N cannot fit in a single node.
We apply redistribution of keys
N borrows an entry from N (assume N predecessor of N)
Let j such that (Pj, Kj) is the last (pointer, value) in N
Remove (Pj, Kj) from N
Insert (Pj, Kj) as first value in N
Replace K by Kj in the parent
5 | 14 | 17 | 29
2|3| | 5 |7 |8 | 14 | 16 | | 22 | 24 | 27 | 29 | 30 | |
Cost = 3 read + 3 write = 6
EXERCISE 3 (non leaf keys redistribuiton)
Show the form of the B+-tree (m=5) after the operation Delete 20
What is the cost of the operation?
17 | | |
1 | 3 | 5 | 14
24 | 30 | |
-1 | 0 | | 1 |2 | | 3 |4 | | 5 |8 | | 14 |16 | 20 | 22 | 24 | 27 | 29 | 30 | 34 | |
| | 39339
Solution
m=5 Intermediate node : 3<=p<=5 Leaf node: 2<=k<=4
Root: p>= 2 if it is not a leaf ;
0<=k<= 4 if it is a leaf
Delete 20
17 | | |
1 | 3 | 5 | 14
24 | 30 | |
-1 | 0 | | 1 |2 | | 3 |4 | | 5 |8 | | 14 |16 | 20 | 22 | 24 | 27 | 29 | 30 | 34 | |
| | 39339
N N
The node N has too few values
Merge between N and N
K
17 | | |
Q
Q
1 | 3 | 5 | 14 (Kj-1,pj)
30 | | |
-1 | 0 | | 1 |2 | | 3 |4 | | 5 |8 | | 14 |16 | 22| 24 | 27 | 29 30 | 34 | |
| 39339
The node Q has too few pointers
No merge with previous or net child of parent of Q
Redistribution of keys
Q borrows an entry from Q
Let j be such that (Kj-1, pj) is the last value pointer in Q
Let K the value between Q and Q in parent of Q
Insert (pj, K) as first value of Q
Remove (Kj-1, pj) from Q
Replace K with Kj-1 in parent of Q
14 | | |
1 | 3 |5 |
17 | 30 | |
-1 | 0 | | 1 |2 | | 3 |4 | | 5 |8 | | 14 |16 | 22| 24 | 27 | 29 30 | 34 | |
| 39339
Cost= 5 read + 4 write = 9