# Module 2: Essential Data Structures
## Module Overview
This module covers fundamental data structures that form the building blocks of
efficient algorithms. Students will learn how different data structures store and
organize data, their operations, time complexities, and appropriate use cases.
## Learning Objectives
By the end of this module, students will be able to:
- Understand the characteristics and operations of common data structures
- Select appropriate data structures for specific problems
- Implement basic data structures from scratch
- Analyze the time and space complexity of data structure operations
- Use built-in data structure implementations effectively
## Lesson 1: Arrays and Dynamic Arrays
### Concepts
- Static arrays vs. dynamic arrays
- Memory allocation and management
- Array operations and their time complexities
- Multi-dimensional arrays
- Common array patterns and techniques
### Interactive Elements
- **Visualization**: Array memory allocation and resizing
- **Interactive Tool**: Explore array operations and their performance
- **Quiz**: Array operations and their time complexities
### Coding Exercise
```python
# Implementing a dynamic array (similar to ArrayList in Java or vector in C++)
class DynamicArray:
def __init__(self):
self.size = 0
self.capacity = 10
self.array = [None] * self.capacity
def get(self, index):
if index < 0 or index >= self.size:
raise IndexError("Index out of bounds")
return self.array[index]
def set(self, index, value):
if index < 0 or index >= self.size:
raise IndexError("Index out of bounds")
self.array[index] = value
def append(self, value):
if self.size == self.capacity:
self._resize(2 * self.capacity)
self.array[self.size] = value
self.size += 1
def _resize(self, new_capacity):
new_array = [None] * new_capacity
for i in range(self.size):
new_array[i] = self.array[i]
self.array = new_array
self.capacity = new_capacity
def __str__(self):
return str([self.array[i] for i in range(self.size)])
# Test the dynamic array
dynamic_array = DynamicArray()
for i in range(15):
dynamic_array.append(i)
print(f"Dynamic array: {dynamic_array}")
print(f"Element at index 7: {dynamic_array.get(7)}")
dynamic_array.set(5, 100)
print(f"After setting index 5 to 100: {dynamic_array}")
```
## Lesson 2: Linked Lists
### Concepts
- Singly linked lists
- Doubly linked lists
- Circular linked lists
- Linked list operations and their time complexities
- Comparison with arrays
### Interactive Elements
- **Visualization**: Linked list operations (insertion, deletion, traversal)
- **Interactive Tool**: Build and manipulate linked lists
- **Quiz**: Linked list operations and their applications
### Coding Exercise
```python
# Implementing a singly linked list
class Node:
def __init__(self, data):
self.data = data
self.next = None
class LinkedList:
def __init__(self):
self.head = None
def append(self, data):
new_node = Node(data)
if not self.head:
self.head = new_node
return
current = self.head
while current.next:
current = current.next
current.next = new_node
def prepend(self, data):
new_node = Node(data)
new_node.next = self.head
self.head = new_node
def delete(self, data):
if not self.head:
return
if self.head.data == data:
self.head = self.head.next
return
current = self.head
while current.next and current.next.data != data:
current = current.next
if current.next:
current.next = current.next.next
def display(self):
elements = []
current = self.head
while current:
elements.append(current.data)
current = current.next
return elements
# Test the linked list
linked_list = LinkedList()
linked_list.append(1)
linked_list.append(2)
linked_list.append(3)
linked_list.prepend(0)
print(f"Linked list: {linked_list.display()}")
linked_list.delete(2)
print(f"After deleting 2: {linked_list.display()}")
```
## Lesson 3: Stacks and Queues
### Concepts
- Stack data structure and LIFO principle
- Queue data structure and FIFO principle
- Deque (double-ended queue)
- Priority queue
- Applications of stacks and queues
### Interactive Elements
- **Visualization**: Stack and queue operations
- **Interactive Tool**: Solve problems using stacks and queues
- **Quiz**: Identifying appropriate use cases for stacks vs. queues
### Coding Exercise
```python
# Implementing a stack
class Stack:
def __init__(self):
self.items = []
def push(self, item):
self.items.append(item)
def pop(self):
if not self.is_empty():
return self.items.pop()
return None
def peek(self):
if not self.is_empty():
return self.items[-1]
return None
def is_empty(self):
return len(self.items) == 0
def size(self):
return len(self.items)
# Implementing a queue
class Queue:
def __init__(self):
self.items = []
def enqueue(self, item):
self.items.append(item)
def dequeue(self):
if not self.is_empty():
return self.items.pop(0)
return None
def peek(self):
if not self.is_empty():
return self.items[0]
return None
def is_empty(self):
return len(self.items) == 0
def size(self):
return len(self.items)
# Test stack and queue
stack = Stack()
for i in range(1, 6):
stack.push(i)
print(f"Stack: {stack.items}")
print(f"Pop: {stack.pop()}")
print(f"Stack after pop: {stack.items}")
queue = Queue()
for i in range(1, 6):
queue.enqueue(i)
print(f"Queue: {queue.items}")
print(f"Dequeue: {queue.dequeue()}")
print(f"Queue after dequeue: {queue.items}")
```
## Lesson 4: Sets and Maps
### Concepts
- Set data structure and operations
- Map/Dictionary data structure and operations
- Hash tables and hash functions
- Collision resolution strategies
- Time complexity analysis of hash-based structures
### Interactive Elements
- **Visualization**: Hash table operations and collision resolution
- **Interactive Tool**: Explore hash function behavior with different inputs
- **Quiz**: Set and map operations and their applications
### Coding Exercise
```python
# Using built-in sets and dictionaries in Python
# Set operations
set_a = {1, 2, 3, 4, 5}
set_b = {4, 5, 6, 7, 8}
union = set_a | set_b
intersection = set_a & set_b
difference = set_a - set_b
symmetric_difference = set_a ^ set_b
print(f"Set A: {set_a}")
print(f"Set B: {set_b}")
print(f"Union: {union}")
print(f"Intersection: {intersection}")
print(f"Difference (A-B): {difference}")
print(f"Symmetric Difference: {symmetric_difference}")
# Dictionary operations
student_scores = {}
student_scores["Alice"] = 95
student_scores["Bob"] = 87
student_scores["Charlie"] = 92
student_scores["David"] = 78
print(f"Student scores: {student_scores}")
print(f"Bob's score: {student_scores.get('Bob')}")
print(f"Eve's score (with default): {student_scores.get('Eve', 'Not found')}")
# Implement a simple hash table
class HashTable:
def __init__(self, size=10):
self.size = size
self.table = [[] for _ in range(size)]
def _hash(self, key):
return hash(key) % self.size
def insert(self, key, value):
hash_index = self._hash(key)
for i, (k, v) in enumerate(self.table[hash_index]):
if k == key:
self.table[hash_index][i] = (key, value)
return
self.table[hash_index].append((key, value))
def get(self, key, default=None):
hash_index = self._hash(key)
for k, v in self.table[hash_index]:
if k == key:
return v
return default
def remove(self, key):
hash_index = self._hash(key)
for i, (k, v) in enumerate(self.table[hash_index]):
if k == key:
del self.table[hash_index][i]
return True
return False
# Test the hash table
hash_table = HashTable()
hash_table.insert("name", "John")
hash_table.insert("age", 30)
hash_table.insert("city", "New York")
print(f"Name: {hash_table.get('name')}")
print(f"Age: {hash_table.get('age')}")
hash_table.remove("age")
print(f"Age after removal: {hash_table.get('age', 'Not found')}")
```
## Lesson 5: Trees and Binary Search Trees
### Concepts
- Tree terminology and properties
- Binary trees and binary search trees
- Tree traversal algorithms (in-order, pre-order, post-order)
- BST operations (insertion, deletion, search)
- Self-balancing trees (introduction)
### Interactive Elements
- **Visualization**: Tree traversal and BST operations
- **Interactive Tool**: Build and manipulate binary search trees
- **Quiz**: Tree properties and operations
### Coding Exercise
```python
# Implementing a binary search tree
class TreeNode:
def __init__(self, key):
self.key = key
self.left = None
self.right = None
class BinarySearchTree:
def __init__(self):
self.root = None
def insert(self, key):
self.root = self._insert_recursive(self.root, key)
def _insert_recursive(self, root, key):
if root is None:
return TreeNode(key)
if key < root.key:
root.left = self._insert_recursive(root.left, key)
elif key > root.key:
root.right = self._insert_recursive(root.right, key)
return root
def search(self, key):
return self._search_recursive(self.root, key)
def _search_recursive(self, root, key):
if root is None or root.key == key:
return root
if key < root.key:
return self._search_recursive(root.left, key)
return self._search_recursive(root.right, key)
def inorder_traversal(self):
result = []
self._inorder_recursive(self.root, result)
return result
def _inorder_recursive(self, root, result):
if root:
self._inorder_recursive(root.left, result)
result.append(root.key)
self._inorder_recursive(root.right, result)
def preorder_traversal(self):
result = []
self._preorder_recursive(self.root, result)
return result
def _preorder_recursive(self, root, result):
if root:
result.append(root.key)
self._preorder_recursive(root.left, result)
self._preorder_recursive(root.right, result)
def postorder_traversal(self):
result = []
self._postorder_recursive(self.root, result)
return result
def _postorder_recursive(self, root, result):
if root:
self._postorder_recursive(root.left, result)
self._postorder_recursive(root.right, result)
result.append(root.key)
# Test the binary search tree
bst = BinarySearchTree()
keys = [50, 30, 70, 20, 40, 60, 80]
for key in keys:
bst.insert(key)
print(f"In-order traversal: {bst.inorder_traversal()}")
print(f"Pre-order traversal: {bst.preorder_traversal()}")
print(f"Post-order traversal: {bst.postorder_traversal()}")
print(f"Search for 40: {'Found' if bst.search(40) else 'Not found'}")
print(f"Search for 45: {'Found' if bst.search(45) else 'Not found'}")
```
## Lesson 6: Heaps and Priority Queues
### Concepts
- Heap data structure and properties
- Min-heap and max-heap
- Heap operations (insertion, extraction, heapify)
- Priority queue implementation using heaps
- Applications of heaps and priority queues
### Interactive Elements
- **Visualization**: Heap operations and heapify process
- **Interactive Tool**: Build and manipulate heaps
- **Quiz**: Heap properties and operations
### Coding Exercise
```python
# Implementing a min-heap
class MinHeap:
def __init__(self):
self.heap = []
def parent(self, i):
return (i - 1) // 2
def left_child(self, i):
return 2 * i + 1
def right_child(self, i):
return 2 * i + 2
def get_min(self):
if not self.heap:
return None
return self.heap[0]
def insert(self, key):
self.heap.append(key)
i = len(self.heap) - 1
self._sift_up(i)
def extract_min(self):
if not self.heap:
return None
min_val = self.heap[0]
last_val = self.heap.pop()
if self.heap:
self.heap[0] = last_val
self._sift_down(0)
return min_val
def _sift_up(self, i):
parent = self.parent(i)
if i > 0 and self.heap[parent] > self.heap[i]:
self.heap[i], self.heap[parent] = self.heap[parent], self.heap[i]
self._sift_up(parent)
def _sift_down(self, i):
min_index = i
left = self.left_child(i)
right = self.right_child(i)
if left < len(self.heap) and self.heap[left] < self.heap[min_index]:
min_index = left
if right < len(self.heap) and self.heap[right] < self.heap[min_index]:
min_index = right
if i != min_index:
self.heap[i], self.heap[min_index] = self.heap[min_index], self.heap[i]
self._sift_down(min_index)
# Implementing a priority queue using the min-heap
class PriorityQueue:
def __init__(self):
self.heap = MinHeap()
def enqueue(self, priority, item):
self.heap.insert((priority, item))
def dequeue(self):
min_val = self.heap.extract_min()
return min_val[1] if min_val else None
def peek(self):
min_val = self.heap.get_min()
return min_val[1] if min_val else None
# Test the min-heap
min_heap = MinHeap()
elements = [5, 3, 8, 1, 2, 7, 6, 4]
for element in elements:
min_heap.insert(element)
print("Min-heap extraction order:")
while min_heap.heap:
print(min_heap.extract_min(), end=" ")
print()
# Test the priority queue
priority_queue = PriorityQueue()
tasks = [(3, "Complete assignment"), (1, "Fix critical bug"), (2, "Prepare
presentation")]
for priority, task in tasks:
priority_queue.enqueue(priority, task)
print("\nPriority queue tasks in order:")
while True:
task = priority_queue.dequeue()
if not task:
break
print(task)
```
## Module Assessment
### Quiz Questions
1. What is the time complexity of accessing an element in an array by index?
2. When would you use a linked list instead of an array?
3. What is the difference between a stack and a queue?
4. How does a hash table achieve O(1) average time complexity for lookups?
5. What property must be maintained in a binary search tree?
6. What is the time complexity of extracting the minimum element from a min-heap?
7. How does a self-balancing tree differ from a regular binary search tree?
8. What data structure would you use to efficiently find the k-th smallest element
in a stream of numbers?
### Programming Challenge
Implement a LRU (Least Recently Used) Cache with O(1) time complexity for both get
and put operations. The LRU Cache should support the following operations:
- `get(key)`: Get the value of the key if the key exists in the cache, otherwise
return -1
- `put(key, value)`: Set or insert the value if the key is not already present.
When the cache reaches its capacity, it should invalidate the least recently used
item before inserting a new item.
### Project
Design and implement a text editor's undo/redo functionality using appropriate data
structures. The editor should support the following operations:
- Insert text
- Delete text
- Undo the last operation
- Redo the last undone operation
## Additional Resources
- Book chapters: Chapter 4 (Data Structures)
- Interactive visualizations: [VisuAlgo](https://visualgo.net/)
- Practice problems: LeetCode Medium Collection (Linked Lists, Trees, Heaps)
- Video: "Data Structures Easy to Advanced Course" by William Fiset