Graph: A Non-Linear Data Structure
Joy Mukherjee
Graph
• G = (V, E)
• V: Set of vertices
• E: Set of edges
1 2 1 2
0 0
3 5 3 5
4 4
Loop, Parallel Edges and Simple Graph
• A loop is an edge whose endpoints are equal.
• Parallel edges are edges having the same pair of endpoints.
• A simple graph is a graph having no loops or parallel edges.
Simple vs. Non-Simple Graph
2
2
1 1
3 3
5 4 5
4
Degree, Order & Size
• Degree of a vertex v in graph G, (dG(v) or d(v)) = Number of non-
loop edges containing v + 2 X Number of loops containing v.
• The maximum degree is Δ(G)
• The minimum degree is δ(G)
• A graph is regular if Δ(G) = δ(G)
• A graph is k-regular if Δ(G) = δ(G) = k
• Order of a graph G = |V|
• Size of a graph G = |E|
Handshaking Lemma: Undirected Graph
• Theorem: If G is an undirected graph, then ∑ v ∈ V(G)d(v) = 2|E|.
• Proof: Each edge has two endpoints.
It contributes to the degree at each endpoint.
A loop also contributes two to the degree of its endpoint.
Handshaking Lemma: Directed Graph
• Theorem: If G is a directed graph, then ∑ v ∈ V(G) indegree(v) = ∑ v ∈ V(G) outdegree(v)
• Proof: Each directed edge e = (u, v) has two endpoints which is directed from u to v.
The incoming edge towards v contributes to its indegree.
The outgoing edge from u contributes to its outdegree.
For a loop at a vertex u, the edge contributes to its indegree and outdegree.
Representation of Graph
• Adjacency Matrix: Given a graph G=(V, E), we create a matrix M of size |V| X |V|,
where M[i][j] is 1 if there is an edge from vertex i to vertex j; otherwise it is 0.
• O(|V|2).
• For undirected graph (no edge has a direction), M is a symmetric matrix.
1 2 M 0 1 2 3 4 5
0 0 1 0 0 1 0
1 0 0 1 0 0 0
0
2 0 0 0 0 0 1
3 5
3 0 1 1 0 0 0
4 0 0 0 1 0 1
5 0 0 1 0 1 0
4
Representation of Graph
• Adjacency List: Array of Linked Lists, where the Array size is |V|, the linked list
corresponding to a vertex v has all its neighbors. O(|V| + |E|)
1 2
0 1 4
0
1 2
3 5
2 5
3 1 2
4 5 3
4 5 2 4
Undirected & Directed Graphs
Indegree of a Vertex: Number of incoming edges
Degree of a Vertex: Number of neighbors
of the vertex Outdegree of a Vertex: Number of outgoing edges
1 2 1 2
0 0
3 5 3 5
4 4
Weighted Graph
• G = (V, E, W)
• V: Set of vertices
• E: Set of edges
• W: E -> R
5 50
1 2 1 2
3 14
2 10
13 2 20
19 73
0 0
3 5 3 5
13 29
84
24 3 30
4 4
Walk, Trail, Path
• Walk: Sequence of vertices in the graph where adjacent vertices must have an edge
between them. Example: 1 2 5 4 3 2 5 is a walk
• Trail: A walk with no repeated edge. Example: 1 2 5 4 3 2 4 is a trail. In trail, vertex can be
repeated.
• Path: A trail/walk with no repeated vertex. Example: 1 2 5 4 3 is a path. If no vertex is
repeated, then edges can’t be repeated.
1 2
0
3 5
4
Connected Graph
• An undirected graph is called a connected graph if there is a path between every
pair of vertices.
1 2 6 7
0
3 5 8
4
Graph Traversal Algorithms
• Given a vertex s (source vertex) in a graph G=(V, E), how do we
systematically explore the other vertices V – {s} in G?
• Breadth-First Search
• Depth-First Search
Breadth First Traversal/Search (BFS)
• Input: G=(V, E)
• Choose a vertex s, and explore its neighbors first, subsequently exploring the
neighbors of the neighbors.
• For an undirected unweighted graph G, BFS gives shortest path from s to all other
reachable vertices of the graph G.
s
If there is a directed edge
from a node to its ancestor in
the BFS tree, then cycle is
s1 s2 s3 detected.
s, s1, s2, s3, s11, s12, s31
s11 s12 s31
Data Structure
• For each node u, we maintain three variables
1. Distance of u from the source s: dist
2. Parent of u: parent
3. Is u already visited during BFS: visited
• Linear Queue
BFS Algorithm
BFS(G, s)
1. for each vertex u in V – {s} dist[u] = inf parent[u] = NULL visited[u] = 0
2. dist[s] = 0 parent[s] = NULL visited[s] = 1
3. Initialize the Queue Q
4. enqueue(Q, s)
5. while (Q is not empty) {
a. u = dequeue(Q)
b. for each v in Adj[u]
if(visited[v] == 0)
i. visited[v] = 1
ii. dist[v] = dist[u] + 1
iii. parent[v] = u
iv. enqueue(Q, v)
}
Breadth First Traversal/Search (BFS)
1 2
Implementation: Linear Queue
0 |2 1 4 | 3 5
0
Queue
3 5
Dist 0 1 1 2 1 2
Parent N 0 0 4 0 4
4
Visited 1 1 1 1 1 1
BFS gives us a BFS tree rooted at s, the
green edges are called tree edges, and
the red edges are called non-tree edges.
Application: BFS
• Shortest Path from a given vertex in an unweighted graph.
• Cycle detection in an undirected graph: During BFS, if we encounter a vertex v
that is already visited and v is not a parent of the current vertex, then “a cycle is
detected”.
• Detecting whether a graph is bipartite or not.
• A graph G= (V, E) is called a bipartite graph, if its vertices can be partitioned into
two sets A and B such that A U B = V and A ∩ B =
NULL, and all the edges are between A and B.
• Bipartite graph does not have an odd cycle.
• Bipartite graph is 2-colourable.
Check for Bipartite Graph
A = All vertices that are at even
1 2 distance from the source
B = All vertices that are at odd distance
from the source
A = {0, 2, 5}
0
B = {1, 3, 4}
3 5
During colouring of neighbours of v, if
a neighbour of v has already coloured
with the same colour as of v, then the
graph is not bipartite.
4
Check for Bipartite Graph
1 2
Odd Cycle
Detected
0
3 5
4
Depth First Traversal/Search (DFS)
• Given a graph G = (V, E), and a source vertex s, we explore s first, followed by any
one neighbour v of s, and we do it recursively from v unless we encounter a dead
end, where we start backtracking to explore the rest of the vertices not yet
explored.
s1 s2 s3
s11 s12 s31
Data Structure
• For each node u, we maintain two variables
1. Parent of u: parent
2. Is u already visited during DFS: visited
• Stack
DFS Algorithm
DFS(G) DFS_VISIT(G, u)
1. for each vertex u in V 1. visited[u] = 1
parent[u] = NULL 2. disc[u] = time = time+1
visited[u] = 0
3. for each v in Adj[u]
2. time = 0 a. if(visited[v] == 0)
3. for each vertex u in V i. parent[v] = u
ii. DFS_VISIT(G, v)
if(visited[u] == 0)
DFS_VISIT(G, u) 4. finish[u] = time = time + 1
Depth First Traversal/Search (DFS)
1 2
021435
Implementation: Stack
0 Parent Null 0 1 4 0 3
3 5 Visited 1 1 1 1 1 1
Start Time 1 8 9 3 2 4
Finish Time 12 11 10 6 7 5
4
Stack
Depth First Traversal/Search (DFS)
2/5 3/4 0124356
1 2
Stack
V
9/14 10/13
0
3 5
1/8
P
Discovery time/Finish Time
4 6 Discovery time: The node found for the first time
11/12
6/7 Finish time: The time at which the backtracking happens
DFS gives us a DFS forest, the green edges are called tree edges, the red edges are called back
edges, the blue edges are called forward edges, and the orange edges are called cross edges.
DFS on Directed Graph
2/5 3/4 • Tree Edge (u, v) :
1. u is parent of v
1 2 2. d[u] < d[v] < f[v] < f[u]
• Forward Edge(u, v):
1. u is not the parent of v
9/14 10/13
0 2. d[u] < d[v] < f[v] < f[u]
3 5
1/8 • Back Edge (u, v): [Cycle]
1. d[v] < d[u] < f[u] < f[v]
4 6 • Cross Edge (u, v):
11/12
6/7 1. (d[u], f[u]) is non-overlapping
with (d[v], f[v])
2. d[v] < f[v] < d[u] < f[u]
DFS on Undirected Graph
2/13 3/4 • Tree Edge (u, v) :
1 2 1. u is parent of v
2. d[u] < d[v] < f[v] < f[u]
5/12 6/11
0
3 5 • Back Edge (u, v): [Cycle]
1/14 1. d[v] < d[u] < f[u] < f[v]
4 6
7/8
9/10
Application: DFS
• Cycle detection in a graph
• Undirected Graph: If it is a tree, then no cycle is detected. Check for back edges. If
exists, then a cycle is detected.
• Directed Graph: If a back edge is found, then a cycle is detected.