0% found this document useful (0 votes)

32 views13 pages

Quick Sort and Linked List Operations

The document discusses the quicksort algorithm, highlighting its advantages over merge sort, such as in-place sorting and average-case efficiency of O(n log n). It also covers the importance of choosing a good pivot and the implications of using randomized pivots to avoid worst-case scenarios. Additionally, it compares lists and arrays in Python, explaining their implementations and performance characteristics.

Uploaded by

Harshdeep Sharma

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

32 views13 pages

Quick Sort and Linked List Operations

Uploaded by

Harshdeep Sharma

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

Week 3

QUICK SORT

Shortcomings of merge sort

Merge needs to create a new list to hold the merged elements

No obvious way to efficiently merge two lists in place

Extra storage can be costly
Inherently recursive

Recursive calls and returns are expensive

Merging happens because elements in the left half need to move to the right half and vice
versa

Consider an input of the form [0,2,4,6,1,3,5,9]

Can we divide the list so that everything on the right?

No need to merge!

Divide and conquer without merging

Suppose the median L is m

Move all values ≤ m to left half of L

Right half has values > m

Recursively sort left and right halves

L is now sorted, no merge!

Recurrence: T(n) = 2T(n/2) - n

Rearrange in a single pass, time O(n)

So T(n) is O(nlogn)

How do we find the median?

Sort and pick up the middle element

But our aim is to sort the list!
Instead pick some value in L - pivot

Split L with respect to the pivot element

Quicksort [C.A.R Hoare]

Choose a pivot element

Typically the first element in the array

Partition L into lower and upper parts with respect to the pivot
Move the pivot between the lower and upper partition
Recursively sort the two partitions

High level view of quicksort

Input list

Identify pivot
Mark lower elements and upper elements
Rearrange the elements as lower-pivot-upper

Recursively sort the lower and upper partitions

Partitioning

Scan the list from left to right

Four segments: Pivot , Lower , Upper , Unclassified
Examine the first unclassified element

If it is larger than the pivot, extend Upper to include this element

If it is less than or equal to the pivot, exchange with the first element in Upper . This
extends Lower and shifts Upper by one position.

Pivot is always the first element

Maintain two indices to mark the end of the Lower and Upper segments
After partitioning, exchange the pivot with the last element of the Lower segment.

def quicksort(L, l, r): # Sort L[l:r]

if (r - l <= 1):
return L
(pivot, lower, upper) = (L[l], l + 1, l + 1)
for i in range(l+1,r):
if L[i] > pivot: # Extend upper segment
upper = upper + 1
else: # Exchange L[i] with start of upper segment
(L[i], L[lower]) = (L[lower], L[i])
# Shift both segments
(lower, upper) = (lower + 1, upper + 1)
# Move pivot between lower and upper
(L[i] L[lower-1]) = (L[lower-1] L[l])
(L[i], L[lower 1]) = (L[lower 1], L[l])
lower = lower - 1
# Recursive calls
quicksort(L,l,lower)
quicksort(L, lower+1,upper)
return L

Summary

Quicksort uses divide and conquer, like merge sort.

By partitioning the list carefully, we avoid a merge step

This allows an in place sort

We can also provide an iterative implementation to avoid the cost of recursive calls
The partitioning strategy described is not the only one used in the literature

Can build the lower and upper segments from opposite ends and meet in the middle
Need to analyze the complexity of quicksort

ANALYSIS OF QUICK SORT

Analysis

Partitioning wrt the pivot takes time O(n)

If the pivot is the median

T(n) = 2T(n/2) + n
T(n) is O(nlogn)

Worst case? Pivot is maximum or minimum

Partitions are of size 0, n - 1

T(n) = T(n - 1) + n
T(n) = n + (n - 1) + ... + 1
T(n) is O(n2 )

Already sorted array, worst case!

However, average case is O(nlogn)

Sorting is a rare situation where we can compute this

Values don't matter, only relative order is important

Analyze behaviour over permutations of {1,2,...,n}
Each input permutation is equally likely
Expected running time is O(nlogn)

Randomizaton
Any fixed choice of pivot allows us to construct worst case input
Instead, choose pivot position randomly at each step
Expected run time is again O(nlogn)

Iterative quicksort

Recursive calls work on disjoint segments

No recombination of results is required

Can explicitly keep track of left and right endpoints of each segment to be sorted.

Quicksort in practice

In practice, quicksort is very fast

Very often the default algorithm used for in-built sort functions

Sorting a column in a spreadsheet

Library sort function in a programming language

Summary

The worst case complexity of quicksort is O(n2 )

However, the average case is O(nlogn)
Randomly choosing the pivot is a good strategy to beat worst case inputs
Quicksort works in-place and can be impleted iteratively
Very fast in practice, and often used for built-in sorting functions

Good example of a situation when the worst case upper bound is pessimistic

CONCLUDING REMARKS ON SORTING ALGORITHMS

Stable Sorting

Often list values are tuples

Rows from a table, with multiple columns / attributes

A list of students, each student entry has a roll number, names, marks, ...
Suppose students have already been sorted by roll number
If we now sort by name, will all students with the same name remain in sorted order with
respect to roll number?
Stability of sorting is crucial in many applications
Sorting on column B should not disturb sorting on column A

The quicksort implementation we described is not stable

Swapping values while partitioning can disturb existing sorted order

Merge Sort is stable if we merge carefully

Do not allow elements from the right to overtake elements on the left
While merging, prefer the left list while breaking ties

Other criteria

Minimizing data movement

Imagine each element is a heavy carton

Reduce the effort of moving values around

Best sorting algorithm?

Quicksort is often the algorithm of choice, despite O(n2 ) worst case

Merge sort is typically used for "external" sorting

Database tables taht are too large to store in memory all at once
Retrieve in parts from the disk and write back
Other O(nlogn) algorithms exist - heapsort
Sometimes hybrid strategies are used

Use divide and conquer for large n

Switch to insertion sort when n becomes small (e.g., n < 16)

DIFFERENCE BETWEEN LISTS AND ARRAYS

Sequences

Two basic ways of storing a seequence of values

Lists
Arrays
Lists

Flexible length
Easy to modify the structure
Values are scattered in memory
Arrays

Fixed size
Allocate a contigous block of memory
Supports random access

Lists
Typically a sequence of nodes
Each node contains a value and points to the next node in the sequence

"Linked" list
Easy to modify

Inserting and deletion is easy via local "plumbing"

Flexible size
Need to follow links to access A[i]

Takes time O(i)

Arrays

Fixed size, declared in advance

Allocate a contiguous block of memory

n times the storage for a single value

"Random" access

Compute offset to A[i] from A[0]

Accessing A[i] takes constant time, independent of i
Inserting and deleting elements is expensive

Expanding and contracting requires moving O(n) elements in the worst

Operations

Exchange A[i] and A[j]

Constant time for arrays

O(n) for lists

Delete A[i] , insert v after A[i]

Constant time for lists if we are already at A[i]

O(n) for arrays

Need to keep implementation in mind when analyzing data structures

For instance, can we use binary search to insert in a sorted sequence?

Either search is slow, or insertion is slow, still O(n)

Summary

Sequences can be stored as lists or arrays

Lists are flexible but accessing an element is O(n)
Arrays support random access but are difficult to expand, contract
Algorithm analysis needs to take into account the underlying implementation.
In Python:

Is the built-in type in Python really a "linked" list?

Numpy library provides arrays - are these faster than lists?

DESIGNING A FLEXIBLE LIST AND OPERATIONS ON THE SAME

Implementing lists in Python

Python class Node

A list is a sequence of nodes

self.value is the stored value

self.next points in the next node

Empty list?

self.value is None

Creating lists

l1 = Node() - empty list

l2 = Node(5) - singleton list
l1.isempty() == True
l2.isempty() == False

class Node:
def __init__(self, v = None):
self.value = v
self.next = None
return
def isempty(self):
if self.value == None:
return True
else:
return False

Appending to a list

Add v to the end of list l

If l is empty, update l.value from None
If at last value, l.next is None

Point next at new node with value v

Otherwise, recursively append to rest of list

def append(self, v):

# append, recursive
if self.isempty():
self.value = v
elif self.next == None:
self.next = Node(v)
else:
self.next.append(v)
return

Iterative implementation

If empty, replace l.value by v

Loop through l.next to end of list
Add v to the end of the list

def appendi(self, v):

# append, iterative
if self.isempty():
self.value = v
return

temp = self
while temp.next != None:
temp = temp.next

temp.next = Node(v)
return

Insert at the start of the list

Want to insert v at head

Create a new node with v
Cannot change where the head points!

Exchange the values v0 , v

Make new node point to head.next
Make head.next point to new node

def insert(self, v):

if self.isempty():
self.value = v
return

newnode = Node(v)

# Exchange values in self and newnode

(self.value, newnode.value) = (newnode.value, self.value)

# Switch links
(self.next, newnode.next) = (newnode, self.next)

return

Delete a value v

Remove first occurence of v

Scan list for first v - look ahead at next node
If next node value is v, bypass it
Cannot bypass the first node in the list

Instead, copy the second node value to head

Bypass second node
Recursive implementation

def delete(self, v):

# delete, recursive
if self.isempty():
return
if self.value == v:
self.value = None
if self.next != None:
self.value = self.next.value
self.next = self.next.next
return
else:
if self.next != None:
self.next.delete(v):
if self.next.value == None:
self.next = None
return

Summary

Use a linked list of nodes to implement a flexible list

Append is easy
Insert requires some care, cannot change where the head points to
When deleting, look one step ahead to bypass the node to be deleted

IMPLEMENTATION OF LISTS IN PYTHON

Lists in Python

Python lists are not implemented as flexible linked lists

Underlying interpretation maps the list to an array

Assign a fixed block when you create a list

Double the size if the list overflows the array
Keep track of the last position of the list in the array

l.append() and l.pop() are constant time, amortised - O(1)

Insertion/deletion require time O(n)
Effectively, Python lists behave more like arrays than lists

Arrays v/s Lists in Python

Arrays are useful for representing matrices

In list notation, these are nested lists
0 1
( )
0 1

that is [[0,1], [1,0]]

Need to be careful when initializing a multidimensional list

zerolist = [0,0,0]
zeromatrix = [zerolist, zerolist, zerolist]

zeromatrix[1][1] = 1
print(zeromatrix)

[[0, 1, 0], [0, 1, 0], [0, 1, 0]]

Mutuability aliases different values

Instead use list comprehension

zeromatrix = [ [0 for i in range(3)] for j in range(3) ]

Numpy Arrays

The Numpy library provides arrays as a basic type

import numpy as np
zeromatrix = np.zeros(shape = (3,3))

Can create an array from any sequence type

newarray = np.array([[0,1],[1,0]])

arange is the equivalent of range for lists

row2 = np.arange(5)

Can operate on amtrix as a whole

C = 3*A + B
C = np.matmul(A,B)
same as C[i,j] = A[i.k].B[k,j]
Very useful for data science

Summary

Python lists are not implemented as flexible linked structures

Instead, allocate an array and double space as needed
Append is cheap, insert is expensive
Arrays can be represented as multidimensional lists,but need to be careful about
mutability, aliasing
Numpy arrays are easier to use

IMPLEMENTATION OF DICTIONARY IN PYTHON

Dictionary

An array/list allows access through positional indices

A dictionary allows access through arbitrary keys

A collection of key-value pairs

Random access - access time is the same for all keys

Implementing a dictionary

The underlying storage is an array

Given an offset i , find A[i] in constant time

Keys have to be mapped to {0,1,..,n-1}

Given an key k , convert it to an offset i

Hash function

h : S -> X maps a set of values S to a small range of integers X = {0,1,...,n-1}

Typically |X| << |S |, so there will be collisions, h(s) = h(s′ ) , s ≠ s′
A good hash function will minimize collisions
SHA-256 is an industry standard hashing function whose range is 256 bits

Use to hash large files - avoid uploading to cloud storage

Hash Table

An array A of size n combined with a hash function h

h maps keys to {0,1,...,n-1}
Ideally, when we create an entry for key k , A[h(k)] will be unused

What if there is already a value at that location?

Dealing with collisions

Open addressing (closed hashing)

Probe a sequence of alternate slots in the same array

Open hashing
Each slot in the array points to a list of values
Insert into the list for the given slot
Dictionary keys in Python must be immutable

If value changes, hash also changes!

Summary

A dictionary is implemented as a hash table

An array plus a hash function

Creating a good hash function is important
Need a strategy to deal with collisions

Open addressing/closed hashing - probe for free space in the array

Open hashing - each slot in the hash table points to a list of key-value pairs
many heuristics/optimizations possible for dea

PDSA Week 3
No ratings yet
PDSA Week 3
33 pages
Sorting and Searching II
No ratings yet
Sorting and Searching II
34 pages
DSD Unit 3 Sorting and Searching
No ratings yet
DSD Unit 3 Sorting and Searching
36 pages
CPT212 07 Sorting - Efficient
No ratings yet
CPT212 07 Sorting - Efficient
25 pages
Sorting Algorithms for CS Students
No ratings yet
Sorting Algorithms for CS Students
34 pages
L11 Sorting&Searching
No ratings yet
L11 Sorting&Searching
61 pages
Quick Sort
No ratings yet
Quick Sort
19 pages
Dsa Unit3
No ratings yet
Dsa Unit3
106 pages
Sorting
No ratings yet
Sorting
54 pages
Sorting and Hashing
100% (1)
Sorting and Hashing
83 pages
Unit 1
No ratings yet
Unit 1
116 pages
Compiled By: Dr. Mohammad Omar Alhawarat: Sorting
No ratings yet
Compiled By: Dr. Mohammad Omar Alhawarat: Sorting
52 pages
Week 6
No ratings yet
Week 6
39 pages
Lecture 7 - Sorting
No ratings yet
Lecture 7 - Sorting
38 pages
Lec 3 Sorting Analysis
No ratings yet
Lec 3 Sorting Analysis
9 pages
DS
No ratings yet
DS
13 pages
07 Sort2
No ratings yet
07 Sort2
87 pages
Topic: Searching and Sorting Algorithm: Joy of Python Using Cloud Computing
No ratings yet
Topic: Searching and Sorting Algorithm: Joy of Python Using Cloud Computing
17 pages
Quicksort Algorithm Explained
No ratings yet
Quicksort Algorithm Explained
18 pages
CS3353 Unit5
No ratings yet
CS3353 Unit5
21 pages
Chapter 5 Sorting Algorithms
No ratings yet
Chapter 5 Sorting Algorithms
37 pages
Merge and Quick
100% (1)
Merge and Quick
23 pages
Week 02 (Complexity of Sorting Algorithms)
No ratings yet
Week 02 (Complexity of Sorting Algorithms)
62 pages
Algorithm Design & Analysis Guide
No ratings yet
Algorithm Design & Analysis Guide
33 pages
Python Week4 Lecture5 Handout
No ratings yet
Python Week4 Lecture5 Handout
9 pages
DSA - Chapter Eight
No ratings yet
DSA - Chapter Eight
33 pages
Sorting Techniques
No ratings yet
Sorting Techniques
6 pages
Top 6 Python Sorting Algorithms
No ratings yet
Top 6 Python Sorting Algorithms
6 pages
L9 Sorting
No ratings yet
L9 Sorting
50 pages
Week-4 Sorting, Dictionaries and Functions
No ratings yet
Week-4 Sorting, Dictionaries and Functions
110 pages
Quicksort Algorithm Explained
No ratings yet
Quicksort Algorithm Explained
22 pages
Dsa Ass 2 by Syed
No ratings yet
Dsa Ass 2 by Syed
11 pages
Module 6 Search Sort Hashing
No ratings yet
Module 6 Search Sort Hashing
62 pages
Lab 8
No ratings yet
Lab 8
8 pages
Quick-Sort Algorithm
No ratings yet
Quick-Sort Algorithm
53 pages
Data Structures & Quick Sort Guide
No ratings yet
Data Structures & Quick Sort Guide
13 pages
Mergesort & Quicksort Explained
No ratings yet
Mergesort & Quicksort Explained
54 pages
Python Sorting Techniques Lab
No ratings yet
Python Sorting Techniques Lab
11 pages
Merge, Quick, Radix Sort
No ratings yet
Merge, Quick, Radix Sort
9 pages
11 Sorting
No ratings yet
11 Sorting
103 pages
Sorting UNIT 5
No ratings yet
Sorting UNIT 5
66 pages
Mergesort: Merge Sort Visualizer
No ratings yet
Mergesort: Merge Sort Visualizer
90 pages
Dsa CH 2
No ratings yet
Dsa CH 2
50 pages
Lec32 35
No ratings yet
Lec32 35
40 pages
Efficient Sorting Algorithms Guide
No ratings yet
Efficient Sorting Algorithms Guide
66 pages
DAA Module 4
No ratings yet
DAA Module 4
11 pages
Quick Sort
No ratings yet
Quick Sort
4 pages
Notes 03 Sorting PDF
No ratings yet
Notes 03 Sorting PDF
126 pages
C Programming and Data Structures 41394658 2025 06-20-08 20
No ratings yet
C Programming and Data Structures 41394658 2025 06-20-08 20
35 pages
s4 Quick Sort
No ratings yet
s4 Quick Sort
27 pages
UNIT V Data Structures OU
No ratings yet
UNIT V Data Structures OU
42 pages
Cse Daa Lab Manual
No ratings yet
Cse Daa Lab Manual
29 pages
Lecture # 23 24 Sorting
No ratings yet
Lecture # 23 24 Sorting
32 pages
Sortings
No ratings yet
Sortings
92 pages
DAA03 Quick Sort Stressen
No ratings yet
DAA03 Quick Sort Stressen
35 pages
Design & Analysis of Algorithms: Submitted To Prof. Hashim Javed Submitted by Affifa ID: 30 21-Aug-2019
No ratings yet
Design & Analysis of Algorithms: Submitted To Prof. Hashim Javed Submitted by Affifa ID: 30 21-Aug-2019
11 pages
Algorithms Project Report
No ratings yet
Algorithms Project Report
7 pages
Sorting Algorithms for Students
No ratings yet
Sorting Algorithms for Students
19 pages
Introduction, Fundamental File Structure Concepts, Managing Files of Records
No ratings yet
Introduction, Fundamental File Structure Concepts, Managing Files of Records
5 pages
Daa Detention Work
No ratings yet
Daa Detention Work
7 pages
Linear-Time Sorting Lecture
No ratings yet
Linear-Time Sorting Lecture
5 pages
Data Structures & Algorithms Course
No ratings yet
Data Structures & Algorithms Course
8 pages
DSU Preparation Guide
No ratings yet
DSU Preparation Guide
3 pages
QB Ada Sem-5 Cse 2024 Odd
No ratings yet
QB Ada Sem-5 Cse 2024 Odd
36 pages
Unit-5 Query Processing and Optimization
No ratings yet
Unit-5 Query Processing and Optimization
40 pages
Insertion Sort Guide with Java Code
No ratings yet
Insertion Sort Guide with Java Code
5 pages
Inverted File
No ratings yet
Inverted File
20 pages
Computer Science Distilled
No ratings yet
Computer Science Distilled
110 pages
CS8381 - Data Structures Laboratory Manual - by LearnEngineering - in
No ratings yet
CS8381 - Data Structures Laboratory Manual - by LearnEngineering - in
39 pages
Syllabus CSC 202
No ratings yet
Syllabus CSC 202
7 pages
Data Structures Unit-V Lecture Notes
No ratings yet
Data Structures Unit-V Lecture Notes
55 pages
Python Practical Index: SR No Aim Date Marks /10 Sign Practical Set - 1
No ratings yet
Python Practical Index: SR No Aim Date Marks /10 Sign Practical Set - 1
5 pages
PASCAL Plus Data Structures, Algorithms, and Advanced Programming PDF
50% (2)
PASCAL Plus Data Structures, Algorithms, and Advanced Programming PDF
657 pages
Algorithm Up To 7 Lectures
No ratings yet
Algorithm Up To 7 Lectures
13 pages
Compititive Programming - I Manual
No ratings yet
Compititive Programming - I Manual
79 pages
Sorting Algorithms Overview
No ratings yet
Sorting Algorithms Overview
20 pages
Selection Sort
50% (2)
Selection Sort
16 pages
IT-209 Data Structures
No ratings yet
IT-209 Data Structures
4 pages
FDS - Unit 3 - MCQ
No ratings yet
FDS - Unit 3 - MCQ
8 pages
UGRD ITE6201 2016S Data Structure Algorithm
No ratings yet
UGRD ITE6201 2016S Data Structure Algorithm
7 pages
04 MergeSort Inversions
No ratings yet
04 MergeSort Inversions
24 pages
Data Structure Syllabus
No ratings yet
Data Structure Syllabus
1 page
DAA-Complete Buddha Series Unit-1 To 5
No ratings yet
DAA-Complete Buddha Series Unit-1 To 5
126 pages
CSCI 311 Project 1 Ver 05
No ratings yet
CSCI 311 Project 1 Ver 05
5 pages
Bengaluru City University: As Per SEP 2024)
No ratings yet
Bengaluru City University: As Per SEP 2024)
24 pages
Data Structures Lab - 3
No ratings yet
Data Structures Lab - 3
3 pages
Lab File - DAA
No ratings yet
Lab File - DAA
49 pages
Interview Preparation Kit
No ratings yet
Interview Preparation Kit
132 pages