
Lecture 5

Comparison-based Lower Bounds for Sorting


5.1 Overview

In this lecture we discuss the notion of lower bounds, in particular for the problem of sorting. We show that any deterministic comparison-based sorting algorithm must take Ω(n log n) time to sort an array of n elements in the worst case. We then extend this result to average-case performance, and to randomized algorithms. In the process, we introduce the 2-player game view of algorithm design and analysis.

5.2 Sorting lower bounds

So far we have been focusing on the question: given some problem X, can we construct an algorithm that runs in time O(f(n)) on inputs of size n? This is often called an upper bound problem because we are determining an upper bound on the inherent difficulty of problem X, and our goal here is to make f(n) as small as possible. In this lecture we examine the lower bound problem. Here, the goal is to prove that any algorithm must take time Ω(g(n)) to solve the problem, where now our goal is to do this for g(n) as large as possible. Lower bounds help us understand how close we are to the best possible solution to some problem: e.g., if we have an algorithm that runs in time O(n log² n) and a lower bound of Ω(n log n), then we have a log(n) gap: the maximum possible savings we could hope to achieve by improving our algorithm.

Often, we will prove lower bounds in restricted models of computation that specify what types of operations may be performed on the input and at what cost. So, a lower bound in such a model means that if we want to do better, we would need somehow to do something outside the model.

Today we consider the class of comparison-based sorting algorithms. These are sorting algorithms that only operate on the input array by comparing pairs of elements and moving elements around based on the results of these comparisons. In particular, let us make the following definition.

Definition 5.1 A comparison-based sorting algorithm takes as input an array [a1, a2, ..., an] of n items, and can only gain information about the items by comparing pairs of them. Each comparison ("is ai > aj?") returns YES or NO and counts as 1 time-step. The algorithm may also, for free, reorder items based on the results of comparisons made. In the end, the algorithm must output a permutation of the input in which all items are in sorted order.

For instance, Quicksort, Mergesort, and Insertion-sort are all comparison-based sorting algorithms. What we will show is the following theorem.

Theorem 5.1 Any deterministic comparison-based sorting algorithm must perform Ω(n log n) comparisons to sort n elements in the worst case. Specifically, for any deterministic comparison-based sorting algorithm A, for all n ≥ 2 there exists an input I of size n such that A makes at least log₂(n!) = Ω(n log n) comparisons to sort I.

To prove this theorem, we cannot assume the sorting algorithm is going to necessarily choose a pivot as in Quicksort, or split the input as in Mergesort; we need to somehow analyze any possible (comparison-based) algorithm that might exist. The way we will do this is by showing that in order to sort its input, the sorting algorithm is implicitly playing a game of "20 questions" with the input, trying to figure out in what order its elements were given.

Proof: Since the algorithm must output a permutation of its input, we can assume the input elements are {1, 2, ..., n} but in some unknown order. The key to the argument is that (a) two different input orders cannot both be correctly sorted by the same permutation, and (b) there are n! different orders the input elements could be in. Now, suppose that two different initial orderings of these numbers, I1 and I2, are consistent with all the comparisons the sorting algorithm has made so far. Then, the sorting algorithm cannot yet be done, since any permutation it outputs at this point cannot be correct for both I1 and I2 (by observation (a) above). So, the sorting algorithm needs at least implicitly to have pinned down which ordering of {1, ..., n} was given in the input.

Let S be the set of input orderings consistent with all answers to comparisons made so far (so, initially, S is the set of all n! possible orderings of the input). We can think of a new comparison as splitting S into two groups: those input orderings for which the answer is YES and those for which the answer is NO. Now, suppose the answer to each comparison is always the one corresponding to the larger group. Then, each comparison cuts down the size of S by at most a factor of 2. Since S initially has size n! and at the end the algorithm must have reduced |S| down to 1, in this case the algorithm will need to make at least log₂(n!) comparisons before it can halt. We can then solve: log₂(n!) = log₂(n) + log₂(n−1) + ... + log₂(2) = Θ(n log n), since the first n/2 terms of this sum are each at least log₂(n/2).

Let's do an example with n = 3. In this case, there are six possible input orderings: {123}, {132}, {213}, {231}, {312}, {321}. Suppose the sorting algorithm first compares A[0] with A[1]. If the answer is that A[1] > A[0], then we have narrowed down the input to the three possibilities: {123}, {132}, {231}. Suppose the next comparison is between A[1] and A[2]. In this case, the most popular answer is that A[1] > A[2], which removes just one ordering, leaving us with: {132}, {231}. It now takes one more comparison to finally isolate the input ordering.
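To get a feel for the size of this bound, here is a quick numeric check (a Python sketch added for illustration; it is not part of the original notes), comparing log₂(n!) against n·log₂(n) for a few values of n. It uses the log-gamma function so that large n stays cheap to handle:

    import math

    # log2(n!) computed as lgamma(n+1)/ln(2), so huge factorials are never formed.
    for n in (10, 100, 1000, 10**6):
        log2_fact = math.lgamma(n + 1) / math.log(2)
        print(f"n={n:>8}   log2(n!) ~ {log2_fact:>12.0f}   n*log2(n) ~ {n * math.log2(n):>12.0f}")

The two columns agree up to lower-order terms, which is exactly the content of log₂(n!) = Θ(n log n).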

Notice that our proof is like a game of "20 Questions" in which the responder doesn't actually decide what he is thinking of until there is only one option left. This is legitimate because we just need to show that there is some input that would cause the algorithm to take a long time. In other words, since the sorting algorithm is deterministic, we can take that final remaining option, rerun the algorithm on that specific input, and the algorithm will make the same exact sequence of operations.

You can also think of the above proof in terms of the number of possible outputs of the sorting algorithm. Any comparison-based sorting algorithm can be thought of as producing a permutation as its output (the permutation that, when applied to the input, produces a sorted array). There are n! permutations, and only one of them can be correct for any given input. Each comparison breaks the set of possible outputs into two classes, and the response to the question says which class the correct output is in. By always giving the answer corresponding to the larger class, an adversary can force the algorithm to make at least log₂(n!) comparisons.
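The adversary in the proof is easy to run against an actual sorting algorithm. Below is a minimal Python sketch (an illustration added here, not from the notes; the function name and the use of Python's built-in sort as the victim are my choices). It maintains the set S of input orderings consistent with all answers so far, answers each comparison with whichever side of the split is larger, and counts how many comparisons the sort is forced to make. Brute-forcing S only scales to small n:

    import math
    from functools import cmp_to_key
    from itertools import permutations

    def adversary_comparisons(n):
        """Comparisons the adversary forces out of Python's built-in sort."""
        # S = input orderings consistent with all answers so far;
        # p[i] is the rank of the item at position i of the input.
        S = set(permutations(range(n)))
        count = 0

        def compare(i, j):
            nonlocal S, count
            count += 1
            yes = {p for p in S if p[i] > p[j]}   # orderings where a_i > a_j
            no = S - yes
            # Answer with the larger side, so |S| shrinks by at most a factor of 2.
            S, answer = (yes, 1) if len(yes) >= len(no) else (no, -1)
            return answer

        sorted(range(n), key=cmp_to_key(compare))   # sort the positions 0..n-1
        return count

    for n in range(2, 8):
        print(n, adversary_comparisons(n), math.ceil(math.log2(math.factorial(n))))

For every n, the forced count is at least ⌈log₂(n!)⌉, as the proof guarantees; any other comparison sort can be plugged in for the built-in one.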

5.3 Average-case lower bounds

In fact, we can generalize the above theorem to show that any comparison-based sorting algorithm must take Ω(n log n) time on average, not just in the worst case.

Theorem 5.2 For any deterministic comparison-based sorting algorithm, the average-case number of comparisons (the number of comparisons on average on a randomly chosen input permutation) is at least ⌊log₂(n!)⌋.

Proof: Let's build out the entire decision tree: the tree we get by looking at all possible series of answers that one might get from some ordering of the input. By the previous argument, each leaf of this tree corresponds to a single input permutation (we can't have two permutations at the same leaf, else the algorithm would not be finished). The depth of a leaf is the number of comparisons performed by the sorting algorithm on that input. If the tree is completely balanced, then each leaf is at depth ⌊log₂(n!)⌋ or ⌈log₂(n!)⌉ and we are done.¹

To prove the theorem, we just need to show that out of all binary trees on a given number of leaves, the one that minimizes the average depth of the leaves is a completely balanced tree. This is not too hard to see: given some unbalanced tree, we take two sibling leaves at largest depth and move them to be children of the leaf of smallest depth. Since the difference between the largest depth and the smallest depth is at least 2 (otherwise the tree would be balanced), this operation reduces the average depth of the leaves. Specifically, if the smaller depth is d and the larger depth is D, we have removed two leaves of depth D and one of depth d, and we have added two leaves of depth d + 1 and one of depth D − 1. Since any unbalanced tree can be modified to have a smaller average depth, such a tree cannot be one that minimizes average depth, and therefore the tree of smallest average depth must in fact be balanced.

In fact, if we are a bit more clever in the proof, we can get rid of the floor in the bound.
¹ Let us define a tree to be completely balanced if the deepest leaf is at most one level deeper than the shallowest leaf. Everything would be easier if we could somehow assume n! was a power of 2....
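As a sanity check on Theorem 5.2, one can instrument a real comparison sort with a counting comparator and average over all n! inputs. The following Python sketch (an added illustration, using Python's built-in sort as the deterministic algorithm) does exactly that for small n:

    import math
    from functools import cmp_to_key
    from itertools import permutations

    def average_comparisons(n):
        """Average comparisons Python's sort makes over all n! input orderings."""
        total = 0
        for perm in permutations(range(n)):
            count = 0
            def compare(x, y):
                nonlocal count
                count += 1
                return (x > y) - (x < y)   # standard 3-way comparison
            sorted(perm, key=cmp_to_key(compare))
            total += count
        return total / math.factorial(n)

    for n in range(2, 8):
        print(n, round(average_comparisons(n), 3),
              math.floor(math.log2(math.factorial(n))))

The observed averages sit at or above ⌊log₂(n!)⌋ for every n, matching the theorem.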

5.4 Lower bounds for randomized algorithms

Theorem 5.3 The above bound holds for randomized algorithms too.

Proof: The argument here is a bit subtle. The first step is to argue that, with respect to counting comparisons, we can think of a randomized algorithm A as a probability distribution over deterministic algorithms. To make things easier, let us only consider algorithms that have some finite upper bound B (like n²) on the number of random coin-flips they make. This means we can think of A as having access to a special random-bit tape with B bits on it, and every time A wants to flip a coin, it just pulls the next bit off that tape. In that case, for any given string s on that tape, the resulting algorithm A_s is deterministic, and we can think of A as just the uniform distribution over all those deterministic algorithms A_s. This means that the expected number of comparisons made by randomized algorithm A on some input I is just

    Σ_s Pr(s) · (running time of A_s on I).

If you recall the definition of expectation, the running time of the randomized algorithm is a random variable and the sequences s correspond to the elementary events. So, the expected running time of the randomized algorithm is just an average over deterministic algorithms. Since each deterministic algorithm has average-case running time at least ⌊log₂(n!)⌋, any average over them must too. Formally, the average-case running time of the randomized algorithm is

    avg over inputs I of [ Σ_s Pr(s) · (running time of A_s on I) ]
        = Σ_s avg_I [ Pr(s) · (running time of A_s on I) ]
        = Σ_s Pr(s) · avg_I (running time of A_s on I)
        ≥ Σ_s Pr(s) · ⌊log₂(n!)⌋
        = ⌊log₂(n!)⌋.
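To make the "distribution over deterministic algorithms" view concrete, here is a small Python sketch (my own illustration, with randomized Quicksort standing in for A; the function names are invented for the example). Fixing a seed s fixes the random tape and yields one deterministic algorithm A_s; averaging comparison counts over both seeds and inputs approximates the double average in the proof:

    import math
    import random
    from itertools import permutations

    def quicksort_comparisons(a, rng):
        """Comparisons made by Quicksort when pivots are drawn from rng."""
        count = 0
        def qs(items):
            nonlocal count
            if len(items) <= 1:
                return items
            pivot = items.pop(rng.randrange(len(items)))
            less, greater = [], []
            for x in items:                # one comparison per remaining element
                count += 1
                (less if x < pivot else greater).append(x)
            return qs(less) + [pivot] + qs(greater)
        qs(list(a))
        return count

    def avg_over_tapes_and_inputs(n, seeds):
        """avg over inputs I of avg over tapes s of (cost of A_s on I)."""
        total = trials = 0
        for perm in permutations(range(n)):      # all inputs I
            for s in seeds:                      # a sample of random tapes s
                total += quicksort_comparisons(perm, random.Random(s))
                trials += 1
        return total / trials

    for n in range(2, 7):
        print(n, round(avg_over_tapes_and_inputs(n, range(20)), 2),
              math.floor(math.log2(math.factorial(n))))

Each fixed seed gives a deterministic A_s whose average over inputs is at least ⌊log₂(n!)⌋ by Theorem 5.2, so no weighted average over seeds can dip below that bound either.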

One way to think of the kinds of bounds we have been proving is to think of a matrix with one row for every possible deterministic comparison-based sorting algorithm (there could be a lot of rows!) and one column for every possible permutation of the n inputs (there are a lot of columns too). Entry (i, j) in this matrix contains the running time of algorithm i on input j. The worst-case deterministic lower bound tells us that for each row i there exists a column j_i such that the entry (i, j_i) is large. The average-case deterministic lower bound tells us that for each row i, the average of the elements in the row is large. The randomized lower bound says: well, since the above statement holds for every row, it must also hold for any weighted average of the rows.

In the language of game theory, one could think of this as a two-player game (much like rock-paper-scissors) between an "algorithm player" who gets to pick a row and an adversarial "input player" who gets to pick a column. Each player makes their choice, and the entry in the matrix is the cost to the algorithm player, which we can think of as how much money the algorithm player has to pay the input player. We have shown that there is a randomized strategy for the input player (namely, pick a column at random) that guarantees it an expected gain of Ω(n log n) no matter what strategy the algorithm player chooses.
