KEMBAR78
Sorting Analysis | PDF | Time Complexity | Array Data Structure
0% found this document useful (0 votes)
180 views24 pages

Sorting Analysis

Uploaded by

api-254089610
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
180 views24 pages

Sorting Analysis

Uploaded by

api-254089610
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 24

CMPSC 122 Lab Report Jake Eden and Max Hamill Page 1 of 24

CMPSC 122 Final Project



1. Introduction

Managing data is a large component of Computer Science. This can take on many forms, but this time we focused on
sorting algorithms. More specifically, we wanted to understand some of the more popular sorting techniques used on integer
arrays. The six sorting algorithms were selection sort, bubble sort, insertion sort, quick sort, merge sort, and tree sort. We
essentially wanted to answer the question: how do the different algorithms perform under different conditions?
These conditions were the array size and the pre-existing state of the elements contained within the array. We measured
the running times of the algorithms in clock ticks (naturally, all tests were done on the same machine), and in order for the
clock tick values to return something greater than 0, we needed to have large array sizes. The sizes ranged from 30,000
elements to 75,000 elements in increments of 5,000 (so a total of 10 sizes). The pre-existing state of the elements within the
arrays refers to whether or not the values were in random order, already sorted in ascending order, or sorted in descending
order. We decided to test the sorting algorithms on more than just random values because we wanted to test the algorithms in
ways that may be inefficient. That then allows us to witness the sorting algorithms best case, worse case, and average case
running times. We can also take those times for each array size, and extrapolate what the asymptotic running times are for
each method under each different condition. A deep analysis of our thoughts on each sorting algorithm can be found in
Section 6.

2. Code and Output for Sorts
a. Changes to BST Class Source Code

In order to create the treeSort method, we relied upon previous code we had that implemented a binary search tree
class. We already had a tree sort method, but it only printed out the results of an in-order traversal to the console without
actually storing the sorted values. Therefore, we needed to modify the method to take in an array as an output parameter.
Additionally, the original treeSort relied upon the inorderTraversal method in the BinaryTree class so it was
inorderTraversal and its helper method helperInorderTraversal that needed the most modification. The modified
treeSort, inorderTraversal, and helperInorder, are shown below:


void treeSort(int data[], int logSize)
// PRE: logSize > 0 and data[0..logSize-1] is initialized.
// POST: data[0..logSize-1] is sorted in ascending order and contains all values from
// data[0..logSize-1] at the time of invocation.
{
BinaryTree tree; // BST of data's values

tree = BinaryTree(data, logSize); // Fill tree
tree.inorderTraversal(data); // Sort tree
}

void BinaryTree::inorderTraversal(int data[])
// PRE: data is allocated to a size of at least the number of nodes in this BST.
// POST: data[0..(logical size of this BST)-1] is sorted in ascending order.
{
int counter; // Insertion index for data

counter = 0;
helperInorder(rootPtr, data, counter); // Call helper method
}














CMPSC 122 Lab Report Jake Eden and Max Hamill Page 2 of 24

void BinaryTree::helperInorder(TreeNodePtr & subRoot, int data[], int & counter)
// PRE: subRoot points to a (sub)tree or is NULL, data is allocated to a size of at least the
// number of nodes in this BST, and counter is the index of the next unsorted slot in
// in data.
// POST: data[0..counter-1] is sorted in ascending order and contains values from the BST
// pointed to by subRoot.
{
// Implicit Base Case: empty (sub)tree
if(subRoot != NULL) // This (sub)tree exists
{
if(subRoot->left != NULL) // Recursive Case if left exists
{
helperInorder(subRoot->left, data, counter);
}

data[counter] = subRoot->key; // Insert subRoot's key
counter++;

if(subRoot->right != NULL) // Recursive Case if right exists
{
helperInorder(subRoot->right, data, counter);
}
}
}

b. Interface File Source Code

In order to best organize our sorting methods, they are all within one module, Sorts.h. In addition to the six sorting
algorithm methods, there are two helper methods: partition and merge. The former is a helper method for quickSort and
the latter is a helper method for mergeSort.

// Programmers: Jake Eden and Max Hamill
// Section: 2
// Lab: Final Project
// Date: April 27, 2014
// Description: This interface contains prototypes for the following sorting techniques:
// selection sort, bubble sort, insertion sort, quick sort, merge sort,
// and tree sort.


void selectionSort(int data[], int bottom);
// PRE: bottom is the last index of data, data[0..bottom] is initialized.
// POST: data[0..bottom] is sorted in ascending order and contains all values from data[0..bottom]
// at the time of invocation.

void bubbleSort(int data[], int bottom);
// PRE: bottom is the last index of data, data[0..bottom] is initialized.
// POST: data[0..bottom] is sorted in ascending order and contains all values from data[0..bottom]
// at the time of invocation.

void insertionSort(int data[], int bottom);
// PRE: bottom is the last index of data, data[0..bottom] is initialized.
// POST: data[0..bottom] is sorted in ascending order and contains all values from data[0..bottom]
// at the time of invocation.

void quickSort(int data[], int low, int high);
// PRE: 0 <= low <= high and data[low..high] is initialized.
// POST: data[low..high] is sorted in ascending order and contains all values from data[low..high]
// at the time of invocation.

int partition(int data[], int low, int high);
// PRE: high > low and data[low..high] is initialized.
// POST: FCTVAL == index of pivot
// data[low..FCTVAL] contains all values from data[low..high] at the time of invocation
// less than or equal to data[high]
// data[FCTVAL..high] contains all values from data[low..high] at the time of invocation
// greater than data[high]




CMPSC 122 Lab Report Jake Eden and Max Hamill Page 3 of 24

void mergeSort(int data[], int left, int right);
// PRE: left <= right, data[left..right] is initialized.
// POST: data[left..right] is sorted in ascending order and contains all values from
// data[left..right] at the time of invocation.

void merge(int data[], int left, int mid, int right);
// PRE: left <= mid <= right, both data[left..mid] and data[mid+1..right] are sorted in
// ascending order.
// POST: data[left..right] contains all elements of data[left..mid] and
// data[mid+1..right], sorted in ascending order and contains all values from
// data[left..right] at the time of invocation.

void treeSort(int data[], int logSize);
// PRE: logSize > 0 and data[0..logSize-1] is initialized.
// POST: data[0..logSize-1] is sorted in ascending order and contains all values from
// data[0..logSize-1] at the time of invocation.

c. Implementation File Source Code

Sorts.cpp is the implementation file of the interface Sorts.h, and it contains the code for our six sorting methods and two
associated helper methods.

// Programmers: Jake Eden and Max Hamill
// Section: 2
// Lab: Final Project
// Date: April 27, 2014
// Description: This program implements the Sorts.h interface and contains the following sorting
// techniques: selection sort, bubble sort, insertion sort, quick sort, merge sort,
// and tree sort.

#include "Sorts.h"
#include "BinaryTree.h"
#include <iostream>
#include <iomanip>
using namespace std;

void selectionSort(int data[], int bottom)
// PRE: bottom is the last index of data, data[0..bottom] is initialized.
// POST: data[0..bottom] is sorted in ascending order and contains all values from data[0..bottom]
// at the time of invocation.
{
int maxIndex; // Index of highest value in unsorted array
int temp; // Temp value used for swapping

for (int n = bottom; n >= 1; n--) // Traverse down size of array until sorted
{
maxIndex = 0; // Assume the first value is greatest initially

for (int i = 1; i <= n; i++) // Traverse unsorted array for highest value
{
if (data[i] > data[maxIndex]) // If the value is greater than the current max
maxIndex = i;
}

temp = data[n];
data[n] = data[maxIndex];
data[maxIndex] = temp;
}
}













CMPSC 122 Lab Report Jake Eden and Max Hamill Page 4 of 24

void bubbleSort(int data[], int bottom)
// PRE: bottom is the last index of data, data[0..bottom] is initialized.
// POST: data[0..bottom] is sorted in ascending order and contains all values from data[0..bottom]
// at the time of invocation.
{
int temp; // Temporary value for when swapping

for(int n = bottom; n > 0; n--) // Traverse data until sorted part
{
for(int i = 0; i < n; i++) // Swap higher values towards bottom
{
if(data[i] > data[i+1]) // If two adjacent values are not sorted
{
temp = data[i]; // Swap the two values
data[i] = data[i+1];
data[i+1] = temp;
}
}
}
}

void insertionSort(int data[], int bottom)
// PRE: bottom is the last index of data, data[0..bottom] is initialized.
// POST: data[0..bottom] is sorted in ascending order and contains all values from data[0..bottom]
// at the time of invocation.
{
int key; // Value to be sorted
int counter; // Counter for inner loop

for(int j = 1; j <= bottom; j++) // Traverse the entire array
{
key = data[j]; // Assign key a value to be sorted
counter = j - 1;

while(counter >= 0 && data[counter] > key) // Traverse sorted part until spot for key
{
data[counter+1] = data[counter]; // Shift values to the right
counter--;
}

data[counter+1] = key; // Insert key into sorted position
}
}

void quickSort(int data[], int low, int high)
// PRE: 0 <= low <= high and data[low..high] is initialized.
// POST: data[low..high] is sorted in ascending order and contains all values from data[low..high]
// at the time of invocation.
{
int pivotPoint; // Index of pivot

// Implict Base Case: low >= high
if(high > low) // Recursive Case: data contains
{ // at least 2 elements
pivotPoint = partition(data, low, high); // Partition data
quickSort(data, low, pivotPoint-1); // Recursive call for left partition
quickSort(data, pivotPoint+1, high); // Recursive call for right partition
}
}















CMPSC 122 Lab Report Jake Eden and Max Hamill Page 5 of 24

int partition(int data[], int low, int high)
// PRE: high > low and data[low..high] is initialized.
// POST: FCTVAL == index of pivot
// data[low..FCTVAL] contains all values from data[low..high] at the time of invocation
// less than or equal to data[high]
// data[FCTVAL..high] contains all values from data[low..high] at the time of invocation
// greater than data[high]
{
int pivot; // Value of pivot
int pivIndex; // Index of pivot point
int temp; // Temp value for swapping

pivot = data[high]; // Last value == pivot
pivIndex = low; // Pivot index will begin at low

for(int i = low; i <= high; i++) // Traverse all elements in data
{
if(data[i] <= pivot) // If element belongs left of pivot
{
temp = data[i]; // Swap element into left partition
data[i] = data[pivIndex];
data[pivIndex] = temp;

pivIndex++;
}
}

return pivIndex-1; // -1 to offset final loop iteration
}

void mergeSort(int data[], int left, int right)
// PRE: left <= right, data[left..right] is initialized.
// POST: data[left..right] is sorted in ascending order and contains all values from
// data[left..right] at the time of invocation.
{
int mid; // Midpoint between two sub-arrays

if(left < right) // Is sub-array > one value?
{
mid = (left + right)/2;

mergeSort(data, left, mid); // Recursively sort first half
mergeSort(data, mid+1, right); // Recursively sort second half

merge(data, left, mid, right); // Merge both halves
}
}



























CMPSC 122 Lab Report Jake Eden and Max Hamill Page 6 of 24

void merge(int data[], int left, int mid, int right)
// PRE: left <= mid <= right, both data[left..mid] and data[mid+1..right] are sorted in
// ascending order.
// POST: data[left..right] contains all elements of data[left..mid] and
// data[mid+1..right], sorted in ascending order and contains all values from
// data[left..right] at the time of invocation.
{
int leftIndex; // Start of left (sub)array
int rightIndex; // End of right (sub)array
int tempIndex; // Counter index for temp
int * temp; // Temp array to hold sorted values

leftIndex = left;
rightIndex = mid + 1;
tempIndex = left;
temp = new int[right+1]; // Allocate just enough memory

while(leftIndex <= mid && rightIndex <= right) // While niether half is fully sorted
{
if(data[leftIndex] <= data[rightIndex]) // If left half's value comes first
{
temp[tempIndex] = data[leftIndex]; // Insert left half's value into temp
leftIndex++;
}
else // If right half's value comes first
{
temp[tempIndex] = data[rightIndex]; // Insert right half's value into temp
rightIndex++;
}
tempIndex++;
}

if(leftIndex > mid) // If left half is fully sorted
{
for(int i = rightIndex; i <= right; i++) // Traverse and insert right half
{
temp[tempIndex] = data[i];
tempIndex++;
}
}
else // If right half is fully sorted
{
for(int i = leftIndex; i <= mid; i++) // Traverse and insert left half
{
temp[tempIndex] = data[i];
tempIndex++;
}
}

for(int i = left; i <= right; i++) // Traverse temp
{
data[i] = temp[i]; // Copy temp's value to data
}

delete temp;
}

void treeSort(int data[], int logSize)
// PRE: logSize > 0 and data[0..logSize-1] is initialized.
// POST: data[0..logSize-1] is sorted in ascending order and contains all values from
// data[0..logSize-1] at the time of invocation.
{
BinaryTree tree; // BST of data's values

tree = BinaryTree(data, logSize); // Fill tree
tree.inorderTraversal(data); // Sort tree
}




CMPSC 122 Lab Report Jake Eden and Max Hamill Page 7 of 24

d. Test Driver Source Code

Just to make sure that our sorts are functioning the way we intended, we created a test driver program for all six sorts.
Within this program, we simply created six test arrays (each with the same values for ease of testing), which were each sorted
in ascending order by one of the sorting algorithms. In addition to a main, a print array method (aptly named printArray)
was also implemented so that we could print to the console the results of our sorts. The test driver code is below.

// Programmers: Jake Eden and Max Hamill
// Section: 2
// Lab: Final Project
// Date: April 27, 2014
// Description: This program test drives the sorting methods implemented in Sorts.cpp.

#include "Sorts.h"
#include <iostream>
#include <iomanip>
using namespace std;

void printArray(int data[], int n)
// PRE: n > 0 and a[0..n-1] is initialized.
// POST: Values in a[0..n-1] are printed to the console, delimited by ',' and ending in '.'
{
for(int i = 0; i < n-1; i++) // Traverse data and print out values
{
cout << data[i] << ", ";
}

cout << data[n-1] << "." << endl; // Print out the last value with a '.'
}

int main()
{
int test1[] = {7, 12, 18, 8, 12, 14, 86, 81, 47, 2, 34, 92, 18}; // Test array 1
int test2[] = {7, 12, 18, 8, 12, 14, 86, 81, 47, 2, 34, 92, 18}; // Test array 2
int test3[] = {7, 12, 18, 8, 12, 14, 86, 81, 47, 2, 34, 92, 18}; // Test array 3
int test4[] = {7, 12, 18, 8, 12, 14, 86, 81, 47, 2, 34, 92, 18}; // Test array 4
int test5[] = {7, 12, 18, 8, 12, 14, 86, 81, 47, 2, 34, 92, 18}; // Test array 5
int test6[] = {7, 12, 18, 8, 12, 14, 86, 81, 47, 2, 34, 92, 18}; // Test array 6

cout << "Original Array" << endl;
cout << setw(8);
printArray(test1, 13);

cout << "\nAfter Selection Sort" << endl;
selectionSort(test1,12);
cout << setw(8);
printArray(test1, 13);

cout << "\nAfter Bubble Sort" << endl;
bubbleSort(test2, 12);
cout << setw(8);
printArray(test2, 13);

cout << "\nAfter Insertion Sort" << endl;
insertionSort(test3, 12);
cout << setw(8);
printArray(test3, 13);

cout << "\nAfter Quick Sort" << endl;
quickSort(test4, 0, 12);
cout << setw(8);
printArray(test4, 13);

cout << "\nAfter Merge Sort" << endl;
mergeSort(test5, 0, 12);
cout << setw(8);
printArray(test5, 13);





CMPSC 122 Lab Report Jake Eden and Max Hamill Page 8 of 24

cout << "\nAfter Tree Sort" << endl;
treeSort(test6, 13);
cout << setw(8);
printArray(test6, 13);

return 0;
}

e. Output

This is what the console displayed after we ran the test driver. As you can see, all sorting methods appear functional.
Keep in mind, we have already rigorously tested these algorithms in prior experiments so this output below was more of just
confirmation that the code transferred over correctly.

Original Array
7, 12, 18, 8, 12, 14, 86, 81, 47, 2, 34, 92, 18.

After Selection Sort
2, 7, 8, 12, 12, 14, 18, 18, 34, 47, 81, 86, 92.

After Bubble Sort
2, 7, 8, 12, 12, 14, 18, 18, 34, 47, 81, 86, 92.

After Insertion Sort
2, 7, 8, 12, 12, 14, 18, 18, 34, 47, 81, 86, 92.

After Quick Sort
2, 7, 8, 12, 12, 14, 18, 18, 34, 47, 81, 86, 92.

After Merge Sort
2, 7, 8, 12, 12, 14, 18, 18, 34, 47, 81, 86, 92.

After Tree Sort
2, 7, 8, 12, 12, 14, 18, 18, 34, 47, 81, 86, 92.


3. Code and Output for Timing
a. Source Code for Timing

In order to test the running times of the different sorting methods, we created another Visual Studio project so that we
could have another main function to test our code. In the source code below, we needed to populate an array with random
values and have six copies of it, one for each sorting method. In addition, we needed to implement a timer that counts the
number of clock ticks each sorting method takes. Both the code for generating random values and for timing the sorting tests
were adopted from Doug Hogans online notes. Lastly, we used constants for the size of the arrays and largest integer size of
the arrays so that we could easily manipulate and recompile for different values. This allowed us to test the effectiveness of
the different sorting methods for 10 different array sizes. The code can be seen below.

// Programmers: Jake Eden and Max Hamill
// Section: 2
// Lab: Final Project
// Date: April 27, 2014
// Description: This program times the different sorting methods on random arrays. The code for
// a random number and the timer were taken from Doug's online examples.

#include "Sorts.h"
#include <cstdlib>
#include <ctime>
#include <iostream>
#include <iomanip>
using namespace std;

const int PHYSICAL_SIZE = 30000; // Size of array to generate
const int MAX_INT = 101; // Largest integer to allow in array



CMPSC 122 Lab Report Jake Eden and Max Hamill Page 9 of 24

double nextRandNum()
// PRE: Random number generator has been initialized with a call to srand().
// POST: FCTVAl == next random number x s.t. x >= 0 and x < 1.
{
return double(rand()%32768)/32768; // Generate "random" number
}

void populateArray(int data[])
// PRE: data is allocated to at least a size of PHYSICAL_SIZE.
// POST: data[0..PHYSICAL_SIZE-1] is filled with random values of size 0 to MAX_INT-1
{
srand( (unsigned) time(0) ); // Set seed for random number generation

for(int i = 0; i < PHYSICAL_SIZE; i++) // Fill data with random numbers
{
data[i] = nextRandNum()*MAX_INT; // Numbers of size 0 to MAX_INT-1
}
}

void copyArray(int original[], int copy[])
// PRE: original[0..PHYSICAL_SIZE-1] is initialized and copy is allocated to at least a
// size of PHYSICAL_SIZE.
// POST: copy[0..PHYSICAL_SIZE-1] mirrors original[0..PHYSICAL_SIZE-1].
{
for(int i = 0; i < PHYSICAL_SIZE; i++) // Copy values from original
{
copy[i] = original[i];
}
}

int main()
{
int test1[PHYSICAL_SIZE]; // Test array 1
int test2[PHYSICAL_SIZE]; // Test array 2
int test3[PHYSICAL_SIZE]; // Test array 3
int test4[PHYSICAL_SIZE]; // Test array 4
int test5[PHYSICAL_SIZE]; // Test array 5
int test6[PHYSICAL_SIZE]; // Test array 6
clock_t startTime; // Start time for sorting in clock ticks
clock_t finishTime; // Finish time for sorting in clock ticks
double elapsedTime; // Number of clock ticks for sorting

populateArray(test1); // Fill all arrays with random numbers
copyArray(test1, test2);
copyArray(test1, test3);
copyArray(test1, test4);
copyArray(test1, test5);
copyArray(test1, test6);

// Time Selection Sort
startTime = clock();
selectionSort(test1, PHYSICAL_SIZE-1);
finishTime = clock();
elapsedTime = double(finishTime-startTime);
cout << "Time to selection sort: ";
cout << setw(3) << elapsedTime << " clock ticks. \n";

// Time Bubble Sort
startTime = clock();
bubbleSort(test2, PHYSICAL_SIZE-1);
finishTime = clock();
elapsedTime = double(finishTime-startTime);
cout << "Time to bubble sort: ";
cout << setw(7) << elapsedTime << " clock ticks. \n";

// Time Insertion Sort
startTime = clock();
insertionSort(test3, PHYSICAL_SIZE-1);
finishTime = clock();
elapsedTime = double(finishTime-startTime);
cout << "Time to insertion sort: ";
cout << setw(3) << elapsedTime << " clock ticks. \n";


CMPSC 122 Lab Report Jake Eden and Max Hamill Page 10 of 24

// Time Quick Sort
startTime = clock();
quickSort(test4, 0, PHYSICAL_SIZE-1);
finishTime = clock();
elapsedTime = double(finishTime-startTime);
cout << "Time to quick sort: ";
cout << setw(6) << elapsedTime << " clock ticks. \n";

// Time Merge Sort
startTime = clock();
mergeSort(test5, 0, PHYSICAL_SIZE-1);
finishTime = clock();
elapsedTime = double(finishTime-startTime);
cout << "Time to merge sort: ";
cout << setw(7) << elapsedTime << " clock ticks. \n";

// Time Tree Sort
startTime = clock();
treeSort(test6, PHYSICAL_SIZE);
finishTime = clock();
elapsedTime = double(finishTime-startTime);
cout << "Time to tree sort: ";
cout << setw(8) << elapsedTime << " clock ticks. \n";

return 0;
}

b. Example Output for an Execution of the Timing Program

Below is an example of the output that displays how long each sort took (in clock ticks) for an array of size 30000 when
the elements were random.

Time to selection sort: 1112 clock ticks.
Time to bubble sort: 2450 clock ticks.
Time to insertion sort: 569 clock ticks.
Time to quick sort: 24 clock ticks.
Time to merge sort: 130 clock ticks.
Time to tree sort: 202 clock ticks.


4. Raw Data
a. Raw Data: Unsorted Input

We measured the number of clock ticks each sort method took for 10 different array sizes, each with three trials so as to
minimize any abnormal behavior. The array sizes ranged from 30,000 to 75,000 in 5000 increments, and the array values
were randomly generated with values from 0-100. The results have been compiled in tables below.

Table 1. Time (clock ticks) of each sorting algorithm for arrays of size 30,000-40,000 for random input.
Sorting
Algorithm
Array Size: 30000 Array Size: 35000 Array Size: 40000
Trial 1 Trial 2 Trial 3 Trial 1 Trial 2 Trial 3 Trial 1 Trial 2 Trial 3
Selection 1110 1113 1112 1512 1515 1511 1983 1965 1972
Bubble 2437 2439 2450 3359 3349 3380 4411 4404 4432
Insertion 567 568 569 774 779 777 1018 1013 1034
Quick 23 23 24 30 31 30 39 39 40
Merge 130 130 130 174 175 174 224 226 231
Tree 199 198 202 272 272 271 354 363 378





CMPSC 122 Lab Report Jake Eden and Max Hamill Page 11 of 24

Table 2. Time (clock ticks) of each sorting algorithm for arrays of size 45,000-55,000 for random input.
Sorting
Algorithm
Size: 45000 Size: 50000 Size: 55000
Trial 1 Trial 2 Trial 3 Trial 1 Trial 2 Trial 3 Trial 1 Trial 2 Trial 3
Selection 2501 2486 2493 3075 3079 3076 3742 3735 3741
Bubble 5621 5590 5582 6932 6955 6921 8362 8351 8316
Insertion 1277 1279 1276 1578 1573 1575 1902 1905 1949
Quick 48 48 49 58 58 58 70 71 72
Merge 283 282 284 348 344 345 416 415 434
Tree 449 445 451 555 555 559 665 689 664

Table 3. Time (clock ticks) of each sorting algorithm for arrays of size 60,000-70,000 for random input.
Sorting
Algorithm
Size: 60000 Size: 65000 Size: 70000
Trial 1 Trial 2 Trial 3 Trial 1 Trial 2 Trial 3 Trial 1 Trial 2 Trial 3
Selection 4427 4459 4423 5520 5219 5267 6016 6055 6027
Bubble 9984 9981 10004 11757 11783 11783 13602 13524 13603
Insertion 2276 2263 2324 2661 2649 2674 3086 3161 3089
Quick 83 84 83 96 96 96 112 110 112
Merge 492 492 493 576 574 578 670 683 670
Tree 801 803 811 949 943 955 1137 1257 1115

Table 4. Time (clock ticks) of each sorting algorithm for an array of size 75,000 for random input.
Sorting
Algorithm
Size: 75000
Trial 1 Trial 2 Trial 3
Selection 7042 6926 7065
Bubble 15635 15636 15624
Insertion 3550 3518 3522
Quick 127 125 126
Merge 774 776 755
Tree 1285 1277 1273


b. Raw Data: Ascending Sorted Input

We also measured the number of clock ticks each sort method took for when the array values were already sorted in
ascending order in order to see if this would affect sorting times. The results have been compiled in tables below.

Table 5. Time (clock ticks) of each sorting algorithm for arrays of size 30,000-40,000 for ascending sorted input.
Sorting
Algorithm
Size: 30000 Size: 35000 Size: 40000
Trial 1 Trial 2 Trial 3 Trial 1 Trial 2 Trial 3 Trial 1 Trial 2 Trial 3
Selection 1100 1098 1101 1506 1502 1502 1965 1964 1975
Bubble 1281 1287 1292 1750 1749 1754 2279 2291 2280
Insertion 0 0 0 0 0 0 1 0 0
Quick 1860 1879 1869 2564 2543 2544 3318 3338 3331
Merge 130 128 130 175 173 175 223 223 222
Tree 21183 19893 21010 30286 29277 30069 41443 40448 40272

CMPSC 122 Lab Report Jake Eden and Max Hamill Page 12 of 24

Table 6. Time (clock ticks) of each sorting algorithm for arrays of size 45,000-55,000 for ascending sorted input.
Sorting
Algorithm
Size: 45000 Size: 50000 Size: 55000
Trial 1 Trial 2 Trial 3 Trial 1 Trial 2 Trial 3 Trial 1 Trial 2 Trial 3
Selection 2487 2491 2539 3072 3069 3069 3737 3714 3716
Bubble 2891 2897 2942 3561 3570 3571 4313 4303 4309
Insertion 0 0 0 0 0 0 0 0 0
Quick 4275 4201 4227 5174 5171 5180 6278 6254 6273
Merge 285 281 286 347 350 347 413 416 418
Tree 54047 54679 53719 70117 69706 69942 87143 86974 87219

Table 7. Time (clock ticks) of each sorting algorithm for arrays of size 60,000-70,000 for ascending sorted input.
Sorting
Algorithm
Size: 60000 Size: 65000 Size: 70000
Trial 1 Trial 2 Trial 3 Trial 1 Trial 2 Trial 3 Trial 1 Trial 2 Trial 3
Selection 4430 4419 4427 5195 5210 5259 6020 6037 6088
Bubble 5148 5147 5131 6001 6015 6106 6963 6976 7040
Insertion 0 0 0 0 0 0 0 1 0
Quick 7590 7459 7460 8741 8741 8748 10278 10163 10135
Merge 495 493 494 571 574 574 664 667 668
Tree 106135 106152 106544 127979 127295 127374 150593 150578 150787

Table 8. Time (clock ticks) of each sorting algorithm for an array of size 75,000 for ascending sorted input.
Sorting
Algorithm
Size: 75000
Trial 1 Trial 2 Trial 3
Selection 6883 6913 6907
Bubble 7975 7987 7981
Insertion 0 0 1
Quick 11627 11617 11635
Merge 770 772 770
Tree 175684 176230 176573


c. Raw Data: Descending Sorted Input

Lastly, we measured the number of clock ticks each sort method took for when the array values were sorted in
descending order to see if this affected sorting time. The results have been compiled in tables below.

Table 9. Time (clock ticks) of each sorting algorithm for arrays of size 30,000-40,000 for descending sorted input.
Sorting
Algorithm
Size: 30000 Size: 35000 Size: 40000
Trial 1 Trial 2 Trial 3 Trial 1 Trial 2 Trial 3 Trial 1 Trial 2 Trial 3
Selection 1104 1104 1107 1503 1514 1503 1965 1981 1976
Bubble 1825 1849 1840 2517 2510 2508 3243 3261 3266
Insertion 1128 1133 1128 1549 1556 1540 2031 2016 2015
Quick 592 530 465 653 409 340 728 666 986
Merge 128 128 128 173 171 172 225 222 225
Tree 221 224 223 293 291 291 366 370 367

CMPSC 122 Lab Report Jake Eden and Max Hamill Page 13 of 24


Table 10. Time (clock ticks) of each sorting algorithm for arrays of size 45,000-55,000 for descending sorted input.
Sorting
Algorithm
Size: 45000 Size: 50000 Size: 55000
Trial 1 Trial 2 Trial 3 Trial 1 Trial 2 Trial 3 Trial 1 Trial 2 Trial 3
Selection 2496 2508 2490 3092 3142 3068 3721 3735 3721
Bubble 4139 4130 4112 5090 5176 5088 6181 6161 6166
Insertion 2559 2550 2536 3138 3160 3130 3807 3817 3819
Quick 912 753 1123 1164 1128 1671 1257 2126 594
Merge 280 283 280 343 341 343 413 413 413
Tree 457 456 456 552 557 549 657 654 657

Table 11. Time (clock ticks) of each sorting algorithm for arrays of size 60,000-70,000 for descending sorted input.
Sorting
Algorithm
Size: 60000 Size: 65000 Size: 70000
Trial 1 Trial 2 Trial 3 Trial 1 Trial 2 Trial 3 Trial 1 Trial 2 Trial 3
Selection 4416 4425 4421 5191 5211 5185 6089 5995 6022
Bubble 7351 7355 7327 8606 8727 8606 9971 9982 9977
Insertion 4560 4515 4536 5308 5290 5350 6183 6185 6175
Quick 2822 919 1427 629 1520 1236 2124 2358 3561
Merge 495 488 491 573 574 578 666 672 665
Tree 774 766 774 891 893 891 1026 1028 1027

Table 12. Time (clock ticks) of each sorting algorithm for an array of size 75,000 for descending sorted input.
Sorting
Algorithm
Size: 75000
Trial 1 Trial 2 Trial 3
Selection 6919 6885 6908
Bubble 11415 11494 11471
Insertion 7156 7095 7241
Quick 3660 2337 3893
Merge 779 770 769
Tree 1169 1168 1167













CMPSC 122 Lab Report Jake Eden and Max Hamill Page 14 of 24
5. Results
a. Unsorted Input

In order to make sense of the large amount of raw data, first we looked at the unsorted input and took the average values
for each trial as seen in the two tables below. Then we decided to graph the average clock tick values for each sort against the
size of the array in order to better understand the trend of each sort as array size increases.

Table 13. Average time (clock ticks) of each sorting algorithm for arrays of size 30,000-50,000 for random input.
Sorting
Algorithm
Size
30000 35000 40000 45000 50000
Selection 1112 1513 1973 2493 3077
Bubble 2442 3363 4416 5598 6936
Insertion 568 777 1022 1277 1575
Quick 23 30 39 48 58
Merge 130 174 227 283 346
Tree 200 272 365 448 556

Table 14. Average time (clock ticks) of each sorting algorithm for arrays of size 55,000-75,000 for random input.
Sorting
Algorithm
Size
55000 60000 65000 70000 75000
Selection 3739 4436 5335 6033 7011
Bubble 8343 9990 11774 13576 15632
Insertion 1919 2288 2661 3112 3530
Quick 71 83 96 111 126
Merge 422 492 576 674 768
Tree 673 805 949 1170 1278

Graph 1.


CMPSC 122 Lab Report Jake Eden and Max Hamill Page 15 of 24

In order to better differentiate between quick, merge, and tree sort, we created another graph with just those three sorts.

Graph 2.


The graphs can give us insight into the performance of each sorting algorithm, but wed also like to compare our values
to the theoretical running time values. In order to do this we essentially just took a ratio of the running time (in clock ticks)
over the theoretical asymptotic running time in which the size of the array was the n value. The size of the array was
divided by 1000 simply to eliminate unnecessary 0s, which dont significantly affect the ratio. For this case, we took the
theoretical asymptotic running times of the average-case (which is often just the same as the worst-case) from our lecture
notes and from Chapter 4 of Algorithms Unlocked by Thomas Cormen. We used the average-case running times in the ratio
because for unsorted input, it seems appropriate in the sense that average sorts would have random input. Notice that the
correctness of the theoretical running time can be seen if the ratio is consistent among each size. The value of the ratio itself
is less important and can (depending on the case) signify the value of a leading coefficient or other lesser terms.

Table 15. Experimental running time (clock ticks) vs. theoretical running time via a ratio of the two for random input of sizes
30,000-50,000.
Sorting
Algorithm
(Running Time)/(Theoretical Asymptotic Running Time)
Size 30 Size 35 Size 40 Size 45 Size 50
Selection
(n
2
)
1.24 1.24 1.23 1.23 1.23
Bubble
(n
2
)
2.71 2.75 2.76 2.76 2.77
Insertion
(n
2
)
0.631 0.634 0.639 0.631 0.63
Quick
(nlgn)
0.156 0.167 0.183 0.194 0.206
Merge
(nlgn)
0.883 0.969 1.07 1.15 1.23
Tree
(nlgn)
1.36 1.52 1.71 1.81 1.97


CMPSC 122 Lab Report Jake Eden and Max Hamill Page 16 of 24

Table 16. Experimental running time (clock ticks) vs. theoretical running time via a ratio of the two for random input of sizes
55,000-75,000.
Sorting
Algorithm
(Running Time)/(Theoretical Asymptotic Running Time)
Size 55 Size 60 Size 65 Size 70 Size 75
Selection
(n
2
)
1.24 1.23 1.26 1.23 1.25
Bubble
(n
2
)
2.76 2.78 2.79 2.77 2.78
Insertion
(n
2
)
0.634 0.636 0.630 0.635 0.628
Quick
(nlgn)
0.223 0.234 0.245 0.259 0.270
Merge
(nlgn)
1.33 1.39 1.47 1.57 1.64
Tree
(nlgn)
2.12 2.27 2.42 2.73 2.74


b. Ascending Sorted Input
We then turned to the raw data for ascending sorted input and took the average values for each trial as displayed below.

Table 17. Average time (clock ticks) of each sorting algorithm for arrays of size 30,000-50,000 for ascending sorted input.
Sorting
Algorithm
Size
30000 35000 40000 45000 50000
Selection 1100 1503 1968 2506 3070
Bubble 1287 1751 2283 2910 3567
Insertion 0 0 0 0 0
Quick 1869 2550 3329 4234 5175
Merge 129 174 223 284 348
Tree 20695 29877 40721 54148 69922

Table 18. Average time (clock ticks) of each sorting algorithm for arrays of size 55,000-75,000 for ascending sorted input.
Sorting
Algorithm
Size
55000 60000 65000 70000 75000
Selection 3722 4425 5221 6048 6901
Bubble 4308 5142 6041 6993 7981
Insertion 0 0 0 0 0
Quick 6268 7503 8743 10192 11626
Merge 416 494 573 666 771
Tree 87112 106277 127549 150653 176162










CMPSC 122 Lab Report Jake Eden and Max Hamill Page 17 of 24

Graph 3.



In order to better differentiate between selection, bubble, insertion, quick, and merge sort, we created another graph with just
those five sorts.

Graph 4.





CMPSC 122 Lab Report Jake Eden and Max Hamill Page 18 of 24

Again we attempted to compare our calculated running time values to the theoretical running time values of our sorting
algorithms. This time we wanted to see how the sorts perform when they are passed an array that is already sorted in
ascending order. The same ratio (running time in clock ticks over theoretical asymptotic running time) we used for Table 15
and Table 16 was employed below.

Table 19. Experimental running time (clock ticks) vs. theoretical running time via a ratio of the two for ascending sorted
input of sizes 30,000-50,000.
Sorting
Algorithm
(Running Time)/(Theoretical Asymptotic Running Time)
Size 30 Size 35 Size 40 Size 45 Size 50
Selection
(n
2
)
1.22 1.23 1.23 1.24 1.23
Bubble
(n
2
)
1.43 1.43 1.43 1.44 1.43
Insertion
(n)
0 0 0 0 0
Quick
(n
2
)
2.08 2.08 2.08 2.09 2.07
Merge
(nlgn)
0.876 0.969 1.05 1.15 1.23
Tree
(n
2
)
23.0 24.4 25.5 26.7 28.0

Table 20. Experimental running time (clock ticks) vs. theoretical running time via a ratio of the two for ascending sorted
input of sizes 55,000-75,000.
Sorting
Algorithm
(Running Time)/(Theoretical Asymptotic Running Time)
Size 55 Size 60 Size 65 Size 70 Size 75
Selection
(n
2
)
1.23 1.23 1.24 1.23 1.23
Bubble
(n
2
)
1.42 1.43 1.43 1.43 1.42
Insertion
(n)
0 0 0 0 0
Quick
(n
2
)
2.07 2.08 2.07 2.08 2.07
Merge
(nlgn)
1.31 1.39 1.46 1.55 1.65
Tree
(n
2
)
28.8 29.5 30.2 30.7 31.3
















CMPSC 122 Lab Report Jake Eden and Max Hamill Page 19 of 24

c. Descending Sorted
Finally, we took the raw data for descending sorted input and took the average values for each trial as displayed below.

Table 21. Average time (clock ticks) of each sorting algorithm for arrays of size 30,000-50,000 for descending sorted input.
Sorting
Algorithm
Size
30000 35000 40000 45000 50000
Selection 1105 1507 1974 2498 3101
Bubble 1838 2512 3257 4127 5118
Insertion 1130 1548 2021 2548 3143
Quick 529 467 793 929 1321
Merge 128 172 224 281 342
Tree 223 292 368 456 553

Table 22. Average time (clock ticks) of each sorting algorithm for arrays of size 55,000-75,000 for descending sorted input.
Sorting
Algorithm
Size
55000 60000 65000 70000 75000
Selection 3726 4421 5196 6035 6904
Bubble 6169 7344 8646 9977 11460
Insertion 3814 4537 5316 6181 7164
Quick 1326 1723 1128 2681 3297
Merge 413 491 575 668 773
Tree 656 771 892 1027 1168

Graph 5.






CMPSC 122 Lab Report Jake Eden and Max Hamill Page 20 of 24

In order to better differentiate between quick, merge, and tree sort, we created another graph with just those three sorts.

Graph 6.


Just like the previous two times, we again attempted to compare our calculated running time values to the theoretical
running time values of our sorting algorithms. This time we wanted to see how the sorts perform when they are passed an
array that is already sorted in descending order. In order to do this we took the same ratio as the last two times (running time
in clock ticks over theoretical asymptotic running time).

Table 23. Experimental running time (clock ticks) vs. theoretical running time via a ratio of the two for descending sorted
input of sizes 30,000-50,000.
Sorting
Algorithm
(Running Time)/(Theoretical Asymptotic Running Time)
30 35 40 45 50
Selection
(n
2
)
1.23 1.23 1.23 1.23 1.24
Bubble
(n
2
)
2.04 2.05 2.04 2.04 2.05
Insertion
(n
2
)
1.26 1.26 1.26 1.26 1.26
Quick
(n
2
)
0.588 0.381 0.496 0.459 0.528
Merge
(nlgn)
0.870 0.958 1.05 1.14 1.21
Tree
(n
2
)
0.248 0.238 0.230 0.225 0.221







CMPSC 122 Lab Report Jake Eden and Max Hamill Page 21 of 24

Table 24. Experimental running time (clock ticks) vs. theoretical running time via a ratio of the two for descending sorted
input of sizes 55,000-75,000.
Sorting
Algorithm
(Running Time)/(Theoretical Asymptotic Running Time)
Size 55 Size 60 Size 65 Size 70 Size 75
Selection
(n
2
)
1.23 1.23 1.23 1.23 1.23
Bubble
(n
2
)
2.04 2.04 2.05 2.04 2.04
Insertion
(n
2
)
1.26 1.26 1.26 1.26 1.27
Quick
(n
2
)
0.438 0.479 0.267 0.547 0.586
Merge
(nlgn)
1.30 1.39 1.47 1.56 1.65
Tree
(n
2
)
0.217 0.214 0.211 0.210 0.208


6. Observations
a. Unsorted Input

When it comes to unsorted input, quick sort deserves first mention. Not only because it is consistently fastestso fast, in
fact, that it was the reason we needed to have array sizes of at least 30,000, but also because it was the sorting algorithm
that caused our program to crash when we reached an array size of 40,000. We think this is probably due to the large number
of recursive calls that quick sort makes that are pushed on top of the call stack. At the large array sizes we were working with
there were hundreds of thousands of such calls. We had to increase the default memory size allocated to the call stack.
However, once we did, quick sort no longer gave any problem and retained its place as the fastest sort. Theoretically, its
average case running time is almost always its best case running time (nlgn). Looking at Graph 2, it appears as if quick sort
could be conforming to this standard for unsorted input; however, a more rigorous form of analysis is needed. This is where
our ratio test becomes useful. The idea is to divide the running time (in clock ticks) by the theoretical asymptotic running
time (in this case, (nlgn)) and compare the resulting ratio between each array size. If there is consistency, then it would
indicate a high level of conformity to the theoretical running time. In quick sorts case, we were not so fortunate. As seen in
Table 15 and Table 16, the ratio for size 35,000 was 0.156, but the ratio for size 75,000 was 0.270. While at first this may
seem to indicate that quick sort does not conform to its expected running time, we posit that the difference is probably
attributed to terms that are typically ignored in asymptotic run time analysis, especially since the ratio difference was
relatively minor between adjacent sizes. Therefore, quick sort excels at the divide and conquer technique, but it consumes a
large amount of memory, at least for our recursive implementation. We attribute its success for unsorted input to the
random nature of the last element in each (sub)array because this typically means that the pivot will be a value such that it
evenly splits the array into two sub-arrays.
Next, lets look at the slowest sorting algorithms: bubble sort, selection sort, and insertion sort. First of all, bubble sort
was noticeably worse (by far) than every other sorting algorithm. Even though the these three sorts all have (n
2
) as a worst
case running time (which is the same as their average case running time), bubble sort is still much worse than the others. In
fact, bubble sort became the reason we needed two graphs (Graph 1 and Graph 2) for unsorted input. At array size of 75,000,
bubble sort had an average of 15,632 clock ticks while selection sort had 7,011 clock ticks and insertion sort had 3,530 clock
ticks. The reason for this can be better seen when using our ratio test. Since all three sorts have an asymptotic running time of
(n
2
), we can actually compare the ratio values between them as a rough efficiency comparison. Bubble sorts ratio was
consistently about 2.75; selection sorts ratio was consistently about 1.24; and insertion sorts ratio was consistently about
0.631. Therefore, it can interpreted that bubble sorts algorithm runs at about 2.75n
2
if we dont eliminate the leading
coefficient, and the selection sort and insertion sort are both 1.24n
2
and 0.631n
2
, respectively. Obviously, we recognize that
2.75n
2
is not an accurate representation of bubble sorts algorithm (we are ignoring lesser terms and simplifying them into a
coefficient), but it is a useful comparative tool when comparing (n
2
) algorithms. This high leading coefficient makes sense
on a logical level when analyzing the specifics of bubble sorts algorithm because it involves constantly swapping adjacent
values within the array to bubble up the highest values. In comparison to selection sort, even though both sorts traverse the
array in nearly identical ways, bubble sort is swapping many times while selection sort is merely comparing values.
Therefore, selection sort is significantly faster. However, both are slower than insertion sort because the algorithm of
CMPSC 122 Lab Report Jake Eden and Max Hamill Page 22 of 24

insertion sort does not necessitate a full array traverse (of the unsorted section) each time its inner loop iterates; it depends on
where the unsorted value needs to be inserted. Furthermore, since all three are in-place algorithms (meaning that they dont
require extra array storage aside from the swaps), insertion sort seems to be a far better option.
Lastly, merge sort and tree sort are both (nlgn) algorithms that were the second fastest and third fastest, respectively.
Tree sort was interesting as well because it, too, crashed at the same time that quick sort crashed for an array size of 40,000,
meaning that it also has greater memory limitations than the other sorts (due again to the recursive nature of the algorithm
and its effect on the call stack). Thats not to say merge sort doesnt require a significant chunk of memory; its not an in-
place algorithm so a temporary array was needed to store the sorted values before copying them back into the output array.
Therefore, we think that we probably would have run into memory issues with merge sort if we hadnt already increased the
stack size. Merge sort always has a (nlgn) running time no matter the circumstances, which performs exceedingly well;
however, its constant factor is significantly higher than that of quick sort, meaning there are situations in which there are
better options than merge sort like when dealing with smaller array sizes. Like quick sort, both merge sorts and tree sorts
ratio were slightly inconsistent, but again like with quick sort, we imagine that its probably due to the influence of lesser
terms. At least, it is clear from Graph 2 that the algorithm is not (n
2
) like the other sorts. Despite its relative speed, tree sort
took approximately 1278 clock ticks to fully sort an array of size 75,000. At first, we thought this may be because it had to
first build a binary search tree, but upon further analysis (see subsection c. Descending Sorted Input), the vast majority of tree
sorts time is spent traversing the tree rather than sorting it. It may be possible that traversing via pointers takes longer than
via arrays because we constantly had to check for NULL pointers in order to ensure we didnt accidently deference a NULL
pointer. Therefore, merge sort and quick sort are more time efficient for unsorted input.

b. Ascending Sorted Input

Just looking at Graph 3, it is clear that tree sort needs to be mentioned first. Whatever gains it had for unsorted input, it
more than lost for ascending sorted input. Tree sort is now performing at its worst case of (n
2
), and while our ratio values
vary from about 23-31, we still think this is a relatively small margin, especially when considering adjacent ratio values, and
is indicative of such worst case performance. In fact, tree sort took more clock ticks to sort than any other condition or any
other sorting algorithm in our entire project. It took over 176,000 clock ticks for an array of size 75,000. This translated to
nearly 3 minutes of real world time. In comparison, the next worse time for an array of size 75,000 was by quick sort with
11,626 clock ticks. Thus tree sort took over 15 times longer to compute than the second worst sorting algorithm for this case..
This is attributed to the fact that the binary search tree now more closely resembles a linked list. Thus, this completely
eliminates the divide and conquer aspect of typical binary search trees.
The next surprise was quick sorts execution time. For unsorted input, it was the undisputed fastest, but now (aside from
tree sort) it executes in the slowest amount of time. Compared to its execution for unsorted input, it is about 80-90 times
slower. Usually, quick sorts average case running time is O(nlgn); however, it was now performing in its worst case
asymptotic running time of (n
2
). We confirmed this with our ratio test, which consistently returned a value of about 2.08.
Additionally, since quick sort is now running its worst case of (n
2
) we can compare its ratio value to the other (n
2
)
algorithms. Doing this, we can clearly see from Table 19 and Table 20 that quick sort should be faster than tree sort but
slower than bubble sort and selection sort, which have ratio values of 1.43 and 1.23, respectively. This slowdown was due to
how the pivot was chosen in the quick sort algorithm and the effect of it being greater than the beginning array elements. In
our implementation, quick sort chose the last value in the array as the pivot. For unsorted input, this generally resulted in two
nearly even partitions of the array, but now it doesnt partition the array at all. It merely removes the final value from
considerationinstead of divide and conquer it is chip and conquer. This means many more recursive calls are needed,
which are all essentially wasted on a sorted array anyway. However, what happens after the pivot is chosen is just as
inefficient. Essentially, the pivot is compared to all values in the array and resorted into its initial spot, wasting much time. A
better implementation would be one in which the pivot is either chosen randomly each time, or one in which a few indices are
searched and the middle value is chosen as the pivot.
Of course, the standout that we have yet to mention is insertion sort. Its running time returned 0 clock ticks for nearly
every trial, and when it didnt return 0, it returned 1 clock tick. Clearly, its performing at its best case asymptotic running
time of (n). Even at an array size of 75,000, insertion sort still returned an average value of 0 clock ticks. Naturally, then the
ratio test for insertion sort in this case was 0 for all sizes. This makes sense when examining how insertion sort operates. It
essentially checks if the value is in place in comparison to the value next to it, which is within a sorted section of the array,
and if the value is not in place, then it will shift over the values until it is in place. However, since everything is already
sorted, no shifting occurs, and the only real operation taking placing is just a check on each array element. Thus, (n), which
will only registered as 0 or 1 clock ticks.
Bubble and selection sort were both slightly faster, but still have a (n
2
) running time. Since bubble sort did not have to
perform any swaps and only check if values are in place, it could perform fasteralthough, it was still slightly slower than
selection sort as seen on Graph 4. The same type of reasoning can be used to explain why selection sort performed faster: it
didnt have to swap any values. However, both were still slower than merge sort because merge sort is always (nlgn), and in
CMPSC 122 Lab Report Jake Eden and Max Hamill Page 23 of 24

this case merge sort was the fastest sorting algorithm. Merge sorts running time was also extremely similar to its running
time for unsorted input. In fact, it doesnt seem like the input affects merge sort significantly.


c. Descending Sorted Input

After reviewing Graph 5 and Graph 6 it became obvious to us that there was something strange about the performance of
our tree sort. The tree sort algorithm should always have a performance of (n
2
) if assembled liked a linked list so why was
it that our descending sort was sorting so quickly? After much thought and careful tracing, we were able to get to the root of
the issue. We had assumed that the binary search tree would be created in a linked list linear fashion as we saw with
ascending sorted input; however, we didnt take into account what happens when keys are equal to the root. In our
implementation, if a key is equal to the root, it is sent to the right. Since our max integer value for our arrays was 100, this
means many keys are equal to the root and many keys are equal to sub-roots as well. This then results in a binary search tree
that is not perfectly balanced but is far from the linear structure of the linked list. Therefore, our tree sort algorithm still
performed quite well because it could still partially divide and conquer in recursive fashion.
Bubble sort was the slowest this time as well. Like with tree sort, we assumed that this would result in a slower sort time;
however, like with tree sort, we have to consider the effect of the maximum integer size of the elements. Since it was low at
100, there were many repeated values, which means that many values did not need to be swapped. Thus, bubble sort
performed better with descending sorted input than with unsorted input. Granted, bubble sort would still get a benefit in the
unsorted input test but it would be as immediate and beneficial as it is in the descending sorted input.
Selection and insertion were about the same in this case, both of which were faster than bubble sort. Since the maximum
integer size was 100 and there were many repeat values, selection sort managed to have a more or less equal running time to
insertion sort. Insertion sort should almost always be faster, which is why it is so obvious that the small maximum integer
size benefitted selection sort the most. Again, this effect is most noticeable for descending sorted input. Still, the running
times for both were (n
2
), which was again confirmed by our consistent ratio values.
Quick sort performed better than selection sort and insertion sort, but it performed better than it would have if our
maximum integer value were higher. It didnt perform as well as it did with unsorted input, but it did perform better than
ascending sorted input. It performed better than ascending sorted input because the pivot only needed to be swapped with the
initial value instead of being swapped with every value until back in its original position; it was merely swapped with the
initial element. Therefore, it performed slower than unsorted input but faster than the ascending sorted input.
As always, merge sort was consistently fast. It doesnt matter whether the input is unsorted, ascending sorted, or
descending sorted. Merge sort is still (nlgn) with the same constant factors.

7. Conclusion

The biggest takeaway is that no sorting algorithm is objectively the best. However, with that being said, there are still
some sorting algorithms that are almost always worse. For instance, bubble sort consistently sorted poorly compared to the
other algorithms, and its hard to imagine a situation in which bubble sort would be the most efficient method. Still, one
might be inclined to use it simply because of its ease of implementation. If the resources are available, why waste time
implementing a more efficient but trickier sorting algorithm? The same could be said about selection sort, considering its
subpar running times. Another take away is the balance between memory an d speed. Quick sort, merge sort, and tree sort all
consume a large amount of memory, which may be a factor, especially if programming on mobile devices. Merge sort was
consistently the fastest no matter the input so if memory is not a problem and the type of input was unknown, then we would
recommend using that. If the input was partially sorted or fully sorted and memory is an issue, then insertion sort would be a
good algorithm to use. With our implementation of tree sort, we would never recommend it over merge sort as a pure sorting
algorithm; however, if tree sort was implemented with a self-balancing tree than it could be a very useful sorting algorithm.
Additionally, tree sort could be beneficial if one wanted to traverse the tree in different ways (i.e., pre-order vs in-order).
Lastly, we learned just how much impact repeated values can have on sorting algorithms with tree sorts running time for
descending sorted input being a prime example of this.
Initially we struggled to extract the kind of deep thoughts that are required for a project of this nature, but once we got
accustomed to the idea of thinking analytically the project really took off. As our relationship as partners progressed, we
began to see eye to eye more often, but when we did not there was always a great spark in civil discussion. We both worked
together to rigorously analyze our data to the best of our abilities and were benefited from picking each others brains. Both
of us checked all of the code to ensure that it was the best we could produce. Teamwork was an essential component of this
project; if we had attempted to complete this project individually, the observations would have been lackluster because we
often pointed out each others mistakes whether it be grammatical or logical. We both experienced growth as programmers
and analytical thinkers as we helped each other and combined our strengths to complete this project. Whats great is that we
CMPSC 122 Lab Report Jake Eden and Max Hamill Page 24 of 24

can now both discuss all six sorting algorithms from memory instead of needing to consistently remind ourselves of how
each one worked like we did in the beginning of the project.

You might also like