12/5/2018 Shellsort - Wikipedia
Shellsort
Shellsort, also known as Shell sort or Shell's method, is an in-place comparison sort. It can be seen as either
Shellsort
a generalization of sorting by exchange (bubble sort) or sorting by insertion (insertion sort).[3] The method starts
by sorting pairs of elements far apart from each other, then progressively reducing the gap between elements to
be compared. Starting with far apart elements, it can move some out-of-place elements into position faster than a
simple nearest neighbor exchange. Donald Shell published the first version of this sort in 1959.[4][5] The running
time of Shellsort is heavily dependent on the gap sequence it uses. For many practical variants, determining their
time complexity remains an open problem.
Contents
Description
Pseudocode
Gap sequences
Computational complexity
Applications
See also
References
Shellsort with gaps 23, 10, 4, 1 in action.
Bibliography
External links Class Sorting algorithm
Data structure Array
Worst-case O(n2) (worst known gap
Description performance sequence)
Shellsort is a generalization of insertion sort that allows the exchange of items that are far apart. The idea is to O(n log2n) (best known
arrange the list of elements so that, starting anywhere, considering every hth element gives a sorted list. Such a gap sequence)[1]
list is said to be h-sorted. Equivalently, it can be thought of as h interleaved lists, each individually sorted.[6] Best-case O(n log n)[2]
Beginning with large values of h, this rearrangement allows elements to move long distances in the original list, performance
reducing large amounts of disorder quickly, and leaving less work for smaller h-sort steps to do.[7] If the list is Average depends on gap
then k-sorted for some smaller integer k, then the list remains h-sorted. Following this idea for a decreasing performance sequence
sequence of h values ending in 1 is guaranteed to leave a sorted list in the end.[6]
Worst-case О(n) total, O(1) auxiliary
An example run of Shellsort with gaps 5, 3 and 1 is shown below. space
complexity
a1 a2 a3 a4 a5 a6 a7 a8 a9 a10 a11 a12
Input data 62 83 18 53 07 17 95 86 47 69 25 28
After 5-sorting 17 28 18 47 07 25 83 86 53 69 62 95
After 3-sorting 17 07 18 47 28 25 69 62 53 83 86 95
After 1-sorting 07 17 18 25 28 47 53 62 69 83 86 95
The first pass, 5-sorting, performs insertion sort on five separate subarrays (a1, a6, a11), (a2, a7, a12), (a3, a8), (a4, a9), (a5,
a10). For instance, it changes the subarray (a1, a6, a11) from (62, 17, 25) to (17, 25, 62). The next pass, 3-sorting, performs
insertion sort on the three subarrays (a1, a4, a7, a10), (a2, a5, a8, a11), (a3, a6, a9, a12). The last pass, 1-sorting, is an
ordinary insertion sort of the entire array (a1,..., a12).
The step-by-step process of
As the example illustrates, the subarrays that Shellsort operates on are initially short; later they are longer but almost replacing pairs of items during the
ordered. In both cases insertion sort works efficiently. shell sorting algorithm.
Shellsort is not stable: it may change the relative order of elements with equal values. It is an adaptive sorting algorithm in
that it executes faster when the input is partially sorted.
Pseudocode
Using Marcin Ciura's gap sequence, with an inner insertion sort.
# Sort an array a[0...n-1].
gaps = [701, 301, 132, 57, 23, 10, 4, 1]
# Start with the largest gap and work down to a gap of 1
foreach (gap in gaps)
https://en.wikipedia.org/wiki/Shellsort 1/5
12/5/2018 Shellsort - Wikipedia
{
# Do a gapped insertion sort for this gap size.
# The first gap elements a[0..gap-1] are already in gapped order
# keep adding one more element until the entire array is gap sorted
for (i = gap; i < n; i += 1)
{
# add a[i] to the elements that have been gap sorted
# save a[i] in temp and make a hole at position i
temp = a[i]
# shift earlier gap-sorted elements up until the correct location for a[i] is found
for (j = i; j >= gap and a[j - gap] > temp; j -= gap)
{
a[j] = a[j - gap]
}
# put temp (the original a[i]) in its correct location
a[j] = temp
}
}
Gap sequences
The question of deciding which gap sequence to use is difficult. Every gap sequence that contains 1 yields a correct sort (as this makes the final pass an ordinary
insertion sort); however, the properties of thus obtained versions of Shellsort may be very different. Too few gaps slows down the passes, and too many gaps produces
an overhead.
The table below compares most proposed gap sequences published so far. Some of them have decreasing elements that depend on the size of the sorted array (N).
Others are increasing infinite sequences, whose elements less than N should be used in reverse order.
Author and
Worst-case
OEIS General term (k ≥ 1) Concrete gaps year of
time complexity
publication
[e.g. when
Shell, 1959[4]
N = 2p]
Frank &
Lazarus,
1960[8]
Hibbard,
A168604
1963[9]
Papernov &
A083318 , prefixed with 1 Stasevich,
1965[10]
A003586 Successive numbers of the form (3-smooth numbers) Pratt, 1971[1]
A003462 , not greater than Pratt, 1971[1]
Incerpi &
Sedgewick,
A036569
1985,[11]
Knuth[3]
Sedgewick,
A036562 , prefixed with 1
1986[6]
Sedgewick,
A033622
1986[12]
Gonnet &
Unknown Baeza-Yates,
1991[13]
Tokuda,
A108870 Unknown
1992[14]
Ciura,
A102549 Unknown (experimentally derived) Unknown
2001[15]
https://en.wikipedia.org/wiki/Shellsort 2/5
12/5/2018 Shellsort - Wikipedia
When the binary representation of N contains many consecutive zeroes, Shellsort using Shell's original gap sequence makes Θ(N2) comparisons in the worst case. For
instance, this case occurs for N equal to a power of two when elements greater and smaller than the median occupy odd and even positions respectively, since they are
compared only in the last pass.
Although it has higher complexity than the O(N log N) that is optimal for comparison sorts, Pratt's version lends itself to sorting networks and has the same
asymptotic gate complexity as Batcher's bitonic sorter.
Gonnet and Baeza-Yates observed that Shellsort makes the fewest comparisons on average when the ratios of successive gaps are roughly equal to 2.2.[13] This is why
their sequence with ratio 2.2 and Tokuda's sequence with ratio 2.25 prove efficient. However, it is not known why this is so. Sedgewick recommends to use gaps that
have low greatest common divisors or are pairwise coprime.[16]
With respect to the average number of comparisons, Ciura's sequence[15] has the best known performance; gaps from 701 were not determined but the sequence can
be further extended according to the recursive formula .
Tokuda's sequence, defined by the simple formula , where , , can be recommended for practical applications.
Computational complexity
The following property holds: after h2-sorting of any h1-sorted array, the array remains h1-sorted.[17] Every h1-sorted and h2-sorted array is also (a1h1+a2h2)-sorted,
for any nonnegative integers a1 and a2. The worst-case complexity of Shellsort is therefore connected with the Frobenius problem: for given integers h1,..., hn with gcd
= 1, the Frobenius number g(h1,..., hn) is the greatest integer that cannot be represented as a1h1+ ... +anhn with nonnegative integer a1,..., an. Using known formulae
for Frobenius numbers, we can determine the worst-case complexity of Shellsort for several classes of gap sequences.[18] Proven results are shown in the above table.
With respect to the average number of operations, none of the proven results concerns a practical gap sequence. For gaps that are powers of two, Espelid computed
this average as .[19] Knuth determined the average complexity of sorting an N-element array with two gaps (h, 1) to be
.[3] It follows that a two-pass Shellsort with h = Θ(N1/3) makes on average O(N5/3) comparisons/inversions/running time. Yao found the average
complexity of a three-pass Shellsort.[20] His result was refined by Janson and Knuth:[21] the average number of comparisons/inversions/running time made during a
Shellsort with three gaps (ch, cg, 1), where h and g are coprime, is in the first pass, in the second pass and
in the third pass. ψ(h, g) in the last formula is a complicated function asymptotically equal to
. In particular, when h = Θ(N7/15) and g = Θ(N1/5), the average time of sorting is O(N23/15).
Based on experiments, it is conjectured that Shellsort with Hibbard's gap sequence runs in O(N5/4) average time,[3] and that Gonnet and Baeza-Yates's sequence
requires on average 0.41NlnN(ln lnN+1/6) element moves.[13] Approximations of the average number of operations formerly put forward for other sequences fail
when sorted arrays contain millions of elements.
The graph below shows the average number of element comparisons in various variants of Shellsort, divided by the theoretical lower bound, i.e. log2N!, where the
sequence 1, 4, 10, 23, 57, 132, 301, 701 has been extended according to the formula .
Applying the theory of Kolmogorov complexity, Jiang, Li, and Vitányi proved the following lower bound for the order of the average number of operations/running
time in a p-pass Shellsort: Ω(pN1+1/p) when p≤log2N and Ω(pN) when p>log2N.[22] Therefore, Shellsort has prospects of running in an average time that
asymptotically grows like NlogN only when using gap sequences whose number of gaps grows in proportion to the logarithm of the array size. It is, however,
unknown whether Shellsort can reach this asymptotic order of average-case complexity, which is optimal for comparison sorts. The lower bound was improved by
Vitányi[23] for every number of passes to where . This result implies for example the Jiang-Li-Vitányi lower bound for all -pass
increment sequences and improves that lower bound for particular increment sequences. In fact all bounds (lower and upper) currently known for the average case
are precisely matched by this lower bound. For example, this gives the new result that the Janson-Knuth upper bound is matched by the resulting lower bound for the
used increment sequence, showing that three pass Shellsort for this increment sequence uses comparisons/inversions/running time. The formula allows
https://en.wikipedia.org/wiki/Shellsort 3/5
12/5/2018 Shellsort - Wikipedia
us to search for increment sequences that yield lower bounds which are unknown; for example an increment sequence for four passes which has a lower bound
greater than for the increment sequence . The lower bound becomes
The worst-case complexity of any version of Shellsort is of higher order: Plaxton, Poonen, and Suel showed that it grows at least as rapidly as
.[24]
Applications
Shellsort performs more operations and has higher cache miss ratio than quicksort. However, since it can be implemented using little code and does not use the call
stack, some implementations of the qsort function in the C standard library targeted at embedded systems use it instead of quicksort. Shellsort is, for example, used
in the uClibc library.[25] For similar reasons, an implementation of Shellsort is present in the Linux kernel.[26]
Shellsort can also serve as a sub-algorithm of introspective sort, to sort short subarrays and to prevent a slowdown when the recursion depth exceeds a given limit.
This principle is employed, for instance, in the bzip2 compressor.[27]
See also
Comb sort
References
1. Pratt, Vaughan Ronald (1979). Shellsort and Sorting Networks (Outstanding Dissertations in the Computer Sciences) (http://www.dtic.mil/get-tr-doc/pdf?AD=AD0
740110). Garland. ISBN 0-8240-4406-1.
2. "Shellsort & Comparisons" (http://www.cs.wcupa.edu/rkline/ds/shell-comparison.html).
3. Knuth, Donald E. (1997). "Shell's method". The Art of Computer Programming. Volume 3: Sorting and Searching (2nd ed.). Reading, Massachusetts: Addison-
Wesley. pp. 83–95. ISBN 0-201-89685-0.
4. Shell, D. L. (1959). "A High-Speed Sorting Procedure" (http://penguin.ewu.edu/cscd300/Topic/AdvSorting/p30-shell.pdf) (PDF). Communications of the ACM. 2
(7): 30–32. doi:10.1145/368370.368387 (https://doi.org/10.1145%2F368370.368387).
5. Some older textbooks and references call this the "Shell-Metzner" sort after Marlene Metzner Norton, but according to Metzner, "I had nothing to do with the sort,
and my name should never have been attached to it." See "Shell sort" (https://xlinux.nist.gov/dads/HTML/shellsort.html). National Institute of Standards and
Technology. Retrieved 2007-07-17.
6. Sedgewick, Robert (1998). Algorithms in C. 1 (3rd ed.). Addison-Wesley. pp. 273–281. ISBN 0-201-31452-5.
7. Kernighan, Brian W.; Ritchie, Dennis M. (1996). The C Programming Language (2nd ed.). Prentice Hall. p. 62. ISBN 7-302-02412-X.
8. Frank, R. M.; Lazarus, R. B. (1960). "A High-Speed Sorting Procedure". Communications of the ACM. 3 (1): 20–22. doi:10.1145/366947.366957 (https://doi.org/1
0.1145%2F366947.366957).
9. Hibbard, Thomas N. (1963). "An Empirical Study of Minimal Storage Sorting". Communications of the ACM. 6 (5): 206–213. doi:10.1145/366552.366557 (https://
doi.org/10.1145%2F366552.366557).
10. Papernov, A. A.; Stasevich, G. V. (1965). "A Method of Information Sorting in Computer Memories" (http://www.mathnet.ru/links/83f0a81df1ec06f76d3683c6cab7
d143/ppi751.pdf) (PDF). Problems of Information Transmission. 1 (3): 63–75.
11. Incerpi, Janet; Sedgewick, Robert (1985). "Improved Upper Bounds on Shellsort". Journal of Computer and System Sciences. 31 (2): 210–224.
doi:10.1016/0022-0000(85)90042-x (https://doi.org/10.1016%2F0022-0000%2885%2990042-x).
12. Sedgewick, Robert (1986). "A New Upper Bound for Shellsort". Journal of Algorithms. 7 (2): 159–173. doi:10.1016/0196-6774(86)90001-5 (https://doi.org/10.101
6%2F0196-6774%2886%2990001-5).
13. Gonnet, Gaston H.; Baeza-Yates, Ricardo (1991). "Shellsort". Handbook of Algorithms and Data Structures: In Pascal and C (2nd ed.). Reading, Massachusetts:
Addison-Wesley. pp. 161–163. ISBN 0-201-41607-7.
14. Tokuda, Naoyuki (1992). "An Improved Shellsort". In van Leeuven, Jan. Proceedings of the IFIP 12th World Computer Congress on Algorithms, Software,
Architecture. Amsterdam: North-Holland Publishing Co. pp. 449–457. ISBN 0-444-89747-X.
15. Ciura, Marcin (2001). "Best Increments for the Average Case of Shellsort" (http://sun.aei.polsl.pl/~mciura/publikacje/shellsort.pdf) (PDF). In Freiwalds, Rusins.
Proceedings of the 13th International Symposium on Fundamentals of Computation Theory. London: Springer-Verlag. pp. 106–117. ISBN 3-540-42487-3.
16. Sedgewick, Robert (1998). "Shellsort". Algorithms in C++, Parts 1–4: Fundamentals, Data Structure, Sorting, Searching. Reading, Massachusetts: Addison-
Wesley. pp. 285–292. ISBN 0-201-35088-2.
17. Gale, David; Karp, Richard M. (1972). "A Phenomenon in the Theory of Sorting". Journal of Computer and System Sciences. 6 (2): 103–115. doi:10.1016/S0022-
0000(72)80016-3 (https://doi.org/10.1016%2FS0022-0000%2872%2980016-3).
18. Selmer, Ernst S. (1989). "On Shellsort and the Frobenius Problem". BIT Numerical Mathematics. 29 (1): 37–40. doi:10.1007/BF01932703 (https://doi.org/10.100
7%2FBF01932703).
19. Espelid, Terje O. (1973). "Analysis of a Shellsort Algorithm". BIT Numerical Mathematics. 13 (4): 394–400. doi:10.1007/BF01933401 (https://doi.org/10.1007%2F
BF01933401).
20. Yao, Andrew Chi-Chih (1980). "An Analysis of (h, k, 1)-Shellsort". Journal of Algorithms. 1 (1): 14–50. doi:10.1016/0196-6774(80)90003-6 (https://doi.org/10.101
6%2F0196-6774%2880%2990003-6).
https://en.wikipedia.org/wiki/Shellsort 4/5
12/5/2018 Shellsort - Wikipedia
21. Janson, Svante; Knuth, Donald E. (1997). "Shellsort with Three Increments". Random Structures and Algorithms. 10 (1–2): 125–142. arXiv:cs/9608105 (https://a
rxiv.org/abs/cs/9608105). doi:10.1002/(SICI)1098-2418(199701/03)10:1/2<125::AID-RSA6>3.0.CO;2-X (https://doi.org/10.1002%2F%28SICI%291098-2418%28
199701%2F03%2910%3A1%2F2%3C125%3A%3AAID-RSA6%3E3.0.CO%3B2-X). CiteSeerX: 10.1.1.54.9911 (http://citeseerx.ist.psu.edu/viewdoc/summary?d
oi=10.1.1.54.9911).
22. Jiang, Tao; Li, Ming; Vitányi, Paul (2000). "A Lower Bound on the Average-Case Complexity of Shellsort" (http://www.cwi.nl/~paulv/papers/shell2.ps). Journal of
the ACM. 47 (5): 905–911. arXiv:cs/9906008 (https://arxiv.org/abs/cs/9906008). doi:10.1145/355483.355488 (https://doi.org/10.1145%2F355483.355488).
CiteSeerX: 10.1.1.6.6508 (http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.6.6508).
23. Vitányi, Paul (2018). "On the average-case complexity of Shellsort". Random Structures and Algorithms. 52 (2): 354–363. doi:10.1002/rsa.20737 (https://doi.org/
10.1002%2Frsa.20737).
24. Plaxton, C. Greg; Poonen, Bjorn; Suel, Torsten (1992). "Improved Lower Bounds for Shellsort". Annual Symposium on Foundations of Computer Science. 33:
226–235. doi:10.1109/SFCS.1992.267769 (https://doi.org/10.1109%2FSFCS.1992.267769). ISBN 0-8186-2900-2. CiteSeerX: 10.1.1.43.1393 (http://citeseerx.is
t.psu.edu/viewdoc/summary?doi=10.1.1.43.1393).
25. Novoa, Manuel III. "libc/stdlib/stdlib.c" (http://git.uclibc.org/uClibc/tree/libc/stdlib/stdlib.c#n700). Retrieved 2014-10-29.
26. "kernel/groups.c" (https://github.com/torvalds/linux/blob/72932611b4b05bbd89fafa369d564ac8e449809b/kernel/groups.c#L105). Retrieved 2012-05-05.
27. Julian Seward. "bzip2/blocksort.c" (https://www.ncbi.nlm.nih.gov/IEB/ToolBox/CPP_DOC/lxr/source/src/util/compress/bzip2/blocksort.c#L519). Retrieved
2011-03-30.
Bibliography
Knuth, Donald E. (1997). "Shell's method". The Art of Computer Programming. Volume 3: Sorting and Searching (2nd ed.). Reading, Massachusetts: Addison-
Wesley. pp. 83–95. ISBN 0-201-89685-0.
Analysis of Shellsort and Related Algorithms (http://www.cs.princeton.edu/~rs/shell/), Robert Sedgewick, Fourth European Symposium on Algorithms,
Barcelona, September 1996.
External links
Animated Sorting Algorithms: Shell Sort (https://web.archive.org/web/20150310043846/http://www.sorting-algorithms.com/shell-sort) at the Wayback Machine
(archived 10 March 2015) – graphical demonstration
Shellsort with gaps 5, 3, 1 as a Hungarian folk dance (https://www.youtube.com/watch?v=CmPA7zE8mx0)
Retrieved from "https://en.wikipedia.org/w/index.php?title=Shellsort&oldid=857438671"
This page was last edited on 31 August 2018, at 18:04 (UTC).
Text is available under the Creative Commons Attribution-ShareAlike License; additional terms may apply. By using this site, you agree to the Terms of Use and
Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization.
https://en.wikipedia.org/wiki/Shellsort 5/5