Sorting

• We have actually already seen two efficient ways to sort:

A kind of “insertion” sort

• Insert the elements into a red-black tree one by one

• Traverse the tree in in-order and collect the keys

• Takes O(nlog(n)) time

Heapsort (Williams, Floyd, 1964)

• Put the elements in an array

• Make the array into a heap

• Do a deletemin and put the deleted element at the last position of the array

Put the elements in the heap (array Q, positions 0 to 10)

79 65 26 24 19 15 29 23 33 40 7

Make the elements into a heap: call Heapify-down on positions 4, 3, 2, 1, 0 in turn. The array after each call (intermediate states are listed when an element sinks more than one level):

Heapify-down(Q,4): 79 65 26 24 7 15 29 23 33 40 19

Heapify-down(Q,3): 79 65 26 23 7 15 29 24 33 40 19

Heapify-down(Q,2): 79 65 15 23 7 26 29 24 33 40 19

Heapify-down(Q,1): 79 7 15 23 65 26 29 24 33 40 19, then 79 7 15 23 19 26 29 24 33 40 65

Heapify-down(Q,0): 7 79 15 23 19 26 29 24 33 40 65, then 7 19 15 23 79 26 29 24 33 40 65, then 7 19 15 23 40 26 29 24 33 79 65

Summary

• We can build the heap in linear time (we already did this analysis)

• We still have to deletemin the elements one by one in order to sort, and that will take O(n log(n)) time
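The heapsort steps can be sketched in Python as follows. This is a minimal illustration, assuming a min-heap stored in a 0-indexed list; the names heapify_down, build_heap and heapsort are chosen here and are not from the slides. Because each deletemin moves the minimum to the end of the shrinking heap, the list ends up in decreasing order.

```python
def heapify_down(q, i, size):
    """Sift q[i] down within q[:size] until the min-heap property holds."""
    while True:
        left, right = 2 * i + 1, 2 * i + 2
        smallest = i
        if left < size and q[left] < q[smallest]:
            smallest = left
        if right < size and q[right] < q[smallest]:
            smallest = right
        if smallest == i:
            return
        q[i], q[smallest] = q[smallest], q[i]
        i = smallest


def build_heap(q):
    """Make q a min-heap in place, bottom-up (linear time overall)."""
    for i in range(len(q) // 2 - 1, -1, -1):
        heapify_down(q, i, len(q))


def heapsort(q):
    """Sort q in place into decreasing order using repeated deletemin."""
    build_heap(q)
    for end in range(len(q) - 1, 0, -1):
        q[0], q[end] = q[end], q[0]   # deletemin: the minimum goes to the last position
        heapify_down(q, 0, end)       # restore the heap on the remaining prefix


q = [79, 65, 26, 24, 19, 15, 29, 23, 33, 40, 7]
build_heap(q)
print(q)   # [7, 19, 15, 23, 40, 26, 29, 24, 33, 79, 65]
heapsort(q)
print(q)   # [79, 65, 40, 33, 29, 26, 24, 23, 19, 15, 7]
```

Using a max-heap with deletemax instead would leave the array in increasing order.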

Quicksort (Hoare 1961)

quicksort

Input: an array A[p..r]

Quicksort(A, p, r)
    if p < r
        then q = Partition(A, p, r)   // q is the position of the pivot element
             Quicksort(A, p, q-1)
             Quicksort(A, q+1, r)

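Example: Partition on the array below, with p at the first position, r at the last, and pivot x = A[r] = 4

2 8 7 1 3 5 6 4    (initial array)

2 1 7 8 3 5 6 4    (1 ≤ 4: exchanged with 8)

2 1 3 8 7 5 6 4    (3 ≤ 4: exchanged with 7)

2 1 3 4 7 5 6 8    (pivot exchanged into its final position)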

Partition(A, p, r)
    x ← A[r]
    i ← p-1
    for j ← p to r-1
        do if A[j] ≤ x
            then i ← i+1
                 exchange A[i] ↔ A[j]
    exchange A[i+1] ↔ A[r]
    return i+1
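The two procedures translate almost line by line into Python. The sketch below assumes a 0-indexed list (the slides do not fix an indexing convention) and keeps the last element as the pivot.

```python
def partition(a, p, r):
    """Partition a[p..r] around the pivot a[r]; return the pivot's final index."""
    x = a[r]
    i = p - 1
    for j in range(p, r):          # j = p .. r-1
        if a[j] <= x:
            i += 1
            a[i], a[j] = a[j], a[i]
    a[i + 1], a[r] = a[r], a[i + 1]
    return i + 1


def quicksort(a, p, r):
    """Sort a[p..r] in place."""
    if p < r:
        q = partition(a, p, r)     # q is the position of the pivot element
        quicksort(a, p, q - 1)
        quicksort(a, q + 1, r)


a = [2, 8, 7, 1, 3, 5, 6, 4]
q = partition(a, 0, len(a) - 1)
print(q, a)   # 3 [2, 1, 3, 4, 7, 5, 6, 8]
quicksort(a, 0, len(a) - 1)
print(a)      # [1, 2, 3, 4, 5, 6, 7, 8]
```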

Analysis

• Running time is proportional to the number of comparisons

• Each pair is compared at most once, so the running time is O(n²)

• In fact, for each n there is an input of size n on which quicksort takes cn² time, so the worst case is Ω(n²)

• Assume that the split is even in each iteration

T(n) = 2T(n/2) + bn

How do we solve linear recurrences like this? (Read Chapter 4.)

Recurrence tree

[Figure: the recurrence tree. T(n) splits into two T(n/2) subproblems, which split into four T(n/4) subproblems, and so on; the tree has log n levels.]

In every level we do bn comparisons, so the total number of comparisons is O(n log n)
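The level-by-level count can be checked numerically. The sketch below evaluates T(n) = 2T(n/2) + bn directly for powers of two, under the assumptions (not stated on the slides) that b = 1 and T(1) = 0; with those choices the recurrence gives exactly bn·log2(n).

```python
from functools import lru_cache

B = 1  # assumed value of the constant b


@lru_cache(maxsize=None)
def t(n):
    """T(n) = 2 T(n/2) + b n with the assumed base case T(1) = 0."""
    if n <= 1:
        return 0
    return 2 * t(n // 2) + B * n


for k in (1, 4, 10):
    n = 2 ** k
    print(n, t(n), B * n * k)   # the last two columns agree: b * n * log2(n)
```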

Observations

• We can’t guarantee good splits

• But intuitively on random inputs we will get good splits

Randomized quicksort

• Use randomized-partition rather than partition

Randomized-partition(A, p, r)
    i ← random(p, r)
    exchange A[r] ↔ A[i]
    return partition(A, p, r)

• On the same input we may get a different running time in each run!

• For one particular input, look at the average of all these running times
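To see the run-to-run variation, the sketch below runs randomized quicksort several times on the same input and counts key comparisons. It assumes Python's random.Random.randint (both ends inclusive) plays the role of random(p, r); the function name and the comparison counter are illustrative additions.

```python
import random


def randomized_quicksort(a, rng):
    """Sort list a in place; return the number of key comparisons made."""
    comparisons = 0

    def partition(p, r):
        nonlocal comparisons
        i = rng.randint(p, r)             # random(p, r), both ends inclusive
        a[r], a[i] = a[i], a[r]
        x = a[r]
        i = p - 1
        for j in range(p, r):
            comparisons += 1              # one comparison of a[j] with the pivot
            if a[j] <= x:
                i += 1
                a[i], a[j] = a[j], a[i]
        a[i + 1], a[r] = a[r], a[i + 1]
        return i + 1

    def sort(p, r):
        if p < r:
            q = partition(p, r)
            sort(p, q - 1)
            sort(q + 1, r)

    sort(0, len(a) - 1)
    return comparisons


rng = random.Random(0)                    # seeded only so the demo is repeatable
data = list(range(200))                   # sorted input: the worst case for a fixed last-element pivot
counts = [randomized_quicksort(data[:], rng) for _ in range(5)]
print(counts)                             # typically five different counts, all far below 200*199/2
```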

Expected # of comparisons

Let X be the # of comparisons

This is a random variable

We want to know E(X)

Expected # of comparisons

Let z1, z2, ..., zn be the elements in sorted order

Let Xij = 1 if zi is compared to zj and 0 otherwise

$$X = \sum_{i=1}^{n-1} \sum_{j=i+1}^{n} X_{ij}$$

$$E(X) = E\left(\sum_{i=1}^{n-1} \sum_{j=i+1}^{n} X_{ij}\right) = \sum_{i=1}^{n-1} \sum_{j=i+1}^{n} E(X_{ij}) \quad \text{(by linearity of expectation)}$$

$$= \sum_{i=1}^{n-1} \sum_{j=i+1}^{n} \Pr\{z_i \text{ is compared to } z_j\}$$

Consider Zij ≡ {zi, zi+1, ..., zj}

Claim: zi and zj are compared if and only if either zi or zj is the first pivot chosen from Zij

Proof: 3 cases, according to the first pivot chosen from {zi, …, zj}:

– The pivot is zi: zi and zj are compared on this partition, and never again.

– The pivot is zj: the same.

– The pivot is some zk with i < k < j: zi and zj are not compared on this partition. The partition separates them, so no future partition uses both.

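$$\Pr\{z_i \text{ is compared to } z_j\} = \Pr\{z_i \text{ or } z_j \text{ is the first pivot chosen from } Z_{ij}\} \quad \text{(just explained)}$$

$$= \Pr\{z_i \text{ is the first pivot chosen from } Z_{ij}\} + \Pr\{z_j \text{ is the first pivot chosen from } Z_{ij}\} \quad \text{(mutually exclusive possibilities)}$$

$$= \frac{1}{j-i+1} + \frac{1}{j-i+1} = \frac{2}{j-i+1}$$

Therefore

$$E(X) = \sum_{i=1}^{n-1} \sum_{j=i+1}^{n} \frac{2}{j-i+1}$$

$$= \sum_{i=1}^{n-1} \sum_{k=2}^{n-i+1} \frac{2}{k} \quad \text{(simplify with a change of variable, } k = j-i+1\text{)}$$

$$\le \sum_{i=1}^{n-1} \sum_{k=1}^{n} \frac{2}{k} \quad \text{(simplify and overestimate, by adding terms)}$$

$$= O(n \lg n)$$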
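As a sanity check on this derivation, the sketch below evaluates the double sum of 2/(j-i+1) exactly with fractions and compares it with the overestimate obtained by extending the inner sum to k = 1..n. The function names are illustrative.

```python
from fractions import Fraction


def expected_comparisons(n):
    """Exact value of the sum over 1 <= i < j <= n of 2 / (j - i + 1)."""
    return sum(Fraction(2, j - i + 1)
               for i in range(1, n)
               for j in range(i + 1, n + 1))


def overestimate(n):
    """The bound after adding terms: (n - 1) copies of the sum of 2/k for k = 1..n."""
    harmonic = sum(Fraction(1, k) for k in range(1, n + 1))
    return 2 * (n - 1) * harmonic


print(expected_comparisons(3))   # 8/3
for n in (10, 100):
    print(n, float(expected_comparisons(n)), float(overestimate(n)))
```

For n = 3 the value 8/3 can be confirmed by hand: the first partition makes 2 comparisons, and one more comparison is needed unless the middle element is chosen as the pivot, which happens with probability 1/3.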

Lower bound for sorting in the comparison model

A lower bound

• Comparison model: We assume that the only operations from which we deduce order among keys are comparisons

• Then we prove that we need Ω(n log n) comparisons in the worst case

Model the algorithm as a decision tree

Important Observations

• Every algorithm can be represented as a (binary) tree like this

• Each path corresponds to a run on some input

• The worst case # of comparisons corresponds to the longest path

The lower bound

Let d be the length of the longest path

$$n! \le \#\text{leaves} \le 2^d$$

$$\log_2(n!) \le d$$

Lower Bound for Sorting

• Any sorting algorithm based on comparisons between elements requires Ω(n log n) comparisons in the worst case, since log2(n!) = Θ(n log n).
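The bound d ≥ log2(n!) is easy to tabulate for small n. The sketch below computes ⌈log2(n!)⌉ with exact integer arithmetic (for m ≥ 1, ⌈log2(m)⌉ equals the bit length of m − 1) and prints it next to n·log2(n) for comparison; the function name is illustrative.

```python
import math


def min_worst_case_comparisons(n):
    """ceil(log2(n!)): a lower bound on the depth of any decision tree that sorts n keys."""
    return (math.factorial(n) - 1).bit_length()


for n in (3, 4, 5, 10, 100):
    print(n, min_worst_case_comparisons(n), round(n * math.log2(n), 1))
```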

Beating the lower bound

• We can beat the lower bound if we can deduce order relations between keys not by comparisons

Examples:

• Count sort

• Radix sort

Linear time sorting

• Or assume something about the input: random, “almost sorted”
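As an illustration of deducing order without comparing keys to each other, here is a minimal count sort sketch. It assumes the keys are integers in range(k), an assumption the slides do not spell out, and runs in O(n + k) time by using each key as an array index.

```python
def counting_sort(keys, k):
    """Stable sort of integers drawn from range(k), with no key-to-key comparisons."""
    count = [0] * k
    for x in keys:
        count[x] += 1
    # Turn counts into starting positions: count[v] = number of keys smaller than v.
    start = 0
    for v in range(k):
        count[v], start = start, start + count[v]
    out = [None] * len(keys)
    for x in keys:
        out[count[x]] = x      # place x at the next free slot for its value
        count[x] += 1
    return out


print(counting_sort([3, 1, 4, 1, 5, 9, 2, 6], 10))   # [1, 1, 2, 3, 4, 5, 6, 9]
```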

Sorting an almost sorted input

• Suppose we know that the input is “almost” sorted

• Let I be the number of “inversions” in the input: The number of pairs ai,aj such that i<j and ai>aj

Example

1, 4, 5, 8, 3    I=3

8, 7, 5, 3, 1    I=10

• Think of “insertion sort” using a list

• When we insert the next item ak, how deep does it get into the list?

• As deep as the number of inversions (ai, ak) with i < k; let's call this Ik

Analysis

The running time is:

$$O\Big(\sum_{j=1}^{n} I_j + n\Big) = O(I + n)$$
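The link between insertion depth and inversions can be checked directly. The sketch below inserts the items one by one, scanning from the end of the sorted part, and records how deep each item goes; the depths sum to the number of inversions I. A Python list stands in for the linked list here, so the insert call itself is not constant time as it would be in a real list.

```python
def insertion_sort_with_depths(a):
    """Return the sorted items and, for each inserted item, how far from the end it went."""
    out = []
    depths = []
    for x in a:
        pos = len(out)
        while pos > 0 and out[pos - 1] > x:   # walk back past every larger item
            pos -= 1
        depths.append(len(out) - pos)         # the number of earlier items larger than x
        out.insert(pos, x)
    return out, depths


for data in ([1, 4, 5, 8, 3], [8, 7, 5, 3, 1]):
    result, depths = insertion_sort_with_depths(data)
    print(result, depths, "I =", sum(depths))
```

On the two example inputs this prints I = 3 and I = 10.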

Thoughts

• When I=Ω(n²) the running time is Ω(n²)

• But we would like it to be O(n log(n)) for any input, and faster when I is small

Finger red black trees

Finger tree

Take a regular search tree and reverse the direction of the pointers on the rightmost spine

We go up from the last leaf until we find the subtree containing the item and we descend into it

Finger trees

Say we search for a position at distance d from the end

Then we go up to height O(log(d))

So search for the dth position takes O(log(d)) time

Insertions and deletions still take O(log n) worst case time but O(log(d)) amortized time

Back to sorting

• Suppose we implement the insertion sort using a finger search tree

• When we insert item k then d=O(Ik) and it takes O(log(Ik)) time

Analysis

The running time is:

$$O\Big(\sum_{j=1}^{n} \log(I_j)\Big)$$

Since ∑Ij = I and log is concave, this is at most

$$O\Big(n \log \frac{I}{n}\Big)$$

Selection

Find the kth element

Randomized selection

Randomized-select(A, p, r, k)
    if p = r then return A[p]
    q ← randomized-partition(A, p, r)
    j ← q-p+1
    if j = k then return A[q]
    else if k < j then return randomized-select(A, p, q-1, k)
    else return randomized-select(A, q+1, r, k-j)
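A direct Python rendering of this procedure is sketched below. It assumes a 0-indexed list and a 1-based rank k, and it uses random.randint (both ends inclusive) for the pivot choice; the rng parameter is an illustrative addition so that runs can be made repeatable.

```python
import random


def randomized_partition(a, p, r, rng):
    """Move a random element of a[p..r] to the end, then partition around it."""
    i = rng.randint(p, r)
    a[r], a[i] = a[i], a[r]
    x = a[r]
    i = p - 1
    for j in range(p, r):
        if a[j] <= x:
            i += 1
            a[i], a[j] = a[j], a[i]
    a[i + 1], a[r] = a[r], a[i + 1]
    return i + 1


def randomized_select(a, p, r, k, rng=random):
    """Return the k-th smallest element of a[p..r] (k = 1 is the minimum); permutes a."""
    if p == r:
        return a[p]
    q = randomized_partition(a, p, r, rng)
    j = q - p + 1                  # the pivot is the j-th smallest of a[p..r]
    if j == k:
        return a[q]
    if k < j:
        return randomized_select(a, p, q - 1, k, rng)
    return randomized_select(a, q + 1, r, k - j, rng)


data = [7, 10, 1, 3, 2, 11]
print(randomized_select(data[:], 0, len(data) - 1, 3))   # 3
```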

Expected running time

With probability 1/n, A[p..q] contains exactly k elements, for k=1,2,…,n

$$E(T(n)) \le O(n) + \frac{1}{n} \sum_{k=1}^{n} E\big(T(\max(k-1,\, n-k))\big)$$

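Assume n is even

$$E(T(n)) \le O(n) + \frac{1}{n} \sum_{k=1}^{n} E\big(T(\max(k-1,\, n-k))\big)$$

$$= O(n) + \frac{1}{n} \Big[ E(T(n-1)) + E(T(n-2)) + \cdots + E(T(n/2)) + E(T(n/2)) + \cdots + E(T(n-1)) \Big]$$

Each of the terms E(T(n/2)), …, E(T(n-1)) appears exactly twice.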

In general

$$E(T(n)) \le O(n) + \frac{1}{n} \sum_{k=1}^{n} E\big(T(\max(k-1,\, n-k))\big)$$

$$\le O(n) + \frac{2}{n} \sum_{k=\lfloor n/2 \rfloor}^{n-1} E(T(k))$$

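Solve by “substitution”

$$E(T(n)) \le O(n) + \frac{2}{n} \sum_{k=\lfloor n/2 \rfloor}^{n-1} E(T(k))$$

Assume E(T(k)) ≤ ck for k < n, and prove E(T(n)) ≤ cn. Write the O(n) term as an:

$$E(T(n)) \le an + \frac{2}{n} \sum_{k=\lfloor n/2 \rfloor}^{n-1} ck$$

$$= an + \frac{2c}{n} \Big( \sum_{k=1}^{n-1} k - \sum_{k=1}^{\lfloor n/2 \rfloor - 1} k \Big)$$

$$= an + \frac{2c}{n} \Big( \frac{(n-1)n}{2} - \frac{(\lfloor n/2 \rfloor - 1)\lfloor n/2 \rfloor}{2} \Big)$$

$$\le an + \frac{2c}{n} \Big( \frac{(n-1)n}{2} - \frac{(n/2 - 2)(n/2 - 1)}{2} \Big)$$

$$= an + \frac{c}{n} \Big( \frac{3n^2}{4} + \frac{n}{2} - 2 \Big)$$

$$\le an + \frac{3cn}{4} + \frac{c}{2}$$

$$= cn - \Big( \frac{cn}{4} - \frac{c}{2} - an \Big)$$

Choose c > 4a: then cn/4 − c/2 − an ≥ 0 for all n ≥ 2c/(c − 4a), so E(T(n)) ≤ cn for those n (smaller n are handled as the base case)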

Selection in linear worst case time

Blum, Floyd, Pratt, Rivest, and Tarjan (1973)

Split the elements into 5-tuples

Sort the tuples

Recursively find the median of the medians


Partition around the median of the medians

Continue recursively with the side that contains the kth element

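Neither side can be large: each side contains at most about ¾n elements

The reason: at least half of the tuple medians are ≤ the median of the medians, and in each of those tuples 3 of the 5 elements are ≤ their own median. So roughly 3n/10 elements are ≤ the median of the medians, and symmetrically roughly 3n/10 are ≥ it. Each side of the partition therefore has at most about 7n/10 ≤ ¾n elements.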

Analysis

$$T(n) \le O(n) + T\Big(\frac{3}{4}n\Big) + T\Big(\frac{1}{5}n\Big)$$

Since 3/4 + 1/5 < 1, this solves to

$$T(n) = O(n)$$
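The whole procedure fits in a short Python sketch. This version is a simplification and not necessarily how the slides implement it: it builds new lists instead of partitioning in place, and it splits three ways (smaller, equal, larger) so that duplicate keys are handled. The function name is illustrative.

```python
def mom_select(items, k):
    """Return the k-th smallest element (k = 1 is the minimum) in worst-case linear time."""
    if len(items) <= 5:
        return sorted(items)[k - 1]
    # Split into 5-tuples, sort each tuple, and take its median.
    tuples = [sorted(items[i:i + 5]) for i in range(0, len(items), 5)]
    medians = [t[len(t) // 2] for t in tuples]
    # Recursively find the median of the medians and partition around it.
    pivot = mom_select(medians, (len(medians) + 1) // 2)
    smaller = [x for x in items if x < pivot]
    equal = [x for x in items if x == pivot]
    larger = [x for x in items if x > pivot]
    # Continue with the side that contains the k-th element.
    if k <= len(smaller):
        return mom_select(smaller, k)
    if k <= len(smaller) + len(equal):
        return pivot
    return mom_select(larger, k - len(smaller) - len(equal))


data = [26, 4, 70, 19, 90, 21, 67, 20, 34, 89, 73, 77]
print(mom_select(data, 5))   # 26
```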

Order statistics, a dynamic version

rank and select

The dictionary ADT

• Insert(x,D)

• Delete(x,D)

• Find(x,D): Returns a pointer to x if x ∊ D, and a pointer to the successor or predecessor of x if x is not in D

Suppose we want to add to the dictionary ADT

• Select(k,D): Returns the kth element in the dictionary: an element x such that k-1 elements are smaller than x

Select(5,D)

[Figure: Select(5,D) on a dictionary D holding the keys 4, 19, 20, 21, 26, 34, 67, 70, 73, 77, 89, 90; the 5th smallest key is 26.]

Can we still use a red-black tree?

For each node v store # of leaves in the subtree of v

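[Figure: the tree with the 12 keys 4, 19, 20, 21, 26, 34, 67, 70, 73, 77, 89, 90 at its leaves; every node stores the number of leaves in its subtree, so the root stores 12.]

Select(7,T)

The search proceeds down the tree as Select(7,T) → Select(3, ·) → Select(1, ·): each time it moves into a right subtree, the size of the skipped left subtree is subtracted from k.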

Select(i,T)

Select(i,T): return Select(i, root(T))

Select(k,v):
    if v is a leaf then return v
    if k ≤ (v.left).size
        then return Select(k, v.left)
        else return Select(k – (v.left).size, v.right)

O(logn) worst case time
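The descent can be tried out on a small tree. The sketch below is not a red-black tree: as a stand-in it builds a balanced, leaf-oriented tree from a sorted list, with each node storing the number of leaves below it, which is all that Select needs. The class and function names are illustrative, and the keys are the twelve that appear in the example figures.

```python
class Node:
    """Leaf-oriented tree node; size is the number of leaves in the subtree."""

    def __init__(self, key=None, left=None, right=None):
        self.key, self.left, self.right = key, left, right
        self.size = 1 if left is None else left.size + right.size


def build(keys):
    """Balanced tree whose leaves hold the sorted keys, left to right."""
    if len(keys) == 1:
        return Node(key=keys[0])
    mid = len(keys) // 2
    return Node(left=build(keys[:mid]), right=build(keys[mid:]))


def select(k, v):
    """Return the key of the k-th leaf (k = 1 is the smallest) in the subtree of v."""
    if v.left is None:                 # v is a leaf
        return v.key
    if k <= v.left.size:
        return select(k, v.left)
    return select(k - v.left.size, v.right)


root = build([4, 19, 20, 21, 26, 34, 67, 70, 73, 77, 89, 90])
print(root.size, select(5, root), select(7, root))   # 12 26 67
```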

Rank(x,T)

• Return the index of x in T

[Figure: the tree with a leaf x marked.] Need to return 9

Sum up the sizes of the subtrees to the left of the path from the root to x (plus 1 for x itself)

• Write the pseudocode
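One possible way to approach Rank is sketched below in Python. It assumes each node also has a parent pointer (an assumption; the slides do not say how the path is followed) and walks from the leaf x up to the root, adding the size of the left sibling whenever it arrives from a right child. A balanced leaf-oriented tree again stands in for the red-black tree, and all names are illustrative.

```python
class Node:
    """Leaf-oriented tree node with a parent pointer; size = number of leaves below."""

    def __init__(self, key=None, left=None, right=None):
        self.key, self.left, self.right, self.parent = key, left, right, None
        if left is None:
            self.size = 1
        else:
            self.size = left.size + right.size
            left.parent = right.parent = self


def build(keys, leaves):
    """Balanced tree over sorted keys; records each leaf in the dict leaves."""
    if len(keys) == 1:
        leaves[keys[0]] = Node(key=keys[0])
        return leaves[keys[0]]
    mid = len(keys) // 2
    return Node(left=build(keys[:mid], leaves), right=build(keys[mid:], leaves))


def rank(x):
    """Index of leaf x: 1 plus the sizes of the subtrees left of the root-to-x path."""
    r = 1
    while x.parent is not None:
        if x is x.parent.right:
            r += x.parent.left.size    # everything in the left sibling precedes x
        x = x.parent
    return r


leaves = {}
root = build([4, 19, 20, 21, 26, 34, 67, 70, 73, 77, 89, 90], leaves)
print(rank(leaves[73]))   # 9
```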

Insertion and deletions

• Consider insertion; deletion is similar

Insert

Insert (cont)

Easy to maintain through rotations

size(x) ← size(B) + size(C)

size(y) ← size(A) + size(x)
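The two size updates can be seen in a small sketch of a rotation. It assumes the rotation in question is a right rotation at x, so that before it x has left child y (with subtrees A and B) and right subtree C, and afterwards y has left subtree A and right child x (with subtrees B and C); the figure is not reproduced in the text, so this orientation is an assumption.

```python
class Node:
    """Tree node storing the number of leaves in its subtree."""

    def __init__(self, left=None, right=None):
        self.left, self.right = left, right
        self.size = 1 if left is None else left.size + right.size


def rotate_right(x):
    """Rotate x's left child y above x and return y, the new subtree root."""
    y = x.left
    a, b, c = y.left, y.right, x.right
    x.left, x.right = b, c
    y.left, y.right = a, x
    x.size = b.size + c.size       # size(x) <- size(B) + size(C)
    y.size = a.size + x.size       # size(y) <- size(A) + size(x)
    return y


a = Node(Node(), Node())                    # subtree A with 2 leaves
b = Node()                                  # subtree B: a single leaf
c = Node(Node(), Node(Node(), Node()))      # subtree C with 3 leaves
x = Node(Node(a, b), c)
y = rotate_right(x)
print(y.size, x.size)   # 6 4
```

Only the two nodes involved in the rotation change size, and x must be updated before y because the new size of y depends on it.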

Summary

• Insertion and deletion and other dictionary operations still take O(log n) time
