Post on 17-Dec-2015
Starting example
Sorting: We have a list of n numbers. We want to sort them in increasing order.
An algorithm: merge sort
Merge sort: Cut the list into two halves of equal size.
Suppose 2 divides n for simplicity. Suppose the two halves are sorted: merge them.
Use two pointers, one for each half, to scan them, appending the smaller of the two current elements at each step.
How to sort each of the two halves? Recursively.
Complexity?
Suppose this algorithm takes T(n) time for an input with n numbers.
Thus each of the two halves takes T(n/2) time. The merging?
Scanning n elements, with an O(1)-time operation needed for each.
Total amount of time: T(n) = 2T(n/2) + O(n) = O(n log n).
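The algorithm can be sketched in Python (a minimal illustration; the function name merge_sort is ours):

```python
def merge_sort(a):
    """Sort a list of numbers in increasing order; T(n) = 2T(n/2) + O(n)."""
    if len(a) <= 1:
        return a
    mid = len(a) // 2
    left, right = merge_sort(a[:mid]), merge_sort(a[mid:])
    # Merge: two pointers, one scanning each sorted half.
    merged, i, j = [], 0, 0
    while i < len(left) and j < len(right):
        if left[i] <= right[j]:
            merged.append(left[i]); i += 1
        else:
            merged.append(right[j]); j += 1
    # One half is exhausted; append the rest of the other.
    return merged + left[i:] + right[j:]
```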
A general method for designing algorithms: divide and conquer
Breaking the problem into subproblems that are themselves smaller instances of the same type of problem
Recursively solving these subproblems
Appropriately combining their answers
Complexity
Running time on an input of size n: T(n). Break the problem into a subproblems, each of the same size n/b. In general, a is not necessarily equal to b.
Time to recursively solve each subproblem: T(n/b). Time for breaking the problem (into subproblems) and combining the answers: O(n^d).
Thus T(n) = a T(n/b) + O(n^d).
Master theorem
Suppose T(n) = a T(n/b) + O(n^d), where a > 0, b > 1, and d >= 0 are all constants. Then
  T(n) = O(n^d)            if d > log_b a,
  T(n) = O(n^d log n)      if d = log_b a,
  T(n) = O(n^{log_b a})    if d < log_b a.
Proof in textbook. Not required. But you need to know how to apply it.
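The three cases can be checked mechanically; master_theorem below is our own helper (not from the textbook) that compares d with the critical exponent log_b a:

```python
import math

def master_theorem(a, b, d):
    """Asymptotics of T(n) = a*T(n/b) + O(n^d), returned as a string.
    Assumes constants a > 0, b > 1, d >= 0."""
    c = math.log(a, b)              # critical exponent log_b(a)
    if d > c:
        return f"O(n^{d})"          # combining cost dominates
    if abs(d - c) < 1e-12:
        return f"O(n^{d} log n)"    # balanced case
    return f"O(n^{round(c, 2)})"    # recursion dominates
```

For example, merge sort (a = 2, b = 2, d = 1) hits the balanced case.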
Selection
Problem: Given a list of n numbers, find the k-th smallest.
We can sort the list, which needs O(n log n) time. Can we do better, say, linear time? After all, sorting gives a lot more information than we requested.
Extra information is not always a waste: consider dynamic programming, where solutions to subproblems are also produced.
Idea of divide and conquer
Divide the numbers into 3 parts
S_L, S_v, S_R. Depending on the size of each part, we know which part the k-th smallest element lies in.
Then search in that part.
Question: Which pivot v to choose?
Pivot
Suppose we use a number v in the given list as a pivot.
As said, we divide the list into three parts.
S_L: those numbers smaller than v. S_v: those numbers equal to v. S_R: those numbers larger than v.
After the partition
The division is simple: just scan the list and put elements into the corresponding part. O(n) time.
To select the k-th smallest value, it becomes
  selection(S_L, k)                    if k <= |S_L|,
  v                                    if |S_L| < k <= |S_L| + |S_v|,
  selection(S_R, k - |S_L| - |S_v|)    if k > |S_L| + |S_v|.
Complexity?
Divide and conquer
Note: though there are two subproblems (of sizes |S_L| and |S_R|), we need to solve only one of them. Compare: in quicksort, we need to sort both sublists!
Complexity: T(n) = T(max(|S_L|, |S_R|)) + O(n).
A new issue: |S_L| and |S_R| are not determined. They depend on the pivot.
The similarity to quicksort tells us: a random pivot performs well. It's away from either end by at least n/4 with constant probability.
To be more precise, it's in [n/4, 3n/4] with probability 1/2. And in this case, the recursion becomes T(n) <= T(3n/4) + O(n).
Thus we can use the following simple strategy: Pick a random pivot, do the recursion.
Each random pivot falls in [n/4, 3n/4] with probability 1/2, so on average every other pivot is good. After each good pivot the problem size shrinks by a factor of 3/4, so log_{4/3} n good pivots are enough to make the problem size drop to 1. Thus the expected running time is O(n).
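The strategy can be sketched as follows (a minimal Python quickselect; the names quickselect, s_l, s_v, s_r are ours):

```python
import random

def quickselect(s, k):
    """Return the k-th smallest element of s (k = 1, 2, ...).
    Random pivot; expected running time O(n)."""
    v = random.choice(s)               # random pivot from the list
    s_l = [x for x in s if x < v]      # smaller than the pivot
    s_v = [x for x in s if x == v]     # equal to the pivot
    s_r = [x for x in s if x > v]      # larger than the pivot
    if k <= len(s_l):
        return quickselect(s_l, k)     # answer lies in the left part
    if k <= len(s_l) + len(s_v):
        return v                       # answer is the pivot itself
    return quickselect(s_r, k - len(s_l) - len(s_v))
```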
Matrix multiplication
Recall: the product of two n x n matrices is another n x n matrix.
Question: how fast can we multiply two n x n matrices?
Recall: (AB)_{ij} = sum_k A_{ik} B_{kj}, where (AB)_{ij} is the (i,j) entry of the matrix AB. Similar for A_{ik}, B_{kj}.
This takes O(n^3) multiplications (of numbers).
For a long time, people thought this was the best possible.
Until Strassen came up with the following.
If we break each matrix into four n/2 x n/2 blocks,
then the product is just block multiplication:
8 matrix multiplications of dimension n/2 x n/2, plus O(n^2) additions. T(n) = 8T(n/2) + O(n^2) = O(n^3): no improvement yet.
However, Strassen observed that we can actually use only 7 (instead of 8) multiplications of matrices with dimension n/2 x n/2. Then T(n) = 7T(n/2) + O(n^2) = O(n^{log_2 7}) = O(n^{2.81}).
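Strassen's seven products can be written out concretely for the 2x2 case; scalar entries here stand in for the n/2 x n/2 blocks of the recursion (a minimal sketch with our own variable names):

```python
def strassen_2x2(A, B):
    """Multiply two 2x2 matrices with 7 multiplications instead of 8.
    A and B are ((a, b), (c, d)) tuples of entries."""
    (a, b), (c, d) = A
    (e, f), (g, h) = B
    p1 = a * (f - h)            # the 7 Strassen products
    p2 = (a + b) * h
    p3 = (c + d) * e
    p4 = d * (g - e)
    p5 = (a + d) * (e + h)
    p6 = (b - d) * (g + h)
    p7 = (a - c) * (e + f)
    # Recombine with additions/subtractions only.
    return ((p5 + p4 - p2 + p6, p1 + p2),
            (p3 + p4, p1 + p5 - p3 - p7))
```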
Best in theory? O(n^{2.373}).
Conjecture: O(n^2)!
In practice? People still use the O(n^3) algorithm most of the time. (For example, in MATLAB.) It's simple and robust.
Multiplication of polynomials
Many applications need to multiply two polynomials, e.g. signal processing.
A degree-d polynomial is A(x) = a_0 + a_1 x + ... + a_d x^d.
The degree is at most d. (The degree is exactly d if a_d != 0.) The summation of two polynomials is easy:
A(x) + B(x) = (a_0 + b_0) + (a_1 + b_1) x + ... + (a_d + b_d) x^d,
which still has degree at most d.
Multiplication
Multiplication of two polynomials: a different story.
A(x) B(x) = c_0 + c_1 x + ... + c_{2d} x^{2d}, where c_k = sum_{i=0}^{k} a_i b_{k-i} (convolution).
Try an example. The product can have degree 2d. If we directly use this formula to multiply two polynomials, it takes O(d^2) time.
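The convolution formula translates directly into code (a minimal Python sketch; the name poly_mul_naive is ours):

```python
def poly_mul_naive(a, b):
    """Multiply polynomials given as coefficient lists (a[i] is the x^i
    coefficient): c_k = sum_i a_i * b_{k-i}.  O(d^2) time."""
    c = [0] * (len(a) + len(b) - 1)    # product has degree deg(a) + deg(b)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            c[i + j] += ai * bj        # a_i x^i * b_j x^j contributes to x^{i+j}
    return c
```

For example, (1 + 2x)(3 + x) = 3 + 7x + 2x^2.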
What determines/represents a polynomial? Let's switch to degree-(n-1) for later simplicity. We already used n coefficients to represent a polynomial: the coefficient representation.
Other ways? Yes: by giving values on n (distinct) points.
Point-value representation. [Fact] A degree-(n-1) polynomial is uniquely determined by its values on n distinct points.
Reason
We have n coefficients to determine.
We know the values on n distinct points x_0, ..., x_{n-1}.
How to get the coefficients? Just solve the system of n linear equations:
A(x_j) = a_0 + a_1 x_j + ... + a_{n-1} x_j^{n-1}, for j = 0, ..., n-1.
In matrix form:
  [ 1  x_0      x_0^2      ...  x_0^{n-1}     ]   [ a_0     ]   [ A(x_0)     ]
  [ 1  x_1      x_1^2      ...  x_1^{n-1}     ] . [ a_1     ] = [ A(x_1)     ]
  [ ...                                       ]   [ ...     ]   [ ...        ]
  [ 1  x_{n-1}  x_{n-1}^2  ...  x_{n-1}^{n-1} ]   [ a_{n-1} ]   [ A(x_{n-1}) ]
The matrix is a Vandermonde matrix, which is invertible when the points are distinct. Thus the system has a unique solution for the n unknown coefficients a_0, ..., a_{n-1}.
Advantage of point-value representation? It makes the multiplication extremely easy:
Though computing A(x) B(x) in coefficient form is hard, for any fixed point x_j, A(x_j) B(x_j) is simply a multiplication of two numbers. So given
A(x_0), ..., A(x_{N-1}) and B(x_0), ..., B(x_{N-1}), it's really easy to get
C(x_j) = A(x_j) B(x_j) for j = 0, ..., N-1: O(N) time.
Thus we have the following interesting idea for polynomial multiplication...
Go to an easy world and come back
HK used to have many industries. Later it was found that the mainland has less expensive labor. So:
companies moved to the mainland, did the work there, and then shipped the products back to HK to sell.
This is worth doing if it's cheaper on the mainland and the traveling/shipping is not expensive either. Both turned out to be true.
Traveling
From coefficient representation to point-value representation: Evaluate the two polynomials on N points.
--- evaluation.
From point-value representation to coefficient representation: Compute the product polynomial back from its point values.
--- interpolation.
Both can be done in O(N log N) time.
Number of points
How many points do we need to evaluate on? We later want to reconstruct the product polynomial C = A B, which has degree 2n - 2. So 2n - 1 (point, value) pairs are enough to recover C.
We'll evaluate A and B on N points x_0, ..., x_{N-1}, and get C(x_j) = A(x_j) B(x_j).
N = 2n - 1 points are enough. We use N = 2n for convenience. Then recover C from the pairs (x_j, C(x_j)).
Now evaluation on one point needs O(N) time, so evaluations on N points need O(N^2) time.
Too bad. We want O(N log N). Important fact: cost(evaluation on N points) can be cheaper than N x cost(evaluation on 1 point), if we choose the points carefully.
And it turns out that the chosen points also make the (later) interpolation easier.
This powerful tool is called the Fourier Transform.
Complex roots of unity
1 has a complex N-th root omega = e^{2 pi i / N}, satisfying omega^N = 1. Discrete Fourier Transform (DFT):
A(omega^j) = sum_{k=0}^{N-1} a_k omega^{jk}, for j = 0, 1, ..., N-1. Try it now.
Note: the DFT evaluates the polynomial A on 1, omega, omega^2, ..., omega^{N-1}.
So our evaluation task is just a DFT.
All the essences are here...
We want to evaluate A and B on omega^0, omega^1, ..., omega^{N-1}. Suppose for simplicity that N is a power of 2. For A(x) = a_0 + a_1 x + ... + a_{N-1} x^{N-1}, define two new polynomials:
A_even(y) = a_0 + a_2 y + a_4 y^2 + ...
A_odd(y) = a_1 + a_3 y + a_5 y^2 + ...
Then A(x) = A_even(x^2) + x A_odd(x^2). Divide and conquer.
So we can evaluate A on omega^0, ..., omega^{N-1} by evaluating A_even and A_odd on the squares (omega^j)^2, which are exactly the (N/2)-th roots of unity.
So we evaluate A_even and A_odd on only N/2 points (instead of N). Recursion:
T(N): the time to compute A(omega^j) for j = 0, ..., N-1. Rewriting the recursion:
T(N) = 2T(N/2) + O(N). Applying the master theorem: T(N) = O(N log N). Since N = 2n, the cost of evaluating A is O(n log n), as claimed.
Interpolation
How about interpolation? I.e., to get the coefficients from the point values.
Almost the same process!
a_k = (1/N) sum_{j=0}^{N-1} A(omega^j) omega^{-jk}.
DFT matrix: M with entries M_{jk} = omega^{jk}. What's the inverse of the matrix M?
Pretty much the same matrix: replace omega with omega^{-1},
...and then divide the whole matrix by N for normalization.
Or
The DFT matrix, divided by sqrt(N), is unitary. Namely
M conj(M)^T = N I, where (.)^T is the transpose and conj(.) is the complex conjugate. The DFT matrix is symmetric: M^T = M. So taking the complex conjugate (and dividing by N) gives the inverse: M^{-1} = (1/N) conj(M).
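The omega -> omega^{-1} and divide-by-N rule can be illustrated with a direct O(N^2) inverse DFT (our own sketch; the O(N log N) version uses the same recursion as evaluation):

```python
import cmath

def inverse_dft(values):
    """Recover coefficients a_0, ..., a_{N-1} from values on the N-th
    roots of unity: same sum as the DFT, with omega^{-1} instead of
    omega, divided by N."""
    n = len(values)
    return [sum(v * cmath.exp(-2j * cmath.pi * m * k / n)   # omega^{-mk}
                for k, v in enumerate(values)) / n
            for m in range(n)]
```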