Tonga Institute of Higher Education Design and Analysis of Algorithms

Tonga Institute of Higher Education

Design and Analysis of Algorithms

IT 254

Lecture 5:

Advanced Design Techniques

Advanced Design Techniques

• Other methods of design, like divide-and-conquer and randomization, can usually be applied to many problems.

• There are newer techniques that are a little more complicated, but allow computer scientists to solve harder problems. We will look at two methods.

• "Greedy" programming is a way to optimize a solution where you must make a choice. In this method, you make the best choice at each time and by the end you will have made the best choice overall

• "Amortized Analysis" is a tool for analyzing algorithms that allows us to look at algorithms where the running time changes depending on input, and then to find the average "amortized" running time.

Greedy Algorithms• Optimization problems is a category of

problems that focus on finding the best solution.

• In these problems, there can be many possible solutions. Each solution has a value and the goal is to find the solution with the optimal (maybe maximum or minimum) value.

• Greedy algorithms are a technique used to solve these problems.

• A greedy algorithm will always make the choice that looks best at the moment it makes it.

• In mathematical words: It makes the locally optimal choice, hoping that it will lead to the globally optimal choice.

Greedy Algorithm: Activity Selection

• The first example we will look at is an activity selector.

• Suppose there is a set S = { 1,2, … n } of n proposed activities that all need to use a resource.

• For example, a classroom which can only be used by one class at a time.

• Each activity "i" has a start time and a finish time so that starti < finishi

• The activity selection problem is to select a maximum size set of activities that all can use a resource.

• For this problem, we assume that all activities are sorted in order of increasing finish time:– f1 < f2 < f3 < … < fn

Activity Selector• We can use the following

pseudo-code to demonstrate a greedy algorithm to solve the problem.

• Here "A" is the set of all activities. "j" is the last activity added. Since activities are listed in sorted order, fj is always the maximum finishing time of any activity in A.

• So fj = max { fk: k Э A }• Lines 3-4 select the first

activity and put it in A• Lines 5-8 add an activity in

the start time that is after the finishing time of the last thing added.

• This is a O(n) algorithm if activities are sorted before the algorithm

1:Greedy-Selector

2: n = length of Set

3: A = new Array(1)

4: j = 1

5: for i = 2 to n

6: if si > fj

7: A = A υ { i }

8: j = i

9: return A

Activity Selector

• The activity picked next by the algorithm is always the one with the earliest finish time that can be scheduled without conflicts.

• The activity picked is thus the "greedy" choice, because it does not look at all choices, but only the best one it sees right in front of it.

• The greedy choice maximizes the amount of unscheduled time.

Greedy Properties

• How do we know if the Greedy Algorithm is the best choice for solving a problem?

• The first important thing to know about is if a globally optimal solution can be gotten by making locally optimal solutions.

• This means, can we get the best answer for the whole problem, by making good answers for smaller problems.

• This means that we can sometimes use induction to help us see if the problem can be solved with a greedy algorithm.

Properties: Optimal Substructure

• "Optimal substructure" is an important property for a problem to have if you want to solve it using a greedy method

• Optimal substructure means the best solution to a problem is made of the best solutions to sub-problems inside of it

• The Activity Selector was an example of this. • The sub problem was: which activity added to the

list will maximize the size of the list right now. • After solving each of these sub-problems, we

have an answer that solves the whole problem

Greedy Algorithms: Compression

• "Huffman coding" is a method to compress data.

• The main idea of data compressions is that you start with an original thing x (text, picture, movie) and you want to change x with a function C(x) so that C(x) has fewer bits than x

• But you also want a way to change C(x) back into x, with a function D(x)

• Thus: D(C(x)) = x• C(x) = compress x• D(x) = decompress x

Compression

• There are two types of compression algorithms,– Lossless Compression: D(C(x)) = x– Lossy Compression: D(C(x)) ≈ x

• Things that compress text or programs should be lossless, but things like pictures and sound can be lossy– (You don’t want words from a book to be almost

the same as words from a book)– (But with a picture, if the quality is not perfect

you will not notice that much)

Huffman Compression

• Huffman compression can give savings of 20-90% most of the time, depending on the data being compressed.

• Huffman's greedy algorithm uses a table of the frequencies of occurrence for each character to build an optimal way of representing a character as a binary string.

• This means that it will count how many times it sees each letter and make a table of the counts. Then it will find the shortest binary strings (like "01010") that will map to the letter that occurs the most.

Huffman Compression• Huffman will do compression based on a "variable-

length" code. This means that characters that occur more often will use fewer bits, while characters that do not occur often will use more bits.

• A "fixed-length" code means every character has the same amount of bits used to save it.

• For example, what if we have a book with 100,000 letters that we want to compress

A B C D E F

# (in thousands 45 13 12 16 9 5

Fixed-length code 000 001 010 011 100 101

Variable length code 0 101 100 111 1101

1100

Huffman Compression

• In the "fixed-length" coding we need 3 bits for each character and there are 100,000 characters. Thus, 300,000 bits are needed

• In the variable length code, we can do better.– (45*1 + 13*3 + 12*3 + 16*3 + 9*4 +

5*4) = 224,000 bits. – This saving is about 25% better than the

fixed length

Huffman Compression

• If you noticed the variable lengthed codes from before, there was one character that had a 1 bit code, but there was no two bit codes.

• This is because Huffman uses "prefix codes." This means that a used code is never the same as the beginning of another code. (Prefix means the part of a word that comes in the beginning)

• This makes encoding and decoding much easier if we follow this rule.

• For example: abc = 0 + 101 + 100 = 0101100• And 001011101 = 0 + 0 + 101 + 1101 = aabe• Prefix codes make sure that we always get the

same thing out that we put in ( D(C(x)) = x )

Huffman

• The decoding needs an easy way to find which letter belongs to a mapping.

• A binary tree is a good data structure for this.• The leaves of the tree will be a character and the

paths will represent either 0 or 1. • 0 can mean "go to the left child"• 1 can mean "go to the right child"• This also means we can easily know how many

bits we need to save the data.• Cost = SumAllCharacters(height(c) * count(c))

where "c" is a character• Or Cost =

Allc

cCcH )(*)(

Example Trees

Fixed Length Codes

Variable Length Codes

Each node stores the total number of characters in the sub tree below.The leaves contain the character and the number of occurrences.We can see that a variable length code tree may become very unbalanced

Huffman Coding• Huffman invented a greedy

algorithm that will find the variable length codes and build the tree in a bottom up manner.

• C is a set of n characters.• Q is a Priority Queue that

holds and sorts all the characters by frequency

• The for loop keeps taking out the two minimum nodes and making them children of z, a new node that holds the total frequency count

• Then we insert z back into the Queue until we have done this for each character.

• Lastly, we return the Extract-Min which is the root of the tree

Huffman(C)

n = length of C

Q = C

for i = 1 to n – 1

z = new Node()

z->left = Extract-Min(Q)

z->right = Extract-Min(Q)

z->count = z->left->count +

z->right->count

Insert(Q,z)

return Extract-Min(Q)

Huffman Tree BuildingA B

C D

E F

Huffman Trees• The Huffman coding uses a greedy

algorithm because it has an "optimal substructure."

• This means that if it is able to solve small problems correctly, then it can put the small problems together to get the best solution for the whole problem

• In the Huffman codes, we solve a prefix code problem for two nodes at a time. We "greedily" choose the two smallest nodes and then work on the next two smallest.

• This process demonstrates how to use greedy algorithms to solve a problem that would otherwise be difficult

Amortized Analysis

• Amortized analysis is the time required to do a sequence of operations averaged over all the operations done.

• We have looked at the worst-case running times of individual operations, but…

• Sometimes the cost of a single operation changes a lot, so the worst case is not so good.

• Instead, we want to look at the average cost of an operation over a series of operations.

• This is different from the “Average-case” analysis. • Amortized analysis looks at the average cost of an

operation. Average-case looks at how long an entire function will take on average

Amortized

• Thus, “amortized time” means:– If any sequence of n operations take < T(n) time, the

amortized time per operation is T(n)/n– Also, if the amortized time of one operation is U(n), then

any sequence of n operations takes n*U(n) time

• This average is over a sequence of operations for any sequence– Not the average for an input distribution– Not the average over random choices made by an

algorithm

• Amortized analysis: a way to express an algorithm in such a way that even if the worst case is bad, the total performance of a sequence of operations is not always bad

Amortized Analysis

• Method One: Accounting method– Charge each operation an amortized cost– Amount not used stored in “bank”– Later operations can used stored credit– Balance must not go negative

• Method Two: Aggregate method of amortized analysis:– n operations take time T(n)– Average cost of an operation = T(n)/n

Dynamic Tables

• What if we want to make a table, like a hash table, for dynamic data (data that changes). And we want to make it as small as possible

• Problem: if too many items inserted, table may be too small for all of them, so…

• Solution: get more memory if we need it

Dynamic Tables

1. Initialize table to size m = 12. Insert elements until n elements >

m3. Generate new table of size 2m4. Reinsert old elements into new table5. (back to step 2)

– What is the worst-case cost of an insert?– One insert can be costly, but the total?

Analysis Of Dynamic Tables

• Let ci = cost of ith insert

– ci = i if i-1 is exact power of 2,

– 1 otherwise

• Example:– Operation Table Size Cost

Insert(1) 1 1 1




– 1 otherwise


Insert(1) 1 1 1Insert(2) 2 1 + 1 2Insert(3) 4 1 + 2 3




– 1 otherwise


Insert(1) 1 1 1Insert(2) 2 1 + 1 2Insert(3) 4 1 + 2 3Insert(4) 4 1 4Insert(5) 8 1 + 4 5




– 1 otherwise


Insert(1) 1 1 1Insert(2) 2 1 + 1 2Insert(3) 4 1 + 2 3Insert(4) 4 1 4Insert(5) 8 1 + 4 5Insert(6) 8 1 6Insert(7) 8 1 7Insert(8) 8 1 8

Analysis Of Dynamic Tables• Let ci = cost of ith insert


– 1 otherwise


Insert(1) 1 1 1Insert(2) 2 1 + 1 2Insert(3) 4 1 + 2Insert(4) 4 1Insert(5) 8 1 + 4Insert(6) 8 1Insert(7) 8 1Insert(8) 8 1Insert(9) 16 1 + 8

123456789

Aggregate Analysis: Dynamic Tables

• "n" Insert() operations cost:

• Average cost of operation = (total cost)/(# operations) < 3n/n < 3

• So we can say a dynamic table costs the same as a fixed-size table– Both O(1) per Insert operation, even though in

the dynamic table some operations are really bad.

nnnncn

j

jn

ii 3)12(2

lg

01

Accounting Analysis

• We can also use another form of amortized analysis, called the "accounting method".

• For our dynamic table we can "charge" each operation $3 amortized cost– Use $1 to perform immediate Insert()– Store $2

• When table doubles– Use the saved $2 to reinsert two old items– We’ve "paid" these costs with the last n/2

Insert()s• Benefit: O(1) amortized cost per operation

Accounting Analysis

• Suppose we also must support insert & delete.

• Then the table could shrink and grow– Table overflows double it (as before)– Table < 1/4 full halve it– Charge $3 for Insert (as before)– Charge $2 for Delete

• Store extra $1 in emptied slot• Use later to pay to copy remaining items to new table

when shrinking table• We only need extra $1 because we're reinserting fewer

items

Example: Binary Counter• Consider the following

problem:• You want a binary

counter that does n Increment operations

• We use an array A that holds bits so that– A[i] = 0 or – A[i] = 1

• A[0] is lowest order bit, so value of counter is

• X = SUM(A[i]*2i) from i n

Algorithm

Increment(A) A[0] = A[0]+1 i = 0

while (A[i] == 2) A[i+1] = A[i+1] + 1;A[i] = 0;i++

Example: Binary Counter

• The running time of Increment is the number of iterations of the while loop

• Examples:– X = 47 A = <0,…,0,1,0,1,1,1,1 >– X = 48 A = <0,…,0,1,1,0,0,0,0 >– X = 49 A = <0,…,0,1,1,0,0,0,1 >

• Increment from x = 47 to x = 48 has cost 5• Increment from x = 48 to x = 49 has cost 1


• Analysis of a sequence of n increments– Number of bits in representation of n is log n,

thus n operations = O(n) * O(lg n) = O(n lg n)– Amortized running time of Increment is O(1) for

one operation, O(n) for n operations– A[0] will flip on every increment (n times)– A[1] will flip on every second increment (n/2

times)– A[2] will flip on every third increment (n/4)– A[i] will flip on every 2i increment (n/2i times)– Total running time T(n) = Σ(n/2i) from i=0 to log n– T(n) = Σ(n/2i) to lg n < Σ (1/2i) to ∞ = 2n = O(n)– Amortized cost for one operation O(2n)/n = O(2)


• Accounting Analysis:– Every 1 in A will hold one credit– Change from 1 -> 0 paid using a credit– Change from 0 -> 1 paid by Increment.

Pay one credit to do the flip and place one credit on new 1

– Increment cost O(1) amortized.

Summary• Advanced design techniques should be put in

your mind in case you run across a problem that seems especially difficult.

• Greedy algorithms can be very helpful in solving problems that would seem hard to program.

• Amortized analysis is not really a way to program, but is instead a way to find out running times.

• What it does special is that it realizes that different operations and different input may change the running time.

• In these cases, we may be more interested in an average running time of an operation, instead of the worst case of an algorithm.

Tonga Institute of Higher Education Design and Analysis of Algorithms

Documents

Transcript of Tonga Institute of Higher Education Design and Analysis of Algorithms