TREES Chapter 6 Common final examinations When: Thursday, 12/12, 3:30-5:30 PM Where: Ritter Hall -...

Post on 28-Dec-2015

217 views 0 download

Tags:

Transcript of TREES Chapter 6 Common final examinations When: Thursday, 12/12, 3:30-5:30 PM Where: Ritter Hall -...

TREES

Chapter 6

Common final examinations • When: Thursday, 12/12, 3:30-5:30 PM• Where: Ritter Hall - Walk Auditorium 131

Chapter Objectives

To learn how to use a tree to represent a hierarchical

organization of information use recursion to process trees implement binary trees, binary search

trees, and heaps using linked data structures and arrays

To understand the different ways of traversing a tree the differences between binary trees, binary

search trees, and heaps

Chapter Objectives (cont.) To learn how to

use a binary search tree to store information so that it can be retrieved in an efficient manner

use a Huffman tree to encode characters using fewer bytes than ASCII or Unicode, resulting in smaller files and reduced storage requirements

Trees - Introduction

All previous data organizations we've studied are linear — each element can have only one predecessor and successor

Accessing all elements in a linear sequence is O(n)

Trees are nonlinear and hierarchical Tree nodes can have multiple successors

(children), but only one predecessor or parent

Where Have I Seen Trees?

Trees - Introduction (cont.)

Trees can represent hierarchical organizations of information: class hierarchy disk directory and subdirectories family tree

Trees are recursive data structures Why?

Many methods to process trees are written recursively

Trees - Introduction (cont.)

We cover only binary trees In a binary tree each element has two

successors Binary trees can be represented by

arrays and by linked data structures Searching a binary search tree, an

ordered tree, is generally more efficient than searching an ordered list—O(log n) versus O(n)

Convert a binary search tree to a sorted double-linked list. We can only change the target of pointers, but cannot create any new nodes.

Microsoft Interview Question

Describe a chicken using a programming language.

Google Interview Question

Tree Terminology and Applications

Tree Terminology

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successors

Tree Terminology (cont.)

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successorsThe node at the

top of a tree is called its root

Tree Terminology (cont.)

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successors

The links from a node to its successors are called branches

Tree Terminology (cont.)

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successorsThe successors of

a node are called its children

Tree Terminology (cont.)

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successors

The predecessor of a node is called its parent

Tree Terminology (cont.)

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successorsEach node in a tree has exactly one parent except for the root node, which has no parent

Tree Terminology (cont.)

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successors

Nodes that have the same parent are siblings

Tree Terminology (cont.)

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successors

A node that has no children is called a leaf node

Tree Terminology (cont.)

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successors

A node that has no children is called a leaf node

Leaf nodes also are known as

external nodes,

and nonleaf nodes are known

as internal nodes

Tree Terminology (cont.)

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successors

A generalization of the parent-child relationship is the

ancestor-descendant relationship

Tree Terminology (cont.)

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successors

dog is the parent of cat in this tree

A generalization of the parent-child relationship is the

ancestor-descendant relationship

Tree Terminology (cont.)

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successors

cat is the parent of canine in this tree

A generalization of the parent-child relationship is the

ancestor-descendant relationship

Tree Terminology (cont.)

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successors

canine is a descendant of cat in this tree

A generalization of the parent-child relationship is the

ancestor-descendant relationship

Tree Terminology (cont.)

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successors

dog is an ancestor of canine in this tree

A generalization of the parent-child relationship is the

ancestor-descendant relationship

Tree Terminology (cont.)

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successors

A subtree of a node is a tree whose root is a child of that node

Tree Terminology (cont.)

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successors

A subtree of a node is a tree whose root is a child of that node

Tree Terminology (cont.)

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successors

A subtree of a node is a tree whose root is a child of that node

Tree Terminology (cont.)

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successors

The level of a node is

determined by its distance from the

root

Tree Terminology (cont.)

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successors

The level of a node is its

distance from the root plus 1

Level 1

Level 2

Level 3

Tree Terminology (cont.)

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successors

The level of a node is defined

recursively

Level 1

Level 2

Level 3

Tree Terminology (cont.)

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successors

The level of a node is defined

recursively

Level 1

Level 2

Level 3

• If node n is the root of tree T, its level is 1

• If node n is not the root of tree T, its level is 1 + the level of its parent

Tree Terminology (cont.)

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successorsThe height of a tree is the number of nodes in the longest path from the root node to a leaf node

Tree Terminology (cont.)

dog

cat wolf

canine

A tree consists of a collection of elements or nodes, with each node linked to its successorsThe height of a tree is the number of nodes in the longest path from the root node to a leaf node

The height of this tree is 3

Binary Trees

In a binary tree, each node has two subtrees

A set of nodes T is a binary tree if either of the following is true T is empty Its root node has two subtrees, TL and TR,

such that TL and TR are binary trees

(TL = left subtree; TR = right subtree)

Example: Expression Tree Each node contains an

operator or an operand Operands are stored in

leaf nodes Parentheses are not stored

in the tree because the tree structure dictates the order of operand evaluation

Operators in nodes at higher tree levels are evaluated after operators in nodes at lower tree levels

(x + y) * ((a + b) / c)

Example: Binary Search Tree Binary search trees

All elements in the left subtree precede those in the right subtree

A formal definition:

A set of nodes T is a binary search tree if either of the following is true:

T is empty If T is not empty, its root node has two subtrees, TL and

TR, such that TL and TR are binary search trees and the value in the root node of T is greater than all values in TL and is less than all values in TR

dog

cat wolf

canine

Write a function to determine whether a given binary tree of distinct integers is a valid binary search tree. Tricky problem

Direct recursion does not work.

Google Interview Question

Binary Search Tree (cont.)

A binary search tree never has to be sorted because its elements always satisfy the required order relationships

When new elements are inserted (or removed) properly, the binary search tree maintains its order

In contrast, a sorted array must be expanded whenever new elements are added, and compacted whenever elements are removed—expanding and contracting are both O(n)

Binary Search Tree (cont.)

When searching a BST, each probe has the potential to eliminate half the elements in the tree, so searching can be O(log n)

In the worst case, searching is O(n)

Recursive Algorithm for Searching a Binary Tree

1. if the tree is empty

2. return null (target is not found)

else if the target matches the root node's data

3. return the data stored at the root node

else if the target is less than the root node's data

4. return the result of searching the left subtree of the root

else

5. return the result of searching the right subtree of the

root

Full, Perfect, and Complete Binary Trees

A full binary tree is a binary tree where all nodes have either 2 children or 0 children (the leaf nodes)

7

101

12930

52 11

64

13

Full, Perfect, and Complete Binary Trees (cont.)

A perfect binary tree is a full binary tree of height n with exactly 2n – 1 nodes

In this case, n = 3 and 2n – 1 = 7

3

1

420

5

6

Full, Perfect, and Complete Binary Trees (cont.)

A complete binary tree is a perfect binary tree through level n - 1 with some extra leaf nodes at level n (the tree height), all toward the left

3

1

420

5

General Trees

We do not discuss general trees in this chapter.

Tree Traversals

Tree Traversals

Often we want to determine the nodes of a tree and their relationship We can do this by walking through the tree in a

prescribed order and visiting the nodes as they are encountered

This process is called tree traversal Three common kinds of binary tree traversal

Inorder Preorder Postorder

Write a preorder traversal of a binary search tree without recursion / stack / extra memory storage. Hint – you can augment the node data structure.

However, you can’t add a visited field to the node.

Print a tree like (Parent ( leftchild (leftchild, rightchild), rightchild(leftchild,rightchild) ) )

Google Interview Question

Tree Traversals (cont.)

Preorder: visit root node, traverse TL, traverse TR

Inorder: traverse TL, visit root node, traverse TR

Postorder: traverse TL, traverse TR, visit root node

Visualizing Tree Traversals

You can visualize a tree traversal by imagining a mouse that walks along the edge of the tree

If the mouse always keeps the tree to the left, it will trace a route known as the Euler tour

The Euler tour is the path traced in blue in the figure on the right

Visualizing Tree Traversals (cont.)

A Euler tour (blue path) is a preorder traversal

The sequence in this example isa b d g e h c f i j

The mouse visits each node before traversing its subtrees (shown by the downward pointing arrows)

Visualizing Tree Traversals (cont.)

If we record a node as the mouse returns from traversing its left subtree (shown by horizontal black arrows in the figure) we get an inorder traversal

The sequence isd g b h e a i f j c

Visualizing Tree Traversals (cont.)

If we record each node as the mouse last encounters it, we get a postorder traversal (shown by the upward pointing arrows)

The sequence isg d h e b i j f c a

Traversals of Binary Search Trees and Expression Trees

An inorder traversal of a binary search tree results in the nodes being visited in sequence by increasing data value

canine, cat, dog, wolf

dog

cat wolf

canine

Traversals of Binary Search Trees and Expression Trees (cont.)

An inorder traversal of an expression tree results in the sequencex + y * a + b / c

If we insert parentheses where they belong, we get the infix form:(x + y) * ((a + b) / c)

*

/+

c+yx

ba

Traversals of Binary Search Trees and Expression Trees (cont.)

A postorder traversal of an expression tree results in the sequencex y + a b + c / *

This is the postfix or reverse polish form of the expression

Operators follow operands

*

/+

c+yx

ba

Traversals of Binary Search Trees and Expression Trees (cont.)

A preorder traversal of an expression tree results in the sequence* + x y / + a b c

This is the prefix or forward polish form of the expression

Operators precede operands

*

/+

c+yx

ba

Implementing a BinaryTree Class

Node<E> Class

Just as for a linked list, a node consists of a data part and links to successor nodes

The data part is a reference to type E

A binary tree node must have links to both its left and right subtrees

Node<E> Class (cont.)

protected static class Node<E> implements Serializable {

protected E data;

protected Node<E> left;

protected Node<E> right;

public Node(E data) {

this.data = data;

left = null;

right = null;

}

public String toString() {

return data.toString();

}

}

Node<E> is declared as an inner class within

BinaryTree<E>

Node<E> Class (cont.)

protected static class Node<E> implements Serializable {

protected E data;

protected Node<E> left;

protected Node<E> right;

public Node(E data) {

this.data = data;

left = null;

right = null;

}

public String toString() {

return data.toString();

}

}

Node<E> is declared protected. This way we

can use it as a superclass.

BinaryTree<E> Class (cont.)

BinaryTree<E> Class (cont.)

Assuming the tree is referenced by variable bT (type BinaryTree)

then . . .

BinaryTree<E> Class (cont.)

bT.root.data references the

Character object storing '*'

BinaryTree<E> Class (cont.)

bT.root.left references the left subtree of the root

BinaryTree<E> Class (cont.)

bT.root.right references the right subtree of the root

BinaryTree<E> Class (cont.)bT.root.right.data

references the Character object storing

'/'

BinaryTree<E> Class (cont.)

BinaryTree<E> Class (cont.)

Class heading and data field declarations:

import java.io.*;

public class BinaryTree<E> implements Serializable {

// Insert inner class Node<E> here

protected Node<E> root;

. . .

}

BinaryTree<E> Class (cont.)

The Serializable interface defines no methods

It provides a marker for classes that can be written to a binary file using the ObjectOutputStream and read using the ObjectInputStream

Constructors

The no-parameter constructor:

public BinaryTree() {

root = null;

}

The constructor that creates a tree with a given node at the root:

protected BinaryTree(Node<E> root) {

this.root = root;

}

Constructors (cont.)

The constructor that builds a tree from a data value and two trees:

public BinaryTree(E data, BinaryTree<E> leftTree, BinaryTree<E> rightTree) {

root = new Node<E>(data);if (leftTree != null) {

root.left = leftTree.root;} else {

root.left = null;}if (rightTree != null) {

root.right = rightTree.root;} else {

root.right = null;}

}

getLeftSubtree and getRightSubtree Methods

public BinaryTree<E> getLeftSubtree() {

if (root != null && root.left != null) {

return new BinaryTree<E>(root.left);

} else {

return null;

}

}

getRightSubtree method is symmetric

isLeaf Method

public boolean isLeaf() {

return (root.left == null && root.right == null);

}

toString Method

The toString method generates a string representing a preorder traversal in which each local root is indented a distance proportional to its depth

public String toString() {

StringBuilder sb = new StringBuilder();

preOrderTraverse(root, 1, sb);

return sb.toString();

}

preOrderTraverse Method

private void preOrderTraverse(Node<E> node, int depth, StringBuilder sb) {

for (int i = 1; i < depth; i++) {

sb.append(" "); // indentation

}

if (node == null) {

sb.append("null\n");

} else {

sb.append(node.toString());

sb.append("\n");

preOrderTraverse(node.left, depth + 1, sb);

preOrderTraverse(node.right, depth + 1, sb);

}

}

preOrderTraverse Method (cont.)

*

+

x

null

null

y

null

null

/

a

null

null

b

null

null

*

+

ayx

/

b

(x + y) * (a / b)

Binary Search Trees

Overview of a Binary Search Tree

Recall the definition of a binary search tree:A set of nodes T is a binary search tree if either of the following is true

T is empty If T is not empty, its root node has two

subtrees, TL and TR, such that TL and TR are binary search trees and the value in the root node of T is greater than all values in TL and less than all values in TR

Overview of a Binary Search Tree (cont.)

Recursive Algorithm for Searching a Binary Search Tree

1. if the root is null

2. the item is not in the tree; return null

3. Compare the value of target with root.data

4. if they are equal

5. the target has been found; return the data at the root

else if the target is less than root.data

6. return the result of searching the left subtreeelse

7. return the result of searching the right subtree

Searching a Binary Tree

Searching for "kept"

Performance

Search a tree is generally O(log n) If a tree is not very full, performance will

be worse Searching a tree with only

right subtrees, for example, is O(n)

Interface SearchTree<E>

BinarySearchTree<E> Class

Implementing find Methods

Insertion into a Binary Search Tree

Implementing the add Methods

/** Starter method add. pre: The object to insert must implement the

Comparable interface. @param item The object being inserted @return true if the object is inserted, false

if the object already exists in the tree*/public boolean add(E item) { root = add(root, item); return addReturn;}

Implementing the add Methods (cont.)

/** Recursive add method. post: The data field addReturn is set true if the item is added to the tree, false if the item is already in the tree. @param localRoot The local root of the subtree @param item The object to be inserted @return The new local root that now contains the inserted item */ private Node<E> add(Node<E> localRoot, E item) { if (localRoot == null) { // item is not in the tree — insert it. addReturn = true; return new Node<E>(item); } else if (item.compareTo(localRoot.data) == 0) { // item is equal to localRoot.data addReturn = false; return localRoot; } else if (item.compareTo(localRoot.data) < 0) { // item is less than localRoot.data localRoot.left = add(localRoot.left, item); return localRoot; } else { // item is greater than localRoot.data localRoot.right = add(localRoot.right, item); return localRoot; } }

Removal from a Binary Search Tree

Three cases need to be considered: If the item to be removed has no

children then simply delete the reference to the item

If the item to be removed has only one child thenchange the reference to the item so that it

references the item’s only child If the item to be removed has two

children thenRequires a bit more work…

Removal from a Binary Search Tree (cont.)

Removing from a Binary Search Tree (cont.)

If the item to be removed has two children, then Replace it with the largest item in its left

subtree the inorder predecessor

Delete the node with the largest value from its left subtree

Removing from a Binary Search Tree (cont.)

Delete rat

Removing from a Binary Search Tree (cont.)

Algorithm for Removing from a Binary Search Tree

Implementing the delete Method

Listing 6.5 (BinarySearchTree delete Methods; pages 325-326)

NOTE: There is nothing special about the left

subtree. We could do the same thing with the right subtree.How? Just use the smallest value in the right subtree.

Testing a Binary Search Tree

Verify that an inorder traversal will display the tree contents in ascending order after a series of insertions and deletions are performed

Heaps and Priority Queues

Heaps and Priority Queues A heap is a complete binary tree with the

following properties The value in the root is

the smallest item in the tree

Every nonempty subtree is a heap

Inserting an Item into a Heap

6

18 29

37 26 76 32 74 89

20 28 39 66

Inserting an Item into a Heap (cont.)

6

18 29

837 26 76 32 74 89

20 28 39 66

Inserting an Item into a Heap (cont.)

6

18 29

6637 26 76 32 74 89

20 28 39 8

Inserting an Item into a Heap (cont.)

6

18 8

6637 26 76 32 74 89

20 28 39 29

Inserting an Item into a Heap (cont.)

6

18 8

6637 26 76 32 74 89

20 28 39 29

Removing an Item from a Heap

6

18 8

6637 26 76 32 74 89

20 28 39 29

Always remove from the root

Removing an Item from a Heap (cont.)

6

18 8

6637 26 76 32 74 89

20 28 39 29

Removing an Item from a Heap (cont.)

66

18 8

37 26 76 32 74 89

20 28 39 29

Removing an Item from a Heap (cont.)

8

18 66

37 26 76 32 74 89

20 28 39 29

Removing an Item from a Heap (cont.)

8

18 29

37 26 76 32 74 89

20 28 39 66

Removing an Item from a Heap (cont.)

8

18 29

37 26 76 32 74 89

20 28 39 66

Implementing a Heap

Because a heap is a complete binary tree, it can be implemented efficiently using an array rather than a linked data structure

0 1 2 7 8 9 10 11 123 4 5 6

Implementing a Heap (cont.)

8

18 29

37 26 76 32 74 89

20 28 39 66

8

8

18 29

18 29

20 28 39 66

20 28 39 66

37 26 76 32 74 89

37 26 76 32 74 89

0 1 2 7 8 9 10 11 123 4 5 6

Implementing a Heap (cont.)

8 18 29 20 28 39 66 37 26 76 32 74 89

8

18 29

37 26 76 32 74 89

20 28 39 66

Pare

nt

L. Child

R. C

hild

For a node at position p,

L. child position: 2p + 1 R. child position: 2p + 2

0 1 2 7 8 9 10 11 123 4 5 6

Implementing a Heap (cont.)

8 18 29 20 28 39 66 37 26 76 32 74 89

8

18 29

37 26 76 32 74 89

20 28 39 66

Pare

nt

L. Child

R. C

hild

For a node at position p,

L. child position: 2p + 1 R. child position: 2p + 2

0 1 2 7 8 9 10 11 123 4 5 6

Implementing a Heap (cont.)

8 18 29 20 28 39 66 37 26 76 32 74 89

8

18 29

37 26 76 32 74 89

20 28 39 66

Pare

nt

L. Child

R. C

hild

For a node at position p,

L. child position: 2p + 1 R. child position: 2p + 2

0 1 2 7 8 9 10 11 123 4 5 6

Implementing a Heap (cont.)

8 18 29 20 28 39 66 37 26 76 32 74 89

8

18 29

37 26 76 32 74 89

20 28 39 66

Pare

nt

L. Child

R. C

hild

For a node at position p,

L. child position: 2p + 1 R. child position: 2p + 2

0 1 2 7 8 9 10 11 123 4 5 6

Implementing a Heap (cont.)

8 18 29 20 28 39 66 37 26 76 32 74 89

8

18 29

37 26 76 32 74 89

20 28 39 66

Pare

nt

L. Child

R. C

hild

For a node at position p,

L. child position: 2p + 1 R. child position: 2p + 2

0 1 2 7 8 9 10 11 123 4 5 6

Implementing a Heap (cont.)

8 18 29 20 28 39 66 37 26 76 32 74 89

8

18 29

37 26 76 32 74 89

20 28 39 66

Pare

nt

Child

A node at position c can find its parent at(c – 1)/2

Inserting into a Heap Implemented as an ArrayList

8

18 29

37 26 76 32 74 89

20 28 39 66

0 1 2 7 8 9 10 11 123 4 5 6

8 18 29 20 28 39 66 37 26 76 32 74 89

13

1. Insert the new element at the end of the ArrayList and set child to table.size() - 1

Inserting into a Heap Implemented as an ArrayList (cont.)

6

18 29

37 26 76 32 74 89

20 28 39 66

0 1 2 7 8 9 10 11 123 4 5 6

6 18 29 20 28 39 66 37 26 76 32 74 89

13

1. Insert the new element at the end of the ArrayList and set child to table.size() - 1

8

8

Child

Inserting into a Heap Implemented as an ArrayList (cont.)

6

18 29

37 26 76 32 74 89

20 28 39 66

0 1 2 7 8 9 10 11 123 4 5 6

6 18 29 20 28 39 66 37 26 76 32 74 89

13

2. Set parent to (child – 1)/ 2

8

8

Child

Parent

Inserting into a Heap Implemented as an ArrayList (cont.)

6

18 29

37 26 76 32 74 89

20 28 39 66

0 1 2 7 8 9 10 11 123 4 5 6

6 18 29 20 28 39 66 37 26 76 32 74 89

13

3. while (parent >= 0 and table[parent] > table[child])

4. Swap table[parent] and table[child]

5. Set child equal to parent

6. Set parent equal to (child-1)/28

8

Child

Parent

Inserting into a Heap Implemented as an ArrayList (cont.)

6

18 29

37 26 76 32 74 89

20 28 39 8

0 1 2 7 8 9 10 11 123 4 5 6

6 18 29 20 28 39 8 37 26 76 32 74 89

13

3. while (parent >= 0 and table[parent] > table[child])

4. Swap table[parent] and table[child]

5. Set child equal to parent

6. Set parent equal to (child-1)/266

66

Child

Parent

Inserting into a Heap Implemented as an ArrayList (cont.)

6

18 29

37 26 76 32 74 89

20 28 39 8

0 1 2 7 8 9 10 11 123 4 5 6

6 18 29 20 28 39 8 37 26 76 32 74 89

13

3. while (parent >= 0 and table[parent] > table[child])

4. Swap table[parent] and table[child]

5. Set child equal to parent

6. Set parent equal to (child-1)/266

66

Child

Parent

Inserting into a Heap Implemented as an ArrayList (cont.)

6

18 29

37 26 76 32 74 89

20 28 39 8

0 1 2 7 8 9 10 11 123 4 5 6

6 18 29 20 28 39 8 37 26 76 32 74 89

13

3. while (parent >= 0 and table[parent] > table[child])

4. Swap table[parent] and table[child]

5. Set child equal to parent

6. Set parent equal to (child-1)/266

66

Child

Parent

Inserting into a Heap Implemented as an ArrayList (cont.)

6

18 29

37 26 76 32 74 89

20 28 39 8

0 1 2 7 8 9 10 11 123 4 5 6

6 18 29 20 28 39 8 37 26 76 32 74 89

13

3. while (parent >= 0 and table[parent] > table[child])

4. Swap table[parent] and table[child]

5. Set child equal to parent

6. Set parent equal to (child-1)/266

66

Child

Parent

Inserting into a Heap Implemented as an ArrayList (cont.)

6

18 8

37 26 76 32 74 89

20 28 39 29

0 1 2 7 8 9 10 11 123 4 5 6

6 18 8 20 28 39 29 37 26 76 32 74 89

13

3. while (parent >= 0 and table[parent] > table[child])

4. Swap table[parent] and table[child]

5. Set child equal to parent

6. Set parent equal to (child-1)/266

66

Child

Parent

Inserting into a Heap Implemented as an ArrayList (cont.)

6

18 8

37 26 76 32 74 89

20 28 39 29

0 1 2 7 8 9 10 11 123 4 5 6

6 18 8 20 28 39 29 37 26 76 32 74 89

13

3. while (parent >= 0 and table[parent] > table[child])

4. Swap table[parent] and table[child]

5. Set child equal to parent

6. Set parent equal to (child-1)/266

66

Child

Parent

Inserting into a Heap Implemented as an ArrayList (cont.)

6

18 8

37 26 76 32 74 89

20 28 39 29

0 1 2 7 8 9 10 11 123 4 5 6

6 18 8 20 28 39 29 37 26 76 32 74 89

13

3. while (parent >= 0 and table[parent] > table[child])

4. Swap table[parent] and table[child]

5. Set child equal to parent

6. Set parent equal to (child-1)/266

66

Child

Parent

Inserting into a Heap Implemented as an ArrayList (cont.)

6

18 8

37 26 76 32 74 89

20 28 39 29

0 1 2 7 8 9 10 11 123 4 5 6

6 18 8 20 28 39 29 37 26 76 32 74 89

13

3. while (parent >= 0 and table[parent] > table[child])

4. Swap table[parent] and table[child]

5. Set child equal to parent

6. Set parent equal to (child-1)/266

66

Removal from a Heap Implemented as an ArrayList

Removing an Element from a Heap Implemented as an ArrayList

1. Remove the last element (i.e., the one at size() – 1) and set the item at 0 to this value.

2. Set parent to 0.

3. while (true)

4. Set leftChild to (2 * parent) + 1 and rightChild to leftChild + 1.

5. if leftChild >= table.size()

6. Break out of loop.

7. Assume minChild (the smaller child) is leftChild.

8. if rightChild < table.size() and

table[rightChild] < table[leftChild]

9. Set minChild to rightChild.

10. if table[parent] > table[minChild]

11. Swap table[parent] and table[minChild].

12. Set parent to minChild.

else

13. Break out of loop.

Performance of the Heap

remove traces a path from the root to a leaf

insert traces a path from a leaf to the root

This requires at most h steps where h is the height of the tree

The largest full tree of height h has 2h-1 nodes

The smallest complete tree of height h has 2(h-1) nodes

Both insert and remove are O(log n)

Interview Question

Given an array 1 to N, how many permutations of it will be Min Heap. This is pretty tricky…

Recall palindromes? Find numbers which are palindromic in both

their decimal and octal representations

Priority Queues

The heap is used to implement a special kind of queue called a priority queue

The heap is not very useful as an ADT on its own We will not create a Heap interface or code a class

that implements it Instead, we will incorporate its algorithms when we

implement a priority queue class and heapsort Sometimes a FIFO queue may not be the best

way to implement a waiting line A priority queue is a data structure in which only

the highest-priority item is accessible

Priority Queues (cont.)

In a print queue, sometimes it is more appropriate to print a short document that arrived after a very long document

A priority queue is a data structure in which only the highest-priority item is accessible (as opposed to the first item entered)

Insertion into a Priority Queue

Priority of document is inversely proportional to its page count.

PriorityQueue Class

Java provides a PriorityQueue<E> class that implements the Queue<E> interface given in Chapter 4.

Using a Heap as the Basis of a Priority Queue

In a priority queue, just like a heap, the “smallest” item always is removed first

Because heap insertion and removal is O(log n), a heap can be the basis of a very efficient implementation of a priority queue

java.util.PriorityQueue uses an Object[] array.

The TA will present an implementation of priority queue using an ArrayList

Huffman Trees

Huffman Tree

A Huffman tree represents Huffman codes for characters that might appear in a text file

As opposed to ASCII or Unicode, Huffman code uses different numbers of bits to encode letters more common characters use fewer bits Examples of frequently used characters are…

Many programs that compress files use Huffman codes

Huffman Tree (cont.)

To form a code, traverse the tree from the root to the chosen

character, appending 0 if you branch left, and 1 if you branch right.

Huffman Tree (cont.)

Examples:d : 10110

e : 010

Huffman Trees

A Huffman tree can be implemented using a binary tree and a PriorityQueue

A straight binary encoding of an alphabet assigns a unique binary number to each symbol in the alphabet Unicode is an example of such a coding

The message “go eagles” requires 144 bits in Unicode but only 38 bits using Huffman coding

Huffman Trees (cont.)

Huffman Trees (cont.)

Building a Custom Huffman Tree

Suppose we want to build a custom Huffman tree for a file

Input: an array of objects such that each object contains a reference to a symbol occurring in that file and the frequency of occurrence (weight) for the symbol in that file

Building a Custom Huffman Tree (cont.)

Analysis: Each node will have storage for two data items:

the weight of the node and the symbol associated with the node

All symbols will be stored in leaf nodes For nodes that are not leaf nodes, the symbol part

has no meaning The weight of a leaf node will be the frequency of

the symbol stored at that node The weight of an interior node will be the sum of

frequencies of all leaf nodes in the subtree rooted at the interior node

Building a Custom Huffman Tree (cont.)

Analysis: A priority queue will be the key data

structure in our Huffman tree We will store individual symbols and

subtrees of multiple symbols in order by their priority (frequency of occurrence)

Building a Custom Huffman Tree (cont.)

Building a Custom Huffman Tree (cont.)

Design

Algorithm for Building a Huffman Tree1. Construct a set of trees with root nodes that contain each of the individual symbols and their weights.2. Place the set of trees into a priority queue.3. while the priority queue has more than one item4. Remove the two trees with the smallest weights.5. Combine them into a new binary tree in which the weight of the tree

root is the sum of the weights of its children.6. Insert the newly created tree back into the priority queue.

Implementation

Listing 6.9 (Class HuffmanTree; page 349)

Listing 6.10 (The buildTree Method (HuffmanTree.java); pages 350-351)

Listing 6.11 (The decode Method (HuffmanTree.java); page 352)

Interview Question

Write code for Huffman encoding. Nailed it!

153

Huffman Code Construction: 2nd Example

Character count in text.125

Freq

93

80

76

73

71

61

55

41

40

E

Char

T

A

O

I

N

R

H

L

D

31

27

C

U

65S

Example source:pages.cs.wisc.edu/~shuchi/courses/577-F12/demos/huffman.ppt

154

Huffman Code Construction

C U

31 27

125Freq

9380767371

61554140

EChar

TAOIN

RHLD

3127

CU

65S

155

Huffman Code Construction

C U

58

31 27

125Freq

9380767371

61

554140

EChar

TAOIN

R

HLD

58

65S

3127

CU

156

Huffman Code Construction

C U

58

D L

81

31 27

40 41

125Freq

93

80767371

615855

EChar

T

AOIN

R

H

81

65S

4140

LD

157

Huffman Code Construction

H

C U

58

113

D L

81

31 27

5540 41

125Freq

93

80767371

61

113E

Char

T

AOIN

R

81

65S

5855H

158

Huffman Code Construction

R S H

C U

58

113126

D L

81

31 27

5561 6540 41

125

Freq

93

80767371

113E

Char

T

AOIN

81

126

61R65S

126

159

Huffman Code Construction

R S N I H

C U

58

113144126

D L

81

31 27

5571 7361 6540 41

125

Freq

93

8076

144

113E

Char

T

AO

81

7371

IN

144

160

Huffman Code Construction

R S N I H

C U

58

113144126

D L

81

156

A O

31 27

5571 7361 6540 41

80 76

126125

Freq

93

156

113E

Char

T81

8076

AO

161

Huffman Code Construction

R S N I H

C U

58

113144126

D L

81

156 174

A O T

31 27

5571 7361 6540 41

9380 76

144126125

Freq

156

113E

Char174

93T81

174

162

Huffman Code Construction

R S N I

E

H

C U

58

113144126

238

T

D L

81

156 174

A O

80 76

71 7361 6540 41

31 27

55

12593

144126

238Freq

156

Char

125113

E

163

Huffman Code Construction

R S N I

E

H

C U

58

113144126

238270

T

D L

81

156 174

A O

31 27

5571 7361 65

125

40 41

9380 76

238

156

270Freq

174

Char

144126

164

Huffman Code Construction

R S N I

E

H

C U

58

113144126

238270

330

T

D L

81

156 174

A O

31 27

5571 7361 65

125

40 41

9380 76

270330Freq

238

Char

156174

165

Huffman Code Construction

R S N I

E

H

C U

58

113144126

238270

330 508

T

D L

81

156 174

A O

31 27

5571 7361 65

125

40 41

9380 76

330508FreqChar

270238

166

Huffman Code Construction

R S N I

E

H

C U

58

113144126

238270

330 508

838

T

D L

81

156 174

A O

31 27

5571 7361 65

125

40 41

9380 76

838FreqChar

330508

167

Huffman Code Construction

R S N I

E

H

C U

0

0

T

D L

1

0 0

A O

0

11

1

10

0

1

1

1

1

1

1

0

0

0

0

0

1

125

Freq

93

80

76

73

71

61

55

41

40

E

Char

T

A

O

I

N

R

H

L

D

31

27

C

U

65S

0000

Fixed

0001

0010

0011

0100

0101

0111

1000

1001

1010

1011

1100

0110

110

Huff

011

000

001

1011

1010

1000

1111

0101

0100

11100

11101

1001

838Total 4.00 3.62