Trees Linear access time of linked lists is prohibitive Does there exist any simple data structure...

Post on 04-Jan-2016

213 views 1 download

Transcript of Trees Linear access time of linked lists is prohibitive Does there exist any simple data structure...

Trees Linear access time of linked lists is

prohibitiveDoes there exist any simple data structure for

which the running time of most operations (search, insert, delete) is O(log N)?

TreesBasic conceptsTree traversalBinary treeBinary search tree and its operations

Trees A tree is a collection of nodes

The collection can be empty(recursive definition) If not empty, a tree

consists of a distinguished node r (the root), and zero or more nonempty subtrees T1, T2, ...., Tk, each of whose roots are connected by a directed edge from r

Some Terminologies

Child and Parent Every node except the root has one parent  A node can have an zero or more children

Leaves Leaves are nodes with no children

Sibling nodes with same parent

More Terminologies Path

A sequence of edges Length of a path

number of edges on the path Depth of a node

length of the unique path from the root to that node Height of a node

length of the longest path from that node to a leaf all leaves are at height 0

The height of a tree = the height of the root = the depth of the deepest leaf

Ancestor and descendant If there is a path from n1 to n2 n1 is an ancestor of n2, n2 is a descendant of n1 Proper ancestor and proper descendant

Example: UNIX Directory

Example: Expression Trees

Leaves are operands (constants or variables) The internal nodes contain operators Will not be a binary tree if some operators are not

binary

Tree Traversal Used to print out the data in a tree in a

certain order Pre-order traversal

Print the data at the rootRecursively print out all data in the left

subtreeRecursively print out all data in the right

subtree

Preorder, Postorder and Inorder Preorder traversal

node, left, rightprefix expression

○ ++a*bc*+*defg

Preorder, Postorder and Inorder Postorder traversal

left, right, nodepostfix expression

○ abc*+de*f+g*+

Inorder traversalleft, node, rightinfix expression

○ a+b*c+d*e+f*g

Example: Unix Directory TraversalPreOrder PostOrder

Preorder, Postorder and Inorder Pseudo Code

Binary Trees A tree in which no node can have more than two

children

The depth of an “average” binary tree is considerably smaller than N, even though in the worst case, the depth can be as large as N – 1.

Generic binary tree

Worst-casebinary tree

Node Struct of Binary Tree Possible operations on the Binary Tree

ADTParent, left_child, right_child, sibling, root,

etc

ImplementationBecause a binary tree has at most two

children, we can keep direct pointers to them

Convert a Generic Tree to a Binary Tree

Binary Search Trees (BST) A data structure for efficient searching,

insertion and deletion Binary search tree property

For every node XAll the keys in its left

subtree are smaller than the key value in X

All the keys in its right subtree are larger than the key value in X

Binary Search Trees

A binary search tree Not a binary search tree

Binary Search Trees

Average depth of a node is O(log N) Maximum depth of a node is O(N)

The same set of keys may have different BSTs

Searching BST If we are searching for 15, then we are

done. If we are searching for a key < 15, then we

should search in the left subtree. If we are searching for a key > 15, then we

should search in the right subtree.

Searching Find X: return a pointer to the node that has

key X, or NULL if there is no such node

Time complexity: O(height of the tree)

BinaryNode * BinarySearchTree::Find(const float &x, BinaryNode *t) const

{

if (t == NULL)

return NULL;

else if (x < t->element)

return Find(x, t->left);

else if (t->element < x)

return Find(x, t->right);

else

return t; // match

}

In-order Traversal of BST Inorder traversal of BST prints out all the

keys in sorted order

Inorder: 2, 3, 4, 6, 7, 9, 13, 15, 17, 18, 20

findMin/ findMax Goal: return the node containing the smallest

(largest) key in the tree Algorithm: Start at the root and go left (right) as

long as there is a left (right) child. The stopping point is the smallest (largest) element

Time complexity = O(height of the tree)

BinaryNode * BinarySearchTree::FindMin(BinaryNode *t) const

{

if (t == NULL)

return NULL;

if (t->left == NULL)

return t;

return FindMin(t->left);

}

Insertion Proceed down the tree as you would with a find If X is found, do nothing (or update something) Otherwise, insert X at the last spot on the path

traversed

Time complexity = O(height of the tree)

Deletion When we delete a node, we need to

consider how we take care of the children of the deleted node.This has to be done such that the property

of the search tree is maintained.

Deletion under Different Cases

Case 1: the node is a leafDelete it immediately

Case 2: the node has one childAdjust a pointer from the parent to bypass that

node

Deletion Case 3 Case 3: the node has 2 children

Replace the key of that node with the minimum element at the right subtree

Delete that minimum element ○ Has either no child or only right child because if it has

a left child, that left child would be smaller and would have been chosen. So invoke case 1 or 2.

Time complexity = O(height of the tree)

29

AVL Trees Keep the tree balanced Use node rotation

Balanced condition:

The left and the right subtrees of each node should differ by at most one level.

The method was proposed by two Russian scientists Adelson-Velskii and Landis in 1962

30

Single Rotation5

37

6 8

9

The sub-tree containing the inserted node (rooted at 8) is at the same side as the ‘heavier’ sub-tree of node 5 (rooted at 7)

Links to be changed

New node

31

After the Rotation7

58

963

The middle node 6 is switched to the other subtree

32

Double Rotation5

6

78

5

7 89 6

9

8

8

7

7

The new node 6 is in the left subtree of node 8, while node 8 is the right subtree of 5 -> rotate 7 and 8 to obtain same-side trees.

New node

33

Splay TreesMove to the top each accessed node with rotations, decreasing the depth of the tree.

34

Multi-Way SearchNodes contain more than one key

nodes are used only to contain the keys, the actual records are stored at the terminal nodes at the bottom of the tree

35

Complexity Issues AVL trees:

Search,Insertion, and Deletion: O(logN) Creating the tree – O(N)

Trees with multiple keys: O(LogmN)

m – branching factor

Linear searchLinear search In computer science, linear search or

sequential search is a method for finding a particular value in a list, that consists of checking every one of its elements, one at a time and in sequence, until the desired one is found.

Linear search is the simplest search algorithm.

Linear searchLinear search (continued)(continued)

The simplest type of searching process.

In the sequential search, each element of the array is compared to the key, in the order it appears in the array, until the first element matching the key is found. 

Example AlgorithmLINEAR vs. BINARY

Finding the oldest person in a room full of people1. Understanding the problem

initial condition – room full of people goal – identify the oldest person assumptions

a person will give their real birthday if two people are born on the same day,

they are the same age if there is more than one oldest person,

finding any one of them is okay

Finding the oldest person (algorithm 1)1. line up all the people along one wall2. ask the first person to state his or her name and birthday, then write this

information down on a piece of paper3. for each successive person in line:

i. ask the person for his or her name and birthdayii. if the stated birthday is earlier than the birthday on the paper, cross out old

information and write down the name and birthday of this person

Note: When you reach the end of the line, the name and birthday of the oldest person will be written on the paper

Finding the oldest person (algorithm 2)1. line up all the people along one wall2. as long as there is more than one person in the line, repeatedly

i. have the people pair up (1st with 2nd, 3rd with 4th, etc) – if there is an odd number of people, the last person will be without a partner

ii. ask each pair of people to compare their birthdaysiii. request that the younger of the two leave the line

when there is only one person left in line, that person is the oldest

for algorithm 1:○ 100 people 5*100 = 500 seconds○ 200 people 5*200 = 1000 seconds○ 400 people 5*400 = 2000 seconds

. . .○ 1,000,000 people 5*1,000,000

= 5,000,000 seconds

for algorithm 2:

○ 100 people 5* log2 100 = 35 seconds

○ 200 people 5* log2 200 = 40 seconds

○ 400 people 5* log2 400 = 45 seconds

. . .○ 1,000,000 people 5* log2 1,000,000 = 100 seconds

Analysis For a list with n items, the best case is when the value is equal to the

first element of the list, in which case only one comparison is needed. The worst case is when the value is not in the list (or occurs only once at the end of the list), in which case n comparisons are needed. If the value being sought occurs k times in the list, and all orderings of the list are equally likely, the expected number of comparisons is:

Asymptotically, therefore, the worst-case cost and the expected cost of linear search are both O(n).

APPLICATION Linear search is usually very simple to

implement, and is practical when the list has only a few elements, or when performing a single search in an unordered list.

When many values have to be searched in the same list, it often pays to pre-process the latter in order to use a faster method. For example, one may sort the list and use binary search, or build any efficient search data structure from it.

ALGORITHM The following ALGORITHM describes a typical variant of

linear search, where the result of the search is supposed to be either the location of the list item where the desired value was found; or an invalid location Λ, to indicate that the desired element does not occur in the list.

Note: the last line is executed only after all list items have been examined with none matching.

If the list is stored as an array data structure, the location may be the index of the item found (usually between 1 and n, or 0 and n−1). In that case the invalid location Λ can be any index before the first element (such as 0 or −1, respectively) or after the last one (n+1 or n, respectively).

Recursive version Linear search can also be described as

a recursive algorithm:

Searching in reverse order Linear search in an array that is usually

programmed by stepping up an index variable until it reaches the last index.

In many computers, one can reduce the work of the first comparison by scanning the items in reverse order.

Example Algorithm Suppose an array A with elements indexed 1 to n is to be

searched for a value x. The following pseudocode performs a forward search, returning n + 1 if the value is not found:

The following pseudocode searches the array in the reverse order, returning 0 when the element is not found:

End…Thank you