Artificial Intelligence: Search. Marc Toussaint, University of Stuttgart, Winter 2016/17 (2016-12-21)


Artificial Intelligence

Search

Marc Toussaint, University of Stuttgart, Winter 2016/17

(slides based on Stuart Russell’s AI course)

Motivation: Search algorithms are a core tool for decision making, especially when the domain is too complex to use alternatives like Dynamic Programming. In recent years, with the increase in computational power, search methods became the method of choice for complex domains, like the game of Go, or certain POMDPs. Learning about search tree algorithms is an important background for several reasons:

• The concept of decision trees, which represent the space of possible future decisions and state transitions, is generally important for thinking about decision problems.

• In probabilistic domains, tree search algorithms are a special case of Monte-Carlo methods to estimate some expectation, typically the so-called Q-function. The respective Monte-Carlo Tree Search algorithms are the state-of-the-art in many domains.

• Tree search is also the background for backtracking in CSPs as well as forward and backward search in logic domains.

We will cover the basic tree search methods (breadth-first, depth-first, iterative deepening) and eventually A∗.

Search – – 1/48

Outline

• Problem formulation & examples

• Basic search algorithms

Search – – 2/48

Problem Formulation & Examples

Search – Problem Formulation & Examples – 3/48

Example: Romania

On holiday in Romania; currently in Arad. Flight leaves tomorrow from Bucharest.

Formulate goal: be in Bucharest, Sgoal = {Bucharest}

Formulate problem:
states: various cities, S = {Arad, Timisoara, . . . }
actions: drive between cities, A = {edges between states}

Find solution: sequence of cities, e.g., Arad, Sibiu, Fagaras, Bucharest; minimize costs with cost function (s, a) ↦ c

Search – Problem Formulation & Examples – 4/48

Example: Romania

Search – Problem Formulation & Examples – 5/48

Deterministic, fully observable search problem

Def.: A deterministic, fully observable search problem is defined by four items:

initial state s0 ∈ S, e.g., s0 = Arad
successor function succ : S × A → S, e.g., succ(Arad, Arad-Zerind) = Zerind
goal states Sgoal ⊆ S, e.g., s = Bucharest
step cost function cost(s, a, s′), assumed to be ≥ 0, e.g., traveled distance, number of actions executed, etc.; the path cost is the sum of step costs

A solution is a sequence of actions leading from s0 to a goal.
An optimal solution is a solution with minimal path cost.

Search – Problem Formulation & Examples – 6/48
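The four items are easy to write down concretely. A minimal Python sketch, using a fragment of the Romania road map (city names and km distances as on the standard map; the dictionary layout and function names are my own):

```python
# A deterministic, fully observable search problem, written out directly.
# The city graph is a fragment of the Romania road map (distances in km).
ROADS = {
    "Arad":           {"Zerind": 75, "Sibiu": 140, "Timisoara": 118},
    "Zerind":         {"Arad": 75, "Oradea": 71},
    "Oradea":         {"Zerind": 71, "Sibiu": 151},
    "Timisoara":      {"Arad": 118},
    "Sibiu":          {"Arad": 140, "Oradea": 151, "Fagaras": 99, "Rimnicu Vilcea": 80},
    "Fagaras":        {"Sibiu": 99, "Bucharest": 211},
    "Rimnicu Vilcea": {"Sibiu": 80, "Pitesti": 97},
    "Pitesti":        {"Rimnicu Vilcea": 97, "Bucharest": 101},
    "Bucharest":      {"Fagaras": 211, "Pitesti": 101},
}

s0 = "Arad"                       # initial state s0 in S
S_goal = {"Bucharest"}            # goal states

def succ(s, a):                   # successor function succ : S x A -> S
    return a                      # an action here is simply "drive to that city"

def cost(s, a, s_next):           # step cost, assumed >= 0 (km traveled)
    return ROADS[s][a]

def path_cost(path):              # path cost = sum of step costs
    return sum(cost(s, t, t) for s, t in zip(path, path[1:]))
```

For instance, the solution Arad, Sibiu, Fagaras, Bucharest from the previous slide has path cost 140 + 99 + 211 = 450 km.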

Example: vacuum world state space graph

states??: integer dirt and robot locations (ignore dirt amounts etc.)
actions??: Left, Right, Suck, NoOp
goal test??: no dirt
path cost??: 1 per action (0 for NoOp)

Search – Problem Formulation & Examples – 7/48


Example: The 8-puzzle

states??: integer locations of tiles (ignore intermediate positions)
actions??: move blank left, right, up, down (ignore unjamming etc.)
goal test??: = goal state (given)
path cost??: 1 per move
[Note: optimal solution of the n-Puzzle family is NP-hard]

Search – Problem Formulation & Examples – 8/48


Basic Tree Search Algorithms

Search – Basic Tree Search Algorithms – 9/48

Tree search example

Search – Basic Tree Search Algorithms – 10/48


Implementation: states vs. nodes

• A state is a (representation of) a physical configuration

• A node is a data structure constituting part of a search tree; includes parent, children, depth, path cost g(x)

(States do not have parents, children, depth, or path cost!)

• The Expand function creates new nodes, filling in the various fields and using the SuccessorFn of the problem to create the corresponding states.

Search – Basic Tree Search Algorithms – 13/48

Implementation: general tree search

function Tree-Search(problem, fringe) returns a solution, or failure
    fringe ← Insert(Make-Node(Initial-State[problem]), fringe)
    loop do
        if fringe is empty then return failure
        node ← Remove-Front(fringe)
        if Goal-Test(problem, State[node]) then return node
        fringe ← InsertAll(Expand(node, problem), fringe)

function Expand(node, problem) returns a set of nodes
    successors ← the empty set
    for each action, result in Successor-Fn(problem, State[node]) do
        s ← a new Node
        Parent-Node[s] ← node; Action[s] ← action; State[s] ← result
        Path-Cost[s] ← Path-Cost[node] + Step-Cost(State[node], action, result)
        Depth[s] ← Depth[node] + 1
        add s to successors
    return successors

Search – Basic Tree Search Algorithms – 14/48
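The pseudocode translates almost line for line into Python. A sketch under the assumption that `successor_fn(state)` yields `(action, result, step_cost)` triples (that interface is my own simplification of the slides' Successor-Fn plus Step-Cost):

```python
from collections import deque

class Node:
    """Search-tree node: a state plus parent, action, depth, path cost g."""
    def __init__(self, state, parent=None, action=None, step_cost=0):
        self.state, self.parent, self.action = state, parent, action
        self.path_cost = (parent.path_cost if parent else 0) + step_cost
        self.depth = (parent.depth + 1) if parent else 0

def expand(node, successor_fn):
    """Create child nodes; successor_fn(state) yields (action, result, cost)."""
    return [Node(result, node, action, c)
            for action, result, c in successor_fn(node.state)]

def tree_search(initial_state, goal_test, successor_fn, insert_all):
    """Generic tree search; the strategy is fixed by insert_all(fringe, nodes)."""
    fringe = deque([Node(initial_state)])
    while fringe:                          # if fringe is empty: failure
        node = fringe.popleft()            # Remove-Front
        if goal_test(node.state):
            return node
        insert_all(fringe, expand(node, successor_fn))
    return None                            # failure
```

With `insert_all = lambda f, ns: f.extend(ns)` the fringe is FIFO (breadth-first); with `lambda f, ns: f.extendleft(reversed(ns))` it is LIFO (depth-first).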

Search strategies

• A strategy is defined by picking the order of node expansion

• Strategies are evaluated along the following dimensions:
completeness: does it always find a solution if one exists?
time complexity: number of nodes generated/expanded
space complexity: maximum number of nodes in memory
optimality: does it always find a least-cost solution?

• Time and space complexity are measured in terms of
b = maximum branching factor of the search tree
d = depth of the least-cost solution
m = maximum depth of the state space (may be ∞)

Search – Basic Tree Search Algorithms – 15/48

Uninformed search strategies

• Uninformed strategies use only the information available in the problem definition

– Breadth-first search
– Uniform-cost search
– Depth-first search
– Depth-limited search
– Iterative deepening search

Search – Basic Tree Search Algorithms – 16/48

Breadth-first search

• Expand shallowest unexpanded node

• Implementation: fringe is a FIFO queue, i.e., new successors go at the end

Search – Basic Tree Search Algorithms – 17/48
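As a concrete sketch (the unit-cost adjacency list is an assumption, again a fragment of the Romania map; the fringe stores whole paths for simplicity):

```python
from collections import deque

# Unweighted adjacency (fragment of the Romania map; step cost 1 per edge)
NEXT = {
    "Arad": ["Zerind", "Sibiu", "Timisoara"],
    "Zerind": ["Arad", "Oradea"],
    "Oradea": ["Zerind", "Sibiu"],
    "Timisoara": ["Arad"],
    "Sibiu": ["Arad", "Oradea", "Fagaras", "Rimnicu Vilcea"],
    "Fagaras": ["Sibiu", "Bucharest"],
    "Rimnicu Vilcea": ["Sibiu", "Pitesti"],
    "Pitesti": ["Rimnicu Vilcea", "Bucharest"],
    "Bucharest": [],
}

def breadth_first_search(start, goal):
    """Fringe is a FIFO queue: new successors go at the end,
    so the shallowest unexpanded node is always expanded next."""
    fringe = deque([[start]])            # fringe of paths
    while fringe:
        path = fringe.popleft()          # shallowest unexpanded node
        if path[-1] == goal:
            return path
        for nxt in NEXT[path[-1]]:
            fringe.append(path + [nxt])  # successors go at the end
    return None
```

Because the FIFO queue processes all depth-k nodes before any depth-(k+1) node, the first goal returned is a shallowest one, even without repeated-state checking.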


Properties of breadth-first search

Complete?? Yes (if b is finite)
Time?? 1 + b + b^2 + b^3 + ... + b^d + b(b^d − 1) = O(b^(d+1)), i.e., exponential in d
Space?? O(b^(d+1)) (keeps every node in memory)
Optimal?? Yes, if cost-per-step = 1; not optimal otherwise

Space is the big problem; can easily generate nodes at 100MB/sec, so 24hrs = 8640GB.

Search – Basic Tree Search Algorithms – 18/48


Uniform-cost search

• “Cost-aware BFS”: Expand least-cost unexpanded node

• Implementation: fringe = queue ordered by path cost, lowest first

• Equivalent to breadth-first if step costs all equal

Complete?? Yes, if step cost ≥ ε
Time?? # of nodes with g ≤ cost of optimal solution, O(b^⌈C∗/ε⌉), where C∗ is the cost of the optimal solution
Space?? # of nodes with g ≤ cost of optimal solution, O(b^⌈C∗/ε⌉)
Optimal?? Yes: nodes expanded in increasing order of g(n)

Search – Basic Tree Search Algorithms – 19/48
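A sketch with Python's `heapq` as the priority queue (the `ROADS` dictionary with km step costs is assumed data from the Romania map). Since nodes are popped in increasing order of g, the first goal popped is optimal:

```python
import heapq

ROADS = {
    "Arad":           {"Zerind": 75, "Sibiu": 140, "Timisoara": 118},
    "Zerind":         {"Arad": 75, "Oradea": 71},
    "Oradea":         {"Zerind": 71, "Sibiu": 151},
    "Timisoara":      {"Arad": 118},
    "Sibiu":          {"Arad": 140, "Oradea": 151, "Fagaras": 99, "Rimnicu Vilcea": 80},
    "Fagaras":        {"Sibiu": 99, "Bucharest": 211},
    "Rimnicu Vilcea": {"Sibiu": 80, "Pitesti": 97},
    "Pitesti":        {"Rimnicu Vilcea": 97, "Bucharest": 101},
    "Bucharest":      {},
}

def uniform_cost_search(start, goal):
    """Fringe = priority queue ordered by path cost g, lowest first."""
    fringe = [(0, [start])]               # (g, path)
    while fringe:
        g, path = heapq.heappop(fringe)   # least-cost unexpanded node
        if path[-1] == goal:
            return g, path                # popped in increasing g => optimal
        for nxt, c in ROADS[path[-1]].items():
            heapq.heappush(fringe, (g + c, path + [nxt]))
    return None
```

On this data it returns the 418 km route Arad, Sibiu, Rimnicu Vilcea, Pitesti, Bucharest rather than the shorter-in-hops but 450 km route via Fagaras.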


Depth-first search

• Expand deepest unexpanded node

• Implementation: fringe = LIFO queue, i.e., put successors at the front

Search – Basic Tree Search Algorithms – 20/48
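A sketch using a plain Python list as the LIFO stack. Note there is no repeated-state check, so this only terminates on a finite tree; the state space here (binary strings of length ≤ 3) is an artificial example of mine:

```python
def dfs(start, goal_test, successors):
    """Depth-first tree search: the fringe is a LIFO stack, so the
    deepest unexpanded node is expanded next.  Without a repeated-state
    check this may not terminate on infinite-depth spaces or loops."""
    fringe = [[start]]                        # fringe of paths, used as a stack
    while fringe:
        path = fringe.pop()                   # deepest unexpanded node
        if goal_test(path[-1]):
            return path
        for nxt in reversed(successors(path[-1])):
            fringe.append(path + [nxt])       # successors go at the front
    return None
```

The `reversed` call just makes the leftmost successor come off the stack first, so the tree is swept left to right.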


Properties of depth-first search

Complete?? No: fails in infinite-depth spaces, spaces with loops. Modify to avoid repeated states along path ⇒ complete in finite spaces
Time?? O(b^m): terrible if m is much larger than d, but if solutions are dense, may be much faster than breadth-first
Space?? O(bm), i.e., linear space!
Optimal?? No

Search – Basic Tree Search Algorithms – 21/48


Depth-limited search

• depth-first search with depth limit l, i.e., nodes at depth l have no successors

• Recursive implementation using the stack as LIFO:

function Depth-Limited-Search(problem, limit) returns soln/fail/cutoff
    Recursive-DLS(Make-Node(Initial-State[problem]), problem, limit)

function Recursive-DLS(node, problem, limit) returns soln/fail/cutoff
    cutoff-occurred? ← false
    if Goal-Test(problem, State[node]) then return node
    else if Depth[node] = limit then return cutoff
    else for each successor in Expand(node, problem) do
        result ← Recursive-DLS(successor, problem, limit)
        if result = cutoff then cutoff-occurred? ← true
        else if result ≠ failure then return result
    if cutoff-occurred? then return cutoff else return failure

Search – Basic Tree Search Algorithms – 22/48

Iterative deepening search

function Iterative-Deepening-Search(problem) returns a solution
    inputs: problem, a problem
    for depth ← 0 to ∞ do
        result ← Depth-Limited-Search(problem, depth)
        if result ≠ cutoff then return result
    end

Search – Basic Tree Search Algorithms – 23/48
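Both functions in Python; the `cutoff`/`failure` sentinels mirror the pseudocode, and recursing on states rather than nodes is a simplification of mine (a remaining-depth counter replaces the Depth[node] = limit test):

```python
CUTOFF, FAILURE = "cutoff", "failure"

def depth_limited_search(state, goal_test, successors, limit):
    """Recursive DLS: nodes at the depth limit get no successors."""
    if goal_test(state):
        return [state]
    if limit == 0:
        return CUTOFF                      # depth limit reached
    cutoff_occurred = False
    for nxt in successors(state):
        result = depth_limited_search(nxt, goal_test, successors, limit - 1)
        if result == CUTOFF:
            cutoff_occurred = True
        elif result != FAILURE:
            return [state] + result        # prepend this state to the solution
    return CUTOFF if cutoff_occurred else FAILURE

def iterative_deepening_search(state, goal_test, successors, max_depth=50):
    """Run DLS with limits 0, 1, 2, ... until no cutoff occurs."""
    for depth in range(max_depth + 1):
        result = depth_limited_search(state, goal_test, successors, depth)
        if result != CUTOFF:
            return result
    return FAILURE
```

The `max_depth` bound is an assumption of mine to keep the loop finite in code; the pseudocode iterates to ∞.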

Iterative deepening search

Search – Basic Tree Search Algorithms – 24/48


Properties of iterative deepening search

Complete?? Yes
Time?? (d+1)b^0 + d·b^1 + (d−1)b^2 + ... + b^d = O(b^d)
Space?? O(bd)
Optimal?? Yes, if step cost = 1. Can be modified to explore the uniform-cost tree

Numerical comparison for b = 10 and d = 5, solution at far left leaf:

N(IDS) = 50 + 400 + 3,000 + 20,000 + 100,000 = 123,450
N(BFS) = 10 + 100 + 1,000 + 10,000 + 100,000 + 999,990 = 1,111,100

IDS does better because other nodes at depth d are not expanded. BFS can be modified to apply the goal test when a node is generated.

Search – Basic Tree Search Algorithms – 25/48


Summary of algorithms

Criterion   Breadth-First   Uniform-Cost   Depth-First   Depth-Limited   Iterative Deepening
Complete?   Yes*            Yes*           No            Yes, if l ≥ d   Yes
Time        b^(d+1)         b^⌈C∗/ε⌉       b^m           b^l             b^d
Space       b^(d+1)         b^⌈C∗/ε⌉      bm            bl              bd
Optimal?    Yes*            Yes            No            No              Yes*

Search – Basic Tree Search Algorithms – 26/48

Loops: Repeated states

Failure to detect repeated states can turn a linear problem into an exponential one!

Search – Basic Tree Search Algorithms – 27/48

Graph search

function Graph-Search(problem, fringe) returns a solution, or failure
    closed ← an empty set
    fringe ← Insert(Make-Node(Initial-State[problem]), fringe)
    loop do
        if fringe is empty then return failure
        node ← Remove-Front(fringe)
        if Goal-Test(problem, State[node]) then return node
        if State[node] is not in closed then
            add State[node] to closed
            fringe ← InsertAll(Expand(node, problem), fringe)
    end

But: storing all visited nodes leads again to exponential space complexity (as for BFS)

Search – Basic Tree Search Algorithms – 28/48
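The same idea as a sketch: breadth-first graph search with a `closed` set, so each state is expanded at most once (the unit-cost Romania fragment is assumed data, as before):

```python
from collections import deque

NEXT = {
    "Arad": ["Zerind", "Sibiu", "Timisoara"],
    "Zerind": ["Arad", "Oradea"],
    "Oradea": ["Zerind", "Sibiu"],
    "Timisoara": ["Arad"],
    "Sibiu": ["Arad", "Oradea", "Fagaras", "Rimnicu Vilcea"],
    "Fagaras": ["Sibiu", "Bucharest"],
    "Rimnicu Vilcea": ["Sibiu", "Pitesti"],
    "Pitesti": ["Rimnicu Vilcea", "Bucharest"],
    "Bucharest": [],
}

def graph_search_bfs(start, goal):
    """Tree-Search plus a 'closed' set: expand each state at most once."""
    closed = set()
    fringe = deque([[start]])
    while fringe:
        path = fringe.popleft()
        state = path[-1]
        if state == goal:
            return path
        if state not in closed:              # skip repeated states
            closed.add(state)
            for nxt in NEXT[state]:
                fringe.append(path + [nxt])
    return None
```

The `closed` set is what can grow exponentially in memory-limited settings, which is the caveat the slide ends on.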

Summary

In BFS (or uniform-cost search), the fringe propagates layer-wise, containing nodes of similar distance-from-start (cost-so-far), leading to optimal paths but exponential space complexity O(b^(d+1)).

In DFS, the fringe is like a deep light beam sweeping over the tree, with space complexity O(bm). Iteratively deepening it also leads to optimal paths.

Graph search can be exponentially more efficient than tree search, but storing the visited nodes may lead to exponential space complexity, as for BFS.

Search – Basic Tree Search Algorithms – 29/48

Greedy and A∗ Search

Search – Greedy and A∗Search – 30/48

Best-first search

Idea: use an arbitrary priority function f(n) for each node
– actually f(n) is neg-priority: nodes with lower f(n) have higher priority

f(n) should reflect which nodes could be on an optimal path
– could is optimistic: the lower f(n), the more optimistic you are that n is on an optimal path

⇒ Expand the unexpanded node with highest priority

Implementation: fringe is a queue sorted in decreasing order of priority

Special cases:
– greedy search
– A∗ search

Search – Greedy and A∗Search – 31/48

Uniform-Cost Search as special case

• Define g(n) = cost-so-far to reach n

• Then Uniform-Cost Search is Prioritized Search with f = g

Search – Greedy and A∗Search – 32/48

Romania with step costs in km

Search – Greedy and A∗Search – 33/48

Greedy search

We set the priority function equal to a heuristic, f(n) = h(n)
h(n) = estimate of cost from n to the closest goal
E.g., hSLD(n) = straight-line distance from n to Bucharest

Greedy search expands the node that appears to be closest to the goal

Search – Greedy and A∗Search – 34/48
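A sketch with the straight-line-distance heuristic (the `H_SLD` values are the standard ones for the Bucharest goal; the `ROADS` fragment is assumed data as before). Note the result: greedy search returns the Arad, Sibiu, Fagaras, Bucharest route, 450 km, not the 418 km optimum:

```python
import heapq

ROADS = {
    "Arad":           {"Zerind": 75, "Sibiu": 140, "Timisoara": 118},
    "Zerind":         {"Arad": 75, "Oradea": 71},
    "Oradea":         {"Zerind": 71, "Sibiu": 151},
    "Timisoara":      {"Arad": 118},
    "Sibiu":          {"Arad": 140, "Oradea": 151, "Fagaras": 99, "Rimnicu Vilcea": 80},
    "Fagaras":        {"Sibiu": 99, "Bucharest": 211},
    "Rimnicu Vilcea": {"Sibiu": 80, "Pitesti": 97},
    "Pitesti":        {"Rimnicu Vilcea": 97, "Bucharest": 101},
    "Bucharest":      {},
}
H_SLD = {"Arad": 366, "Zerind": 374, "Oradea": 380, "Timisoara": 329,
         "Sibiu": 253, "Fagaras": 176, "Rimnicu Vilcea": 193,
         "Pitesti": 100, "Bucharest": 0}

def greedy_search(start, goal):
    """Best-first search with priority f(n) = h(n): always expand the
    node that appears closest to the goal, ignoring cost-so-far."""
    fringe = [(H_SLD[start], [start])]
    while fringe:
        _, path = heapq.heappop(fringe)
        if path[-1] == goal:
            return path
        for nxt in ROADS[path[-1]]:
            heapq.heappush(fringe, (H_SLD[nxt], path + [nxt]))
    return None
```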

Greedy search example

Search – Greedy and A∗Search – 35/48


Properties of greedy search

Complete?? No: can get stuck in loops, e.g., Iasi → Neamt → Iasi → Neamt → ... Complete in finite space with repeated-state checking
Time?? O(b^m), but a good heuristic can give dramatic improvement
Space?? O(b^m): keeps all nodes in memory
Optimal?? No

Greedy search does not care about the 'past' (the cost-so-far).

Search – Greedy and A∗Search – 36/48


A∗ search

Idea: combine information from the past and the future
neg-priority = cost-so-far + estimated cost-to-go
Evaluation function f(n) = g(n) + h(n)
g(n) = cost-so-far to reach n
h(n) = estimated cost-to-go from n
f(n) = estimated total cost of path through n to goal

A∗ search uses an admissible (= optimistic) heuristic,
i.e., h(n) ≤ h∗(n) where h∗(n) is the true cost-to-go from n.
(Also require h(n) ≥ 0, so h(G) = 0 for any goal G.)
E.g., hSLD(n) never overestimates the actual road distance

Theorem: A∗ search is optimal (= finds the optimal path)

Search – Greedy and A∗Search – 37/48
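The same fringe as in greedy search, now ordered by f = g + h; a sketch reusing the assumed Romania data. With the admissible h_SLD, the first goal popped is the optimal 418 km route:

```python
import heapq

ROADS = {
    "Arad":           {"Zerind": 75, "Sibiu": 140, "Timisoara": 118},
    "Zerind":         {"Arad": 75, "Oradea": 71},
    "Oradea":         {"Zerind": 71, "Sibiu": 151},
    "Timisoara":      {"Arad": 118},
    "Sibiu":          {"Arad": 140, "Oradea": 151, "Fagaras": 99, "Rimnicu Vilcea": 80},
    "Fagaras":        {"Sibiu": 99, "Bucharest": 211},
    "Rimnicu Vilcea": {"Sibiu": 80, "Pitesti": 97},
    "Pitesti":        {"Rimnicu Vilcea": 97, "Bucharest": 101},
    "Bucharest":      {},
}
H_SLD = {"Arad": 366, "Zerind": 374, "Oradea": 380, "Timisoara": 329,
         "Sibiu": 253, "Fagaras": 176, "Rimnicu Vilcea": 193,
         "Pitesti": 100, "Bucharest": 0}

def a_star(start, goal):
    """Best-first search with f(n) = g(n) + h(n).  With admissible h
    (h_SLD never overestimates road distance) the first goal popped
    is an optimal solution."""
    fringe = [(H_SLD[start], 0, [start])]        # (f, g, path)
    while fringe:
        f, g, path = heapq.heappop(fringe)
        if path[-1] == goal:
            return g, path
        for nxt, c in ROADS[path[-1]].items():
            heapq.heappush(fringe, (g + c + H_SLD[nxt], g + c, path + [nxt]))
    return None
```

Comparing with the greedy sketch: the only change is adding g to the priority, and that single change buys optimality.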

A∗ search example

Search – Greedy and A∗Search – 38/48


Proof of optimality of A∗

Suppose some suboptimal goal G2 has been generated and is in the fringe (but has not yet been selected to be tested for the goal condition!). We want to prove: any node on a shortest path to an optimal goal G will be expanded before G2.

Let n be an unexpanded node on a shortest path to G.

f(G2) = g(G2) since h(G2) = 0

> g(G) since G2 is suboptimal

≥ f(n) since h is admissible

Since f(n) < f(G2), A∗ will expand n before G2. This is true for any n on theshortest path. In particular, at some time G is added to the fringe, and sincef(G) = g(G) < f(G2) = g(G2) it will select G before G2 for goal testing.

Search – Greedy and A∗Search – 39/48

Properties of A∗

Complete?? Yes, unless there are infinitely many nodes with f ≤ f(G)
Time?? Exponential in [relative error in h × length of soln.]
Space?? Exponential: keeps all nodes in memory
Optimal?? Yes

A∗ expands all nodes with f(n) < C∗
A∗ expands some nodes with f(n) = C∗
A∗ expands no nodes with f(n) > C∗

Search – Greedy and A∗Search – 40/48


Optimality of A∗ (more useful)

Lemma: A∗ expands nodes in order of increasing f value∗

Gradually adds "f-contours" of nodes (cf. breadth-first adds layers).
Contour i has all nodes with f = f_i, where f_i < f_{i+1}

Search – Greedy and A∗Search – 41/48

Proof of lemma: Consistency

A heuristic is consistent if

h(n) ≤ c(n, a, n′) + h(n′)

If h is consistent, we have

f(n′) = g(n′) + h(n′)

= g(n) + c(n, a, n′) + h(n′)

≥ g(n) + h(n)

= f(n)

I.e., f(n) is nondecreasing along any path.

Search – Greedy and A∗Search – 42/48


Admissible heuristics

E.g., for the 8-puzzle:
h1(n) = number of misplaced tiles
h2(n) = total Manhattan distance (i.e., no. of squares from desired location of each tile)

h1(S) = 6
h2(S) = 4+0+3+3+1+0+2+1 = 14

Search – Greedy and A∗Search – 43/48
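h1 and h2 as code. The slide's start state S is in a figure not reproduced here, so the encoding and test states below are my own: states are 9-tuples in row-major order, 0 is the blank, and the goal layout is assumed to be tiles 1–8 followed by the blank:

```python
GOAL = (1, 2, 3, 4, 5, 6, 7, 8, 0)   # assumed goal layout; 0 = blank

def h1(state, goal=GOAL):
    """Number of misplaced tiles (the blank does not count as a tile)."""
    return sum(1 for s, g in zip(state, goal) if s != 0 and s != g)

def h2(state, goal=GOAL):
    """Total Manhattan distance of each tile from its goal square."""
    where = {t: (i // 3, i % 3) for i, t in enumerate(state)}
    total = 0
    for i, t in enumerate(goal):
        if t != 0:
            r, c = where[t]
            total += abs(r - i // 3) + abs(c - i % 3)
    return total
```

Both are admissible: every misplaced tile needs at least one move (h1), and in fact at least its Manhattan distance in moves (h2); hence h2(n) ≥ h1(n) for all n.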

Dominance

If h2(n) ≥ h1(n) for all n (both admissible), then h2 dominates h1 and is better for search.

Typical search costs:
d = 14: IDS = 3,473,941 nodes; A∗(h1) = 539 nodes; A∗(h2) = 113 nodes
d = 24: IDS ≈ 54,000,000,000 nodes; A∗(h1) = 39,135 nodes; A∗(h2) = 1,641 nodes

Given any admissible heuristics ha, hb,
h(n) = max(ha(n), hb(n))
is also admissible and dominates ha, hb

Search – Greedy and A∗Search – 44/48
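The last claim in code form, a one-liner combinator (the function name is my own):

```python
def max_heuristic(*hs):
    """Pointwise maximum of admissible heuristics: each value is still
    <= the true cost-to-go, so the max is admissible, and it is >= every
    individual h, so it dominates each of them."""
    return lambda n: max(h(n) for h in hs)
```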

Relaxed problems

Admissible heuristics can be derived from the exact solution cost of a relaxed version of the problem.
If the rules of the 8-puzzle are relaxed so that a tile can move anywhere, then h1(n) gives the shortest solution.
If the rules are relaxed so that a tile can move to any adjacent square, then h2(n) gives the shortest solution.
Key point: the optimal solution cost of a relaxed problem is no greater than the optimal solution cost of the real problem.

Search – Greedy and A∗Search – 45/48

Memory-bounded A∗

As with BFS, A∗ has exponential space complexity. Iterative-deepening A∗ works for integer path costs, but is problematic for real-valued costs.

(Simplified) Memory-bounded A∗ (SMA∗):
– Expand as usual until a memory bound is reached
– Then, whenever adding a node, remove the worst node n′ from the tree
– worst means: the n′ with highest f(n′)
– To not lose information, back up the measured step cost cost(n, a, n′) to improve the heuristic h(n) of its parent

SMA∗ is complete and optimal if the depth of the optimal path is within the memory bound.

Search – Greedy and A∗Search – 46/48

Summary

Combine information from the past and the future. A heuristic function h(n) represents information about the future
– it estimates cost-to-go optimistically

Good heuristics can dramatically reduce search cost

Greedy best-first search expands lowest h
– incomplete and not always optimal

A∗ search expands lowest f = g + h
– neg-priority = cost-so-far + estimated cost-to-go
– complete and optimal
– also optimally efficient (up to tie-breaks, for forward search)

Admissible heuristics can be derived from exact solution of relaxed problems

Memory-bounded strategies exist

Search – Greedy and A∗Search – 47/48

Outlook

Tree search with partial observations
– rather discuss this in a fully probabilistic setting later

Tree search for games
– minimax extension to tree search
– discuss state-of-the-art probabilistic Monte-Carlo tree search methods later

methods later

Search – Greedy and A∗Search – 48/48