Inf2D 05: Informed Search and Exploration for Agents

Inf2D 05: Informed Search andExploration for Agents

Valerio Restocchi

School of Informatics, University of Edinburgh

23/01/20

Slide Credits: Jacques Fleuriot, Michael Rovatsos, Michael Herrmann, Vaishak Belle

Outline

− Best-first search

− Greedy best-first search

− A∗ search

− Heuristics

− Admissibility

Review: Tree search

A search strategy is defined by picking the order of nodeexpansion from the frontier

Best-first search

− An instance of general TREE-SEARCH orGRAPH-SEARCH

− Idea: use an evaluation function f (n) for each node n

I estimate of “desirability”Ü Expand most desirable unexpanded node, usually thenode with the lowest evaluation

− Implementation:Order the nodes in frontier in decreasing order ofdesirability

− Special cases:

I Greedy best-first searchI A∗search

Romania with step costs in km

Greedy best-first search

− Evaluation function f (n) = h(n) (heuristic)

− h(n): estimated cost of cheapest path from state at noden to a goal state

I e.g., hSLD(n): straight-line distance from n to goal(Bucharest)

I Greedy best-first search expands the node that appearsto be closest to goal

Added slide: HeuristicWhat is a heuristic?

− From the greek word “heuriskein” meaning “to discover”or “to find”

− A heuristic is any method that is believed or practicallyproven to be useful for the solution of a given problem,although there is no guarantee that it will always work orlead to an optimal solution.

− Here we will use heuristics to guide tree search. This maynot change the worst case complexity of the algorithm,but can help in the average case.

− We will introduce conditions (admissibility, consistency,see below) in order to identify good heuristics, i.e. thosewhich actually lead to an improvement over uninformedsearch.

− See also: https://en.wikipedia.org/wiki/Heuristic7

Greedy best-first search example

Properties of greedy best-first search

− Complete? No – can get stuck in loops

I Graph search version is complete in finite space, but notin infinite ones

− Time? O(bm) for tree version, but a good heuristic cangive dramatic improvement

− Space? O(bm) – keeps all nodes in memory

− Optimal? No

A∗ search

− Idea: avoid expanding paths that are already expensive

− Evaluation function f (n) = g(n) + h(n)

I g(n): cost so far to reach nI h(n): estimated cost from n to goalI f (n): estimated total cost of path through n to goal

− A∗ is both complete and optimal if h(n) satisfies certainconditions

A∗ search example

Admissible heuristics

− A heuristic h(n) is admissible if for every node n,h(n) ≤ h∗(n), where h∗(n) is the true cost to reach thegoal state from n.

− An admissible heuristic never overestimates the cost toreach the goal, i.e., it is optimistic

I Thus, f (n) = g(n) + h(n) never overestimates the truecost of a solution

− Example: hSLD(n) (never overestimates the actual roaddistance)

− Theorem: If h(n) is admissible, A∗ using TREE-SEARCHis optimal.

Optimality of A∗ (proof)− Suppose some suboptimal goal G2 has been generated

and is in the frontier. Let n be an unexpanded node inthe frontier such that n is on a shortest path to anoptimal goal G .

− f (G2) = g(G2) since h(G2) = 0

− f (G ) = g(G ) since h(G ) = 0

− g(G2) > g(G ) since G2 is suboptimal

− f (G2) > f (G ) from above

Optimality of A∗ (proof cntd.)− Suppose some suboptimal goal G2 has been generated

and is in the frontier. Let n be an unexpanded node inthe frontier such that n is on a shortest path to anoptimal goal G .

− f (G ) < f (G2) from above (G2 is suboptimal)

− h(n) ≤ h∗(n) since h is admissible

− g(n) + h (n) ≤ g (n) + h∗ (n) = f (G )

− f (n) ≤ f (G )

Hence f (n) < f (G2) ⇒ A∗ will never select G2 for expansion.

Consistent heuristics− A heuristic is consistent if for every node n, every

successor n′ of n generated by any action a,

h(n) ≤ c(n, a, n′) + h(n′)

− If h is consistent, we have

f (n′) = g(n′) + h(n′)

= g(n) + c(n, a, n′) + h(n′)

≥ g(n) + h(n)

≥ f (n)

− i.e., f (n) is non-decreasingalong any path.

Theorem:If h(n) is consistent, A∗ using GRAPH-SEARCH is optimal.

Optimality of A∗

− A∗ expands nodes in order of increasing f value

− Gradually adds “f -contours” of nodes

− Contour i has all nodes with f = fi , where fi < fi+1

Properties of A∗

− Complete? Yes (unless there are infinitely many nodeswith f ≤ f (G ))

− Time? Exponential

− Space? Keeps all nodes in memory

− Optimal? Yes

Admissible heuristics

Example:

− for the 8-puzzle:

I h1(n): number of misplaced tilesI h2(n): total Manhattan distance

(i.e., no. of squares from desired location of each tile)

Exercise: Calculatethese two values:

− h1 (S) = ?

− h2 (S) = ?

Dominance

− If h2(n) ≥ h1(n) for all n (both admissible) then

I h2 dominates h1I h2 is better for search

− Typical search costs (average number of nodes expanded):

I d = 12 IDS = 3,644,035 nodesA∗(h1) = 227 nodesA∗(h2) = 73 nodes

I d = 24 IDS = too many nodesA∗(h1) = 39,135 nodesA∗(h2) = 1,641 nodes

Relaxed problems

− A problem with fewer restrictions on the actions is calleda relaxed problem

− The cost of an optimal solution to a relaxed problem is anadmissible heuristic for the original problem

− If the rules of the 8-puzzle are relaxed so that a tile canmove anywhere,

I then h1(n) gives the shortest solution

− If the rules are relaxed so that a tile can move to anyadjacent square,

I then h2(n) gives the shortest solution

− Can use relaxation to automatically generate admissibleheuristics

Summary

Smart search based on heuristic scores.

− Best-first search

− Greedy best-first search

− A∗ search

− Admissible heuristics and optimality.

Inf2D 05: Informed Search and Exploration for Agents

Documents

Transcript of Inf2D 05: Informed Search and Exploration for Agents

Arti cial Intelligence Intelligent Agents · 2020. 1. 23. · Intelligent agents Four types of agents: Re ex agents, model-based agents, goal-based agents, and utility-based agents.

PHYSICS-INFORMED KRIGING: A PHYSICS-INFORMED GAUSSIAN ...

NBER WORKING PAPER SERIES MARKET DISTORTIONS WHEN …pricetheory.uchicago.edu/levitt/Papers/LevittSyverson2004.pdf · Market Distortions when Agents are Better Informed: The Value

Goal-Based Agents Informed Search - Iowa State Universityweb.cs.iastate.edu/~cs572/cs572informed-search09.pdf · Informed search • Informed use problem-specific knowledge • Which

DRIVE BETTER- INFORMED DECISIONS€¦ · relationships with customers and agents. NEW BUSINESS & UNDERWRITING Protect your Use your new business and underwriting operations data to

Adverse Selection with Heterogeneously Informed Agents€¦ · Jinkins, Ricardo Lagos, Shouyong Shi, Venky Venkateswaran, Randy Wright and the participants in several seminars and

Adrenal Agents. Women’s Health Agents. Men’s Health Agents.

Chemical Weapons Bolechová, Havelková. Types of Chemical Weapons Nerve Agents Blister Agents Blood Agents Choking Agents Incapacitating Agents.

KEEP AGENTS INFORMED AND IN TOUCH

Agents & Mobile Agents Introduction – Agents & Mobile Agents ©Shonali Krishnaswamy1.

Informed Search and Exploration for Agents · Informatics 2D Informed Search and Exploration for Agents R&N: § 3.5, 3.6 Michael Rovatsos University of Edinburgh 29th January 2015

Keeping Michigan consumers safe and informed! CONSUMER ...€¦ · ones to benefit are the agents who earn large commissions on financial products that may take veterans’ funds

NQF-ENDORSED VOLUNTARY CONSENSUS STANDARDS FOR …SCIP-Inf2a Overall Rate ** SCIP-Inf2b CABG Table 5.01 SCIP-Inf2c Other Cardiac Surgery Table 5.02 SCIP-Inf2d Hip Arthroplasty Table

Decentralized Trading With Private Information...trades of the two assets. However, before trading begins, a fraction of agents— the informed agents—receive some information about

Strategies to Make Informed Consent Truly Informed

Patient experience of the informed consent process during ......The original trial compared two anticoagulant agents in patients undergoing coronary intervention. A witnessed oral

Diagnosis and Treatment of Diabetic Foot Infections · informed choices among the various topical, oral, and paren-teral antibiotic agents. Virtually all severe and some moderate

(Antimetabolites agents, Alkylating agents ...Adverse effects of anticancer drugs (Antimetabolites agents, Alkylating agents , Antimicrotubule agents, Miscellaneous agents , Immune

Understanding the Forward Premium Puzzle: A Microstructure ... · transparent way, the adverse-selection rationale for the forward premium puzzle. 7 The presence of informed agents

Position Classification Standard for Purchasing Series, GS ... · Purchasing agents keep informed of available goods and services by checking commercial catalogs, contact files, and