
Behavioural priors: Learning to search efficiently in action planning

Aapo Hyvärinen

Depts of Computer Science and Psychology, University of Helsinki

Abstract

Prior knowledge is important in perception: what kinds of objects/scenes are typical/frequent? This is formalized as prior probabilities in Bayesian inference.

We propose that the same is needed in action planning: what kinds of action sequences are typically good?

The number of possible action sequences is large: it is computationally more efficient to constrain the search to those which are typically good.

Basic framework: Planning

Planning is thoroughly investigated in classic AI. The agent is in state A and wants to get to state B: what sequence of actions is needed? The agent is assumed to have a world model.

There is an exponential explosion in computation: for a actions and t time steps, there are a^t possibilities. Exhaustive search is impossible.
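As a rough illustration (not part of the original slides), a few lines of Python show how fast the number of candidate action sequences grows with the planning horizon:

```python
# Tiny illustration of the search-space explosion in exhaustive planning:
# with a = 4 actions and horizon t, there are a**t candidate sequences.
from itertools import product

actions = ["up", "down", "left", "right"]          # a = 4

for t in range(1, 11):
    print(f"horizon {t:2d}: {len(actions) ** t:>10,} candidate sequences")

# Enumerating them explicitly is feasible only for very small t:
all_sequences = list(product(actions, repeat=3))   # 4**3 = 64 sequences
assert len(all_sequences) == 64
```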

Biological agents are different

A biological agent faces the same planning problems many times: moving the same limbs, navigating in the same environment, manipulating similar objects, etc.

Good action sequences obey regularities, due to the physical structure of the world

Here “good” means action sequences selected by careful, computationally intensive planning.

Learning regularities aids in planning

Initially, the agent considers the whole search space.

It can learn from information on which action sequences were good / typically executed. “Typical” and “good” are strongly correlated, because only rather good sequences are executed.

The search can then be constrained. Examples of regularities: no point in moving a limb back and forth; many sequences contain detours; skills mean learning more regularities in a particular task. A hand-coded illustration of such constraints is sketched below.
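A hypothetical illustration of what constraining the search could look like; the rules here are hard-coded, whereas the talk proposes learning them from experience:

```python
# Hypothetical example of pruning candidate action sequences with hand-coded
# regularities (the talk proposes learning such rules rather than coding them).

OPPOSITE = {"up": "down", "down": "up", "left": "right", "right": "left"}

def is_plausible(seq, max_direction_changes=3):
    """Reject sequences that move back and forth or change direction too often."""
    changes = 0
    for prev, curr in zip(seq, seq[1:]):
        if curr == OPPOSITE[prev]:   # immediate reversal: "no point moving back and forth"
            return False
        if curr != prev:
            changes += 1
    return changes <= max_direction_changes

candidates = [("up", "down", "up"), ("up", "up", "right"), ("left", "left", "left")]
print([seq for seq in candidates if is_plausible(seq)])
# -> [('up', 'up', 'right'), ('left', 'left', 'left')]
```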

A prior model on good sequences

A probabilistic approach: build a model of the statistical structure of those sequences which were executed (= led to the goal, or close to it).

Use a model which can generate candidate sequences in future planning, e.g. a Markov model.

After sufficient experience, search using only action sequences generated by the prior model.
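A minimal sketch of such a prior, assuming a first-order Markov model over the four grid-world actions; the smoothing constant and sequence length are illustrative choices, not taken from the talk:

```python
# Minimal sketch of a behavioural prior as a first-order Markov model over actions.
import random
from collections import defaultdict

ACTIONS = ["up", "down", "left", "right"]

class MarkovPrior:
    def __init__(self, smoothing=1.0):
        # Transition pseudo-counts, initialised to a small smoothing value.
        self.counts = defaultdict(lambda: defaultdict(lambda: smoothing))

    def fit(self, executed_sequences):
        """Count action transitions in previously executed (good) sequences."""
        for seq in executed_sequences:
            for prev, curr in zip(seq, seq[1:]):
                self.counts[prev][curr] += 1.0

    def sample_sequence(self, length, start=None):
        """Generate a candidate action sequence for planning."""
        seq = [start or random.choice(ACTIONS)]
        for _ in range(length - 1):
            weights = [self.counts[seq[-1]][a] for a in ACTIONS]
            seq.append(random.choices(ACTIONS, weights=weights)[0])
        return seq

prior = MarkovPrior()
prior.fit([["up", "up", "right", "right"], ["right", "right", "up", "up"]])
print(prior.sample_sequence(length=5))   # candidates are biased towards straight runs
```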

Simulation 1

Grid world, actions: “up”, “down”, “left”, “right”

Food is randomly scattered. Initially, planning is random; a Markov prior is learned and then used in a later test period (a rough code sketch of the setup follows below).

Result 1: the Markov model learns the “rules”: do not go back and forth, do not change direction too often.

Result 2: performance is improved.
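A rough sketch of this setup, reusing the MarkovPrior class from the sketch above; the grid size, planning horizon, and candidate budget are illustrative guesses rather than the values used in the actual simulation:

```python
# Rough sketch of the Simulation 1 setup, reusing the MarkovPrior class above.
import random

STEP = {"up": (0, 1), "down": (0, -1), "left": (-1, 0), "right": (1, 0)}

def rollout(start, seq, food, size=10):
    """Follow an action sequence on a bounded grid; count food cells visited."""
    (x, y), eaten = start, 0
    for a in seq:
        dx, dy = STEP[a]
        x, y = min(max(x + dx, 0), size - 1), min(max(y + dy, 0), size - 1)
        eaten += (x, y) in food
    return eaten

food = {(random.randrange(10), random.randrange(10)) for _ in range(20)}

# Planning = evaluate a limited budget of candidate sequences; the behavioural
# prior determines which candidates get generated at all.
candidates = [prior.sample_sequence(length=8) for _ in range(100)]
best = max(candidates, key=lambda seq: rollout((5, 5), seq, food))
print(best, rollout((5, 5), best, food))
```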

Simulation 2

Using a behavioural prior to improve model-free reinforcement learning

Same grid world, with one goal. The value function is incompletely learned by Q-learning. After Q-learning, the agent plans to find the maximum of the value function.

The results are similar to those in Simulation 1, now using the previously learned value function.
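A hypothetical sketch of this two-phase idea: tabular Q-learning first produces an (incomplete) value function, and planning then searches for an action sequence that ends in a high-value state. The learning rate, discount factor, episode counts, and grid layout are all guesses; a behavioural prior would replace the uniform candidate generation in the planning phase:

```python
# Hypothetical sketch of Simulation 2: tabular Q-learning gives an incomplete
# value function, and planning then searches for a sequence reaching a
# high-value state.
import random

ACTIONS = ["up", "down", "left", "right"]
STEP = {"up": (0, 1), "down": (0, -1), "left": (-1, 0), "right": (1, 0)}
SIZE, GOAL = 10, (9, 9)

def move(state, action):
    dx, dy = STEP[action]
    return (min(max(state[0] + dx, 0), SIZE - 1),
            min(max(state[1] + dy, 0), SIZE - 1))

# Model-free phase: a few episodes of Q-learning (value function stays incomplete).
Q = {((x, y), a): 0.0 for x in range(SIZE) for y in range(SIZE) for a in ACTIONS}
for _ in range(50):
    s = (0, 0)
    for _ in range(100):
        a = random.choice(ACTIONS)
        s2 = move(s, a)
        r = 1.0 if s2 == GOAL else 0.0
        Q[(s, a)] += 0.5 * (r + 0.9 * max(Q[(s2, b)] for b in ACTIONS) - Q[(s, a)])
        s = s2

def value(state):
    return max(Q[(state, a)] for a in ACTIONS)

# Planning phase: search for an action sequence whose end state has high value.
# A behavioural prior would replace random.choice below with prior.sample_sequence.
def plan(start, horizon=15, n_candidates=200):
    best_seq, best_val = None, -1.0
    for _ in range(n_candidates):
        seq, s = [random.choice(ACTIONS) for _ in range(horizon)], start
        for a in seq:
            s = move(s, a)
        if value(s) > best_val:
            best_seq, best_val = seq, value(s)
    return best_seq, best_val

print(plan((0, 0)))
```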

Related and future work

There is a lot of work on chunking actions into “macro-actions”: a special form of priors.

Options (Sutton et al.): is a probabilistic interpretation also possible?

Case-based planning: different, because the new action considered also depends on the world state.

Behavioural priors in model-free reinforcement learning? In motor control? (motor synergies)

Conclusion

Perceptual priors are widely considered important for perception to work

We propose that action planning needs behavioural priors.

Priors tell which action sequences are typically useful

This improves planning by constraining the search. More simulations are needed to verify the utility of the approach.