AgentArchitectures · 1. Reﬂex(reactive)Agent 2. Model-basedReﬂexAgent 3....

Agent Architectures

BE4M36MAS - Multiagent systems

Organization

Distant Teaching

• Active participation stillexpected!

• Don’t be afraid to ask• Un-mute and speak,that’s it.

• You can use the chat ifyou are shy.

• Preferably, connect withvideo.

Tutors

David Fiedler (Tutorial 1–3) david.fiedler@agents.fel.cvut.cz

Dominik Andreas Seitz seitzdom@fel.cvut.cz

Michal Šustr michal.sustr@aic.fel.cvut.cz

Aditya Aradhye (Tutorial 12–14)

Website:https://cw.fel.cvut.cz/wiki/courses/be4m36mas/start

Course Outline

Agent Programming22 Sept Introduction to multi-agent

systemsPěchouček

29 Sept Agent Architectures. Belief-Desire-Intention architecture

Pěchouček/Jakob

Course Outline

Non-cooperative Game Theory6 Oct Introduction to Game Theory Bošanský13 Oct Solving Normal-form Games Bošanský20 Oct Games in Extensive Form Bošanský27 Oct Solving Extensive-Form Games Bošanský3 Nov Other Game Representations Bošanský

Course Outline

Multiagent Resource Allocation10 Nov Multiagent Resource Allocation Jakob

Course Outline

Auctions24 Nov Auctions 1 Jakob1 Dec Auctions 2 Jakob

Course Outline

Coalitional Game Theory8 Dec Coalitional Game Theory 1 Kroupa15 Dec Coalitional Game Theory 2 Kroupa22 Dec Coalitional Game Theory 3 Kroupa

Course Outline

Social Choice, Voting5 Jan Social Choice, Voting Kroupa

Tutorials

Attendance: voluntary (but tracked)

Assessment – 3 assignments:

1. Agent programming (max 14 pts)2. Game theory (max 14 pts)3. Coalitional game theory (12 pts)

You have to obtain at least 20 points. Plagiarism is strictlyforbidden (Strong punishments would be applied).

Agent architectures

Components of agent architectures

Actions (A)Ways for the agent to influence theenvironment

Percepts (P)Observations about the state of the world

Decision making (d : P∗ → A)Mapping perception history to actions

Components of agent architectures

Actions (A)Ways for the agent to influence theenvironment

Percepts (P)Observations about the state of the world

Decision making (d : P∗ → A)Mapping perception history to actions

Architecture types

1. Reflex (reactive) Agent2. Model-based Reflex Agent3. Model-based Goal-based Agent4. Model-based Utility-based Agent5. Learning-based Agent

(Russell and Norvig)

Let’s Play with Agents!

Wumpus’ World

Wumpus’ World• Grid world environment• Agent has to find the gold brickand carry it to the bottom le tsquare

• Problem: Entering a squareoccupied by Wumpus orcontaining a pit costs agent hislife(Wumpus does not move)

Wumpus’ World

Wumpus’ World — Percepts• Breeze — whenever agent standsnext to a pit

• Stench — whenever agentstands next to Wumpus

• Gold — when agent carries agold brick

Wumpus’ World

Wumpus’ World — Actions• Going to any neighboring square(only vertically and horizontally)

Reflex agent

Agent conditions his decision solely on his current percepts.(e.g. on the facts he can currently sense)

Task: Implement a reflex agent for Wumpus world. Beware, donot use any kind of memory or smarter reasoning ;-)

Model-based reflex agent

Agent uses percepts to graduallybuild a model of the environment.

Decisions are based on the expectedstate of the world according to hismodel.

Question: Does this approach allow us to overcome this issue?

Task: Implement a model-based agent and reach the gold!

Agent uses percepts to graduallybuild a model of the environment.

Decisions are based on the expectedstate of the world according to hismodel.

Question: Does this approach allow us to overcome this issue?

Task: Implement a model-based agent and reach the gold!

Question: Is the behaviour of the agent rational?

Definitely not!

Agent just exploits the model to stay alive. He does notintentionally pursue his goal.

Question: Is the behaviour of the agent rational?Definitely not!

Agent just exploits the model to stay alive. He does notintentionally pursue his goal.

Model-based Goal-based agent

Actions are chosen in order to reach a declaratively specifiedgoal.

Techniques:

1. Planning Planning in AI2. Belief-Desire-Intention Architecture this course

Question: What does it mean for an agent in Wumpus’ world?

Model-based Utility-based agent

Not all ways to reach the goal are equally plausible. Someways to reach the goal should be prefered against others.(e.g. cheaper or less risky ones)

Utility driven sequential decision making:

• Non–adversarial: MDPs, POMDPs Planning in AI• Adversarial: Sequential games this course

Learning-based agent

Agent does not fully know the task he is facing.(what his action does, what is his goal etc.)

He learns the task on the go — strategy reflecting these findscannot be fixed in advance.

Learning both model and strategy.

Next tutorial

• Belief-Desire-Intention architecture

AgentArchitectures · 1. Reﬂex(reactive)Agent 2. Model-basedReﬂexAgent 3....

Documents

Transcript of AgentArchitectures · 1. Reﬂex(reactive)Agent 2. Model-basedReﬂexAgent 3....

Fair Reactive Programming - McGill University School of ...acave1/papers/fair-reactive-long.pdf · Reactive programming seeks to model systems which react and respond to input such

Nonlinear Model Predictive Control of a Reactive Distillation Column

Typeful Updates on Reactive Live Web Programming · Typeful Updates on Reactive Live Web Programming 3 2 Reactive Web Programming Our computational model is designed to host a running

Reactors: A Deterministic Model for Composable Reactive ...marten/pdf/Loh... · desirable properties for designing and programming reactive systems. Our model is deterministic by

PRECONDITIONING A COUPLED MODEL FOR REACTIVE …Key words. Reactive transport, porous media, preconditioning, non-linear systems. 1. Introduction Reactive transport in porous media

Neuromorphic Model of Reflex for Realtime Human-Like … · 2020. 8. 20. · Original Article Neuromorphic Model of Reﬂex for Realtime Human-Like Compliant Control of Prosthetic

A Process Network Model for Reactive Streaming Software with Deterministic Task ... · 2018-04-10 · A Process Network Model for Reactive Streaming Software with Deterministic Task

A chemical kinetic model for reactive …cires1.colorado.edu/jimenez/Papers/ch4_kinetics.pdfA chemical kinetic model for reactive transformations of aerosol particles Previous models

Three-dimensional model for multi-component reactive transport

Combined Active and Reactive Power Control of Wind Farms based on Model Predictive Control · active and reactive power control based on Model Predictive Control (MPC) to improve

VIATRA 3: A reactive model transformation platform

Creating a Dynamic Reactive Communication Model from Scratch

Indirect Methods: Exoplanets The reﬂex motion of the star ...

Reﬂex motion: masses and orbits

NARPAA E-Class Module 10 - Proactive Reactive Model

Aalborg Universitet A Data-Driven Stochastic Reactive Power … · leads to the development of the dynamic reactive power opti-mization (DRPO) model [1-3]. This model is actually

Chapter 5 : Electrons in Atoms. Problems with Rutherford’s Model of the Atom Chlorine # 17 Reactive Potassium # 19 Very reactive Argon # 18 Not reactive.

The effect of lexical frequency and Lombard reﬂex on tone ...jurafsky/zhao09.pdfThe effect of lexical frequency and Lombard reﬂex on tone hyperarticulation Yuan Zhao , Dan Jurafsky

Model-Driven Development of Reactive Information Systems · 4 Reiko Heckel, Marc Lohmann: Model-Driven Development of Reactive Information Systems the other two by describing the

Model-based Reactive Programming of Cooperative Vehicles for Mars Exploration