AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3....

29
Agent Architectures BE4M36MAS - Multiagent systems

Transcript of AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3....

Page 1: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Agent Architectures

BE4M36MAS - Multiagent systems

Page 2: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Organization

Page 3: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Distant Teaching

• Active participation stillexpected!

• Don’t be afraid to ask• Un-mute and speak,that’s it.

• You can use the chat ifyou are shy.

• Preferably, connect withvideo.

1

Page 4: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Tutors

David Fiedler (Tutorial 1–3) [email protected]

Dominik Andreas Seitz [email protected]

Michal Šustr [email protected]

Aditya Aradhye (Tutorial 12–14)

Website:https://cw.fel.cvut.cz/wiki/courses/be4m36mas/start

2

Page 5: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Course Outline

Agent Programming22 Sept Introduction to multi-agent

systemsPěchouček

29 Sept Agent Architectures. Belief-Desire-Intention architecture

Pěchouček/Jakob

3

Page 6: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Course Outline

Non-cooperative Game Theory6 Oct Introduction to Game Theory Bošanský13 Oct Solving Normal-form Games Bošanský20 Oct Games in Extensive Form Bošanský27 Oct Solving Extensive-Form Games Bošanský3 Nov Other Game Representations Bošanský

4

Page 7: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Course Outline

Multiagent Resource Allocation10 Nov Multiagent Resource Allocation Jakob

5

Page 8: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Course Outline

Auctions24 Nov Auctions 1 Jakob1 Dec Auctions 2 Jakob

6

Page 9: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Course Outline

Coalitional Game Theory8 Dec Coalitional Game Theory 1 Kroupa15 Dec Coalitional Game Theory 2 Kroupa22 Dec Coalitional Game Theory 3 Kroupa

7

Page 10: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Course Outline

Social Choice, Voting5 Jan Social Choice, Voting Kroupa

8

Page 11: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Tutorials

Attendance: voluntary (but tracked)

Assessment – 3 assignments:

1. Agent programming (max 14 pts)2. Game theory (max 14 pts)3. Coalitional game theory (12 pts)

You have to obtain at least 20 points. Plagiarism is strictlyforbidden (Strong punishments would be applied).

9

Page 12: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Agent architectures

Page 13: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Components of agent architectures

Actions (A)Ways for the agent to influence theenvironment

Percepts (P)Observations about the state of the world

Decision making (d : P∗ → A)Mapping perception history to actions

10

Page 14: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Components of agent architectures

Actions (A)Ways for the agent to influence theenvironment

Percepts (P)Observations about the state of the world

Decision making (d : P∗ → A)Mapping perception history to actions

10

Page 15: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Architecture types

1. Reflex (reactive) Agent2. Model-based Reflex Agent3. Model-based Goal-based Agent4. Model-based Utility-based Agent5. Learning-based Agent

(Russell and Norvig)

11

Page 16: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Let’s Play with Agents!

Page 17: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Wumpus’ World

Wumpus’ World• Grid world environment• Agent has to find the gold brickand carry it to the bottom le tsquare

• Problem: Entering a squareoccupied by Wumpus orcontaining a pit costs agent hislife(Wumpus does not move)

12

Page 18: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Wumpus’ World

Wumpus’ World — Percepts• Breeze — whenever agent standsnext to a pit

• Stench — whenever agentstands next to Wumpus

• Gold — when agent carries agold brick

13

Page 19: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Wumpus’ World

Wumpus’ World — Actions• Going to any neighboring square(only vertically and horizontally)

14

Page 20: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Reflex agent

Agent conditions his decision solely on his current percepts.(e.g. on the facts he can currently sense)

Task: Implement a reflex agent for Wumpus world. Beware, donot use any kind of memory or smarter reasoning ;-)

15

Page 21: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Model-based reflex agent

Agent uses percepts to graduallybuild a model of the environment.

Decisions are based on the expectedstate of the world according to hismodel.

Question: Does this approach allow us to overcome this issue?

Task: Implement a model-based agent and reach the gold!

16

Page 22: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Model-based reflex agent

Agent uses percepts to graduallybuild a model of the environment.

Decisions are based on the expectedstate of the world according to hismodel.

Question: Does this approach allow us to overcome this issue?

Task: Implement a model-based agent and reach the gold!

16

Page 23: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Model-based reflex agent

Question: Is the behaviour of the agent rational?

Definitely not!

Agent just exploits the model to stay alive. He does notintentionally pursue his goal.

17

Page 24: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Model-based reflex agent

Question: Is the behaviour of the agent rational?Definitely not!

Agent just exploits the model to stay alive. He does notintentionally pursue his goal.

17

Page 25: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Model-based Goal-based agent

Actions are chosen in order to reach a declaratively specifiedgoal.

Techniques:

1. Planning Planning in AI2. Belief-Desire-Intention Architecture this course

Question: What does it mean for an agent in Wumpus’ world?

18

Page 26: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Model-based Utility-based agent

Not all ways to reach the goal are equally plausible. Someways to reach the goal should be prefered against others.(e.g. cheaper or less risky ones)

Utility driven sequential decision making:

• Non–adversarial: MDPs, POMDPs Planning in AI• Adversarial: Sequential games this course

19

Page 27: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Learning-based agent

Agent does not fully know the task he is facing.(what his action does, what is his goal etc.)

He learns the task on the go — strategy reflecting these findscannot be fixed in advance.

Learning both model and strategy.

20

Page 28: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Next tutorial

Page 29: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)

Next tutorial

• Belief-Desire-Intention architecture

21