
General Opponent* Modeling for Improving Agent-Human Interaction

Sarit Kraus, Dept. of Computer Science, Bar-Ilan University

AMEC, May 2010

Negotiation is an extremely important form of human interaction.

Motivation

Computers interacting with people:
- Computer persuades human
- Computer is in control
- Human is in control


Culture-sensitive agents
- Development of a standardized agent to be used for collecting data in studies on culture and negotiation
- Buyer/seller agents that negotiate well across cultures: the PURB agent


Semi-autonomous cars


Medical applications

Gertner Institute for Epidemiology and Health Policy Research


Automated care-taker


Patient: I will be too tired in the afternoon!!!
Agent: I scheduled an appointment for you at the physiotherapist this afternoon.
[The agent tries to reschedule and fails.]
Agent: The physiotherapist has no other available appointments this week. How about resting before the appointment?

Security applications


- Collect
- Update
- Analyze
- Prioritize

Irrationalities attributed to:
- sensitivity to context
- lack of knowledge of own preferences
- the effects of complexity
- the interplay between emotion and cognition
- the problem of self-control
- bounded rationality

People often follow suboptimal decision strategies.

General opponent* modeling

Challenges of human opponent* modeling:
- Small number of examples: it is difficult to collect data on people
- Noisy data: people are inconsistent (the same person may act differently), and people are diverse

Agenda
- Multi-attribute, multi-round bargaining: KBAgent
- Revelation + bargaining: SIGAL
- Optimization problems: AAT-based learning
- Coordination with people: focal point based learning

QOAgent [LIN08]
- Multi-issue, multi-attribute, with incomplete information
- Domain independent
- Implemented several tactics and heuristics, qualitative in nature
- Non-deterministic behavior, also by means of randomization

R. Lin, S. Kraus, J. Wilkenfeld, and J. Barry. Negotiating with bounded rational agents in environments with incomplete information using an automated agent. Artificial Intelligence, 172(6-7):823-851, 2008.

Played at least as well as people.

Is it possible to improve the QOAgent? Yes, if you have data.

KBAgent [OS09]
- Multi-issue, multi-attribute, with incomplete information
- Domain independent
- Implemented several tactics and heuristics, qualitative in nature
- Non-deterministic behavior, also by means of randomization
- Uses data from previous interactions

Y. Oshrat, R. Lin, and S. Kraus. Facing the challenge of human-agent negotiations via effective general opponent modeling. In AAMAS, 2009.

Example scenario
- Employer and job candidate
- Objective: reach an agreement over hiring terms after a successful interview

General opponent modeling
- Challenge: sparse data of past negotiation sessions between people
- Technique: Kernel Density Estimation

General opponent modeling
- Estimate the likelihood that the other party will accept an offer, the likelihood that it will make an offer, and its expected average utility
- The estimation is done separately for each possible agent type; a negotiator's type is determined using a simple Bayes classifier
- Use the estimation for decision making (see the sketch below)
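To make the technique concrete, here is a minimal sketch of KDE-driven acceptance estimation, assuming offers are summarized by the normalized utility they give the other party; the feature choice, bandwidth, and data are illustrative assumptions, not the KBAgent's actual implementation:

```python
# Minimal KDE sketch for P(accept | offer); names and data are illustrative.
import numpy as np

def gaussian_kde(samples, bandwidth=0.1):
    """Return a 1-D Gaussian kernel density estimate built from samples."""
    samples = np.asarray(samples, dtype=float)
    def density(x):
        z = (x - samples) / bandwidth
        return np.exp(-0.5 * z ** 2).sum() / (len(samples) * bandwidth * np.sqrt(2 * np.pi))
    return density

def acceptance_probability(offer, accepted, rejected):
    """Bayes' rule over KDE densities of previously accepted/rejected offers."""
    f_acc, f_rej = gaussian_kde(accepted), gaussian_kde(rejected)
    prior = len(accepted) / (len(accepted) + len(rejected))
    num = f_acc(offer) * prior
    return num / (num + f_rej(offer) * (1 - prior) + 1e-12)

# Offers encoded by the normalized utility they give the other party.
accepted = [0.62, 0.70, 0.75, 0.80]   # offers people accepted in past sessions
rejected = [0.30, 0.35, 0.45, 0.50]   # offers people rejected
print(acceptance_probability(0.65, accepted, rejected))  # ~0.85
```

In the same spirit, a separate density can be estimated per agent type, with the simple Bayes classifier selecting which type's estimate to use.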

KBAgent as the job candidate (KBAgent vs. human)

Best result: 20,000, Project manager, with leased car, 20% pension funds, fast promotion, 8 hours

Offers exchanged:
- 20,000, Team Manager, with leased car, pension: 20%, slow promotion, 9 hours
- 12,000, Programmer, without leased car, pension: 10%, fast promotion, 10 hours
- 20,000, Project manager, without leased car, pension: 20%, slow promotion, 9 hours

KBAgent as the job candidate (KBAgent vs. human)

Best agreement: 20,000, Project manager, with leased car, 20% pension funds, fast promotion, 8 hours

Round 7 offers:
- 20,000, Programmer, with leased car, pension: 10%, slow promotion, 9 hours
- 12,000, Programmer, without leased car, pension: 10%, fast promotion, 10 hours
- 20,000, Team Manager, with leased car, pension: 20%, slow promotion, 9 hours


Experiments
- 172 graduate and undergraduate students in computer science
- People were told they might be playing against a computer agent or a person
- Scenarios: Employer-Employee; Tobacco Convention (England vs. Zimbabwe)
- Learned from 20 games of human-human negotiation

Results: comparing the KBAgent to others

Average utility value (std) by player type:

Employer role:
- KBAgent vs. people: 468.9 (37.0)
- QOAgent vs. people: 417.4 (135.9)
- People vs. people: 408.9 (106.7)
- People vs. QOAgent: 431.8 (80.8)
- People vs. KBAgent: 380.4 (48.5)

Job candidate role:
- KBAgent vs. people: 482.7 (57.5)
- QOAgent vs. people: 397.8 (86.0)
- People vs. people: 310.3 (143.6)
- People vs. QOAgent: 320.5 (112.7)
- People vs. KBAgent: 370.5 (58.9)

Main results
- In comparison to the QOAgent:
  - The KBAgent achieved higher utility values than the QOAgent
  - More agreements were accepted by people
  - The sum of utility values (social welfare) was higher when the KBAgent was involved
- The KBAgent achieved significantly higher utility values than people
- The results demonstrate the proficiency of the negotiation done by the KBAgent

General opponent* modeling improves agent bargaining.

Automated care-taker

Patient: I will be too tired in the afternoon!!!
Agent: I arranged for you to go to the physiotherapist in the afternoon.
Agent: How can I convince him? What argument should I give?

Security applications

How should I convince him to provide me with information?

Which information to reveal?


Argumentation
- Should I tell him that I will lose a project if I don't hire today?
- Should I tell him I was fired from my last job?
- Should I tell her that my leg hurts?
- Should I tell him that we are running out of antibiotics?

Build a game that combines information revelation and bargaining.

Colored Trails (CT)
- An infrastructure for agent design, implementation, and evaluation in open environments
- Designed with Barbara Grosz (AAMAS 2004)
- Implemented by the Harvard and BIU teams

An experimental test-bed
- Interesting for people to play: analogous to task settings; vivid representation of the strategy space (not just a list of outcomes)
- Possible for computers to play
- Can vary in complexity: repeated vs. one-shot setting; availability of information; communication protocol

Game description

The game is built from phases:
- Revelation phase
- First proposal phase
- Counter-proposal phase

Joint work with Kobi Gal and Noam Peled

Two boards


Why not equilibrium agents?
- Results from the social sciences suggest that people do not follow equilibrium strategies; equilibrium-based agents played against people have failed
- People rarely design agents to follow equilibrium strategies (Sarne et al., AAMAS 2008)
- Equilibrium strategies are usually not cooperative: all lose

Perfect Equilibrium agent

Solved using backward induction; no strategic signaling.

Phase two:
- Second proposer: find the most beneficial proposal while the responder's benefit remains positive
- Second responder: accept any proposal that gives it a positive benefit

Perfect Equilibrium agent

Phase one:
- First proposer: propose the opponent's counter-proposal
- First responder: accept any proposal that gives it the same or a higher benefit than its own counter-proposal would

In both boards, the PE with goal revelation yields lower or equal expected utility than the non-revelation PE.

Revelation: reveals in half of the games.
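A toy sketch of the backward-induction reasoning above; the payoff table is invented for illustration (the actual agent reasons over CT board states), with each proposal written as (benefit to A, benefit to B):

```python
# Toy backward induction for the two-phase proposal game; payoffs invented.
proposals = [(9, 1), (7, 3), (5, 5), (3, 7), (1, 9)]

# Phase two: B is the second proposer. B picks the proposal maximizing its
# own benefit while keeping A's (the responder's) benefit positive; A, as
# second responder, accepts anything that gives it a positive benefit.
counter = max((p for p in proposals if p[0] > 0), key=lambda p: p[1])

# Phase one: B (first responder) accepts only offers at least as good for it
# as its own counter-proposal, so A (first proposer) proposes exactly the
# counter-proposal outcome, which B accepts immediately.
first_offer = counter
print("Equilibrium agreement:", first_offer)  # -> (1, 9) with this table
```

Note the strong second-proposer advantage this produces.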

Asymmetric game


Performance
- 140 students

Benefits diversity
- Average proposed benefit to players in the first and second rounds

Revelation effect

The effect of revelation on performance:
- Only 35% of the games played by humans included revelation
- Revelation had a significant effect on human performance but not on agent performance
- People were deterred by the strategic machine-generated proposals, which depended heavily on the roles of proposer and responder


SIGAL agent

An agent based on general opponent modeling:
- Genetic algorithm
- Logistic regression

SIGAL agent: acceptance
- Learns from previous games
- Predicts the acceptance probability of each proposal using logistic regression
- Features (for both players) relating to proposals: benefit; goal revelations; player types; benefit difference between rounds 2 and 1 (see the sketch below)
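A sketch of such an acceptance model with a hand-rolled logistic regression; the feature layout and training data below are illustrative assumptions, not SIGAL's exact encoding:

```python
# Logistic-regression acceptance predictor; feature layout and data invented.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fit_logistic(X, y, lr=0.1, epochs=5000):
    """Plain gradient-descent maximum-likelihood fit (no regularization)."""
    w = np.zeros(X.shape[1])
    for _ in range(epochs):
        w -= lr * X.T @ (sigmoid(X @ w) - y) / len(y)
    return w

# Columns: bias, proposer benefit, responder benefit, goal revealed (0/1),
# responder's benefit difference between rounds 2 and 1.
X = np.array([[1.0, 9, 1, 0, -2.0],
              [1.0, 5, 5, 1,  0.0],
              [1.0, 7, 3, 0, -1.0],
              [1.0, 4, 6, 1,  1.0]])
y = np.array([0.0, 1.0, 0.0, 1.0])   # 1 = the human accepted the proposal

w = fit_logistic(X, y)
offer = np.array([1.0, 6, 4, 1, 0.0])
print(f"P(accept) = {sigmoid(offer @ w):.2f}")
```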

SIGAL agent: counter-proposals
- Models the way humans make counter-proposals

SIGAL agent

Maximizes expected benefit given any state of the game (see the sketch below):
- Round
- Player revelation
- Behavior in round 1
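A minimal sketch of that maximization, assuming an acceptance model like the one above and an illustrative fallback benefit when an offer is rejected:

```python
# Pick the proposal with the highest expected benefit for the agent.
# `accept_prob` would come from the learned model; values here are invented.
def best_proposal(candidates, accept_prob, fallback):
    """candidates: list of (my_benefit, features); fallback: benefit if rejected."""
    return max(candidates,
               key=lambda c: accept_prob(c[1]) * c[0]
                             + (1.0 - accept_prob(c[1])) * fallback)

# Greedy offers are tempting but unlikely to be accepted.
candidates = [(9, "greedy"), (6, "moderate"), (4, "generous")]
probs = {"greedy": 0.1, "moderate": 0.6, "generous": 0.9}
print(best_proposal(candidates, probs.get, fallback=1.0))  # -> (6, 'moderate')
```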

Agent strategies comparison

Agent | Round 1 Send | Round 1 Receive | Round 2 Send | Round 2 Receive
EQ | Green: 10, Gray: 11 | Purple: 2 | Green: 2 | Purple: 10, Gray: 11
SIGAL | Green: 2 | Purple: 9 | Green: 2 | Purple: 5

SIGAL agent: performance

Agents' performance comparison
[Chart comparing the Equilibrium agent and the SIGAL agent]

General opponent* modeling improves agent negotiations

GENERAL OPPONENT* MODELING IN MAXIMIZATION PROBLEMS

AAT agent

An agent based on general opponent* modeling:
- Decision tree / naïve Bayes
- AAT

Aspiration Adaptation Theory (AAT)
- An economic theory of people's behavior (Selten)
- No utility function exists for decisions (!); relative decisions are used instead
- Retreat and urgency are used for goal variables

Avi Rosenfeld and Sarit Kraus. Modeling Agents through Bounded Rationality Theories. Proc. of IJCAI 2009; JAAMAS, 2010.

Commodity search

A shopper visits stores in sequence and observes prices: 1000, 900, 950, ...

Decision rule: if price < 800, buy; otherwise visit 5 stores and buy at the cheapest one.
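The rule translates directly into code; a sketch with invented store prices:

```python
# Human-like search rule from the commodity-search example:
# buy immediately below 800, otherwise compare 5 stores and take the cheapest.
def shopper(prices):
    visited = []
    for price in prices:
        visited.append(price)
        if price < 800:
            return price            # good enough: buy on the spot
        if len(visited) == 5:
            return min(visited)     # stop searching, take the cheapest seen
    return min(visited)             # fewer than 5 stores were available

print(shopper([1000, 900, 950, 780]))        # -> 780 (bought immediately)
print(shopper([1000, 900, 950, 990, 870]))   # -> 870 (cheapest of 5 stores)
```

AAT-style learning fits such relative, threshold-based rules from observed human decisions instead of estimating a utility function.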

Results

[Chart: "Using AAT to Quickly Learn" - correct classification % (y-axis, 65-83) for Sparse Naïve Learning vs. Sparse AAT]

Using behavioral models in general opponent* modeling is beneficial.

General opponent* modeling in cooperative environments


Coordination with limited communication

Communication is not always possible:
- High communication costs
- Need to act undetected
- Damaged communication devices
- Language incompatibilities

Goal: limited interruption of human activities

I. Zuckerman, S. Kraus and J. S. Rosenschein. Using Focal Point Learning to Improve Human-Machine Tacit Coordination. JAAMAS, 2010.

Focal points (examples)

Divide £100 into two piles. If your piles are identical to your coordination partner's, you get the £100; otherwise, you get nothing. (101 equilibria)

Focal points (examples)

[Two coordination games, with 9 and 16 equilibria respectively]

Focal points

Thomas Schelling (1963): focal points are prominent solutions to tacit coordination games.

Prior work: focal-point-based coordination for closed environments

Domain-independent rules that automated agents could use to identify focal points (see the sketch below):
- Properties: centrality, firstness, extremeness, singularity
- A logic-based model
- A decision-theory-based model
- Algorithms for agent coordination

Kraus and Rosenschein, MAAMAW 1992; Fenster et al., ICMAS 1995; Annals of Mathematics and Artificial Intelligence, 2000.
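A toy sketch of scoring candidate choices with such domain-independent properties; the property definitions below are simplified stand-ins for the published rules:

```python
# Toy focal-point scoring; property functions are simplified stand-ins.
def centrality(i, options):
    return 1.0 if i == len(options) // 2 else 0.0          # middle position

def firstness(i, options):
    return 1.0 if i == 0 else 0.0                          # listed first

def extremeness(i, options):
    sizes = [o["size"] for o in options]
    return 1.0 if options[i]["size"] in (max(sizes), min(sizes)) else 0.0

def singularity(i, options):
    colors = [o["color"] for o in options]
    return 1.0 if colors.count(options[i]["color"]) == 1 else 0.0

def focal_score(i, options):
    return sum(f(i, options) for f in (centrality, firstness, extremeness, singularity))

options = [{"color": "red", "size": 3},
           {"color": "blue", "size": 3},
           {"color": "blue", "size": 7}]
best = max(range(len(options)), key=lambda i: focal_score(i, options))
print(options[best])  # -> the red option: unique color, listed first, extreme size
```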

FPL agent

An agent based on general opponent* modeling:
- Decision tree / neural network
- Focal points

FPL agent

An agent based on general opponent modeling:
- Decision tree / neural network
- Raw data vector → FP vector (see the sketch below)
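A sketch of that preprocessing step, using toy property functions as above: each raw option is mapped to a focal-point feature vector before a decision tree or neural network is trained on it:

```python
# FPL preprocessing sketch: raw options -> focal-point feature vectors.
def fp_vector(i, options):
    sizes = [o["size"] for o in options]
    colors = [o["color"] for o in options]
    return [
        1.0 if i == len(options) // 2 else 0.0,                          # centrality
        1.0 if i == 0 else 0.0,                                          # firstness
        1.0 if options[i]["size"] in (max(sizes), min(sizes)) else 0.0,  # extremeness
        1.0 if colors.count(options[i]["color"]) == 1 else 0.0,          # singularity
    ]

options = [{"color": "red", "size": 3},
           {"color": "blue", "size": 3},
           {"color": "blue", "size": 7}]
X = [fp_vector(i, options) for i in range(len(options))]  # classifier input
print(X)
```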

Focal Point Learning

3 experimental domains:


Results (cont'd)

The "very similar domain" (VSD) vs. the "similar domain" (SD) of the "pick the pile" game.

General opponent* modeling improves agent coordination.

Evaluation of agents (EDA)

Experiments with people are a costly process.
- Peer Designed Agents (PDAs): computer agents developed by humans
- Experiment: 300 human subjects, 50 PDAs, 3 EDAs
- Results: EDAs outperformed PDAs in the same situations in which they outperformed people; on average, EDAs exhibited the same measure of generosity

R. Lin, S. Kraus, Y. Oshrat and Y. Gal. Facilitating the Evaluation of Automated Negotiators using Peer Designed Agents. In AAAI 2010.

Conclusions
- Negotiation and argumentation with people are required for many applications
- General opponent* modeling is beneficial: machine learning; behavioral models
- Challenge: how to integrate machine learning and behavioral models