Sarit Kraus Bar-Ilan University

29
Sarit Kraus Bar-Ilan University [email protected]

Transcript of Sarit Kraus Bar-Ilan University

Page 1: Sarit Kraus Bar-Ilan University

Sarit Kraus Bar-Ilan University

[email protected]

Page 2: Sarit Kraus Bar-Ilan University

Computerized Agents that Interact with People

2

Automated

Speech Therapist

Persuading people

to save energy

Culture sensitive

negotiation agent

Automated

mediators

Agent supports

Argumentation

Supporting

Teams of

Robots &

Operator

Virtual suspect for

training investigators

Training people in

negotiations

Page 3: Sarit Kraus Bar-Ilan University

People Often Follow Suboptimal Decision Strategies

Irrationalities attributed to

sensitivity to context

lack of knowledge of own preferences

the effects of complexity

the interplay between emotion and cognition

the problem of self control

3 Kahneman Selten

Page 4: Sarit Kraus Bar-Ilan University

Why not Only Equilibrium Agents? Nash equilibrium: stable strategies; no agent has an

incentive to deviate

Results from the social sciences suggest people do not follow equilibrium strategies:

Equilibrium based agents played against people failed.

People rarely design agents to follow equilibrium strategies.

Page 5: Sarit Kraus Bar-Ilan University

Why not Only Behavioral Science Models? There are several models that describe human

decision making

Most models specify general criteria that are context sensitive but usually do not provide specific parameters or mathematical definitions

5

Page 6: Sarit Kraus Bar-Ilan University

Why not Only Machine Learning?

Machine learning builds models based on data

It is difficult to collect human data

Collecting data on a specific user is very time consuming.

Human data is noisy

“Curse” of dimensionality

6

Page 7: Sarit Kraus Bar-Ilan University

Methodology

7

Human Prediction

Model

Take action

Machine Learning

Game Theory Optimization

methods

Data

Human behavior models

Human specific data

Page 8: Sarit Kraus Bar-Ilan University

Ariel Rosenfeld, Amos Azaria, Sarit Kraus, Claudia V. Goldman, Omer Tsimhoni

Page 9: Sarit Kraus Bar-Ilan University

CCS reduces ~10% of the car’s power efficiency!

Reduced ecological footprint.

Extending travel distance of EV.

Economically efficient.

Why bother?

Page 10: Sarit Kraus Bar-Ilan University

• Driver’s and system’s goals are

partially conflicting.

Partially Conflicting Interests

Let’s minimize energy

consumption... I’m Hot!

Ariel Rosenfeld et al. AAMAS 2015 @ Istanbul,

Turkey. May 2015

Page 11: Sarit Kraus Bar-Ilan University

Challenges in Repeated Advice Provision in CCS in Real Cars Repeated interaction

Drivers’ preferences.

Long-term effect of advice.

Changing environment.

Estimating expected energy consumption.

Page 12: Sarit Kraus Bar-Ilan University

Climate Control System (In GM Chevrolet Volt 2011)

Page 13: Sarit Kraus Bar-Ilan University

Advice

Controls

Effects

Agent

Effects Effects

Goal: minimize the accumulative energy consumption.

Page 14: Sarit Kraus Bar-Ilan University

Driver/Environment models We recruited 38 subjects. (not that easy!)

Each subject spent 30 min. in the car,

simulating 3 different trips.

Subjects were presented with different advice.

ML algorithm for extracting probabilities:

Drivers likelihood to accept an advice

Car’s condition likelihood to change.

Page 15: Sarit Kraus Bar-Ilan University

Presenting Advice to User

Page 16: Sarit Kraus Bar-Ilan University

Presenting Advice to User

~80% of drivers explicitly accepted.

Page 17: Sarit Kraus Bar-Ilan University

78% accuracy (post-hoc).

Influential Features: Current internal temperature. Change from current setting (Reference point). % of accepted advice (Trust). Saving percentage (Expectation bias).

Not influential:

External temperature. Average temperatures\fan. Accepted deltas.

Prediction of drivers reactions

Page 18: Sarit Kraus Bar-Ilan University

MACS – MDP agent

Uses the predictions for the transition function.

State of the art – SAP (Azaria et al. 2012)

Considers the Social Utility of advice.

The weight provides a trade-off between short and long term gain.

Agents

driveragent UwUw )1(

Page 19: Sarit Kraus Bar-Ilan University

Evaluation

45 drivers - 15 per condition, 3 rounds.

The lower the better.

Page 20: Sarit Kraus Bar-Ilan University

Why Did MDP Outperform the SAP?

SAP was aggressive.

Some subjects stopped clicking on the advice.

Agent Avg. go eco % Avg. save % Avg. consumption

MACS 0.835 23.1 0.174

SAP agent

0.641 33.7 0.237

Page 21: Sarit Kraus Bar-Ilan University

Ariel Rosenfeld , Noa Agmon,

Oleg Maksimov, Amos Azaria,

Sarit Kraus

Page 22: Sarit Kraus Bar-Ilan University

One operator – Multiple robots

Search And Rescue (SAR)

Warehouse operation

Automatic air-craft towing

Fire-Fighting

Military applications

Etc..

Page 23: Sarit Kraus Bar-Ilan University

Semi-Autonomous Robots

Controls the robots

Noisy signals

Page 24: Sarit Kraus Bar-Ilan University

Agent

Controls the robots

Page 25: Sarit Kraus Bar-Ilan University

Agent design

Provide Advice

Machine Learning

Optimization

Data on robots performance

Data on human behavior

Robot model Human model

150 hours of

simulations (no

human operator).

30 human

Operators in

simulation

Page 26: Sarit Kraus Bar-Ilan University

Evaluation: Three Environments

16 subjects

16 subjects

12 subjects

Page 27: Sarit Kraus Bar-Ilan University
Page 28: Sarit Kraus Bar-Ilan University

Objects found per condition

Simulated office Physical office Simulated

warehouse yard

Page 29: Sarit Kraus Bar-Ilan University

29

Agents interacting proficiently with people is important

Human Prediction

Model

Take action

machine learning

Game Theory Optimization

methods

Human behavior models

Data (from specific culture)

Human specific data

Challenging: Experimenting with people is very difficult !!! Working with people from other disciplines is challenging.

Challenging: How to integrate machine learning and behavioral models? How to use in agent’s strategy?