
Asking for Help Using Inverse Semantics

Stefanie Tellex, Ross Knepper, Adrian Li, Daniela Rus, Nicholas Roy

How can robots collaborate with people using natural language?

● Following instructions: “Put the metal crate on the truck.”

Symbol Grounding Problem

“The pallet of boxes on the left.”


Different speakers refer to the same objects and actions in many different ways:

Move the pallet from the truck.

Remove the pallet from the back of the truck.

Offload the metal crate from the truck.

Pick up the silver container from the truck bed.


How can robots collaborate with people using natural language?

● Following instructions: “Put the metal crate on the truck.”

● Asking questions: “What does 'the metal crate' refer to?”

● Requesting help: “Hand me the black leg that is under the table.”

Offload the metal crate from the truck.

What does 'the metal crate' refer to?

The box pallet near the ammo pallet.

“Put the pallet on the truck.”

What does 'the truck' refer to?

What does 'the pallet' refer to?

Information-theoretic Human-Robot Dialog

● Identify uncertain parts of the command: Generalized Grounding Graphs model word meanings.

● Ask a targeted question: a metric based on entropy selects questions (a sketch follows this slide).

● Use information from the answer to infer better actions: merge graphs based on linguistic coreference.

Joint work with Robin Deits, Pratiksha Thaker
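To make the entropy metric concrete, here is a minimal sketch, not the authors' implementation: each phrase has a distribution over candidate groundings, and the robot asks about the phrase whose distribution has the highest entropy. The grounding names and probabilities are made up for illustration.

```python
import math

def entropy(dist):
    """Shannon entropy (bits) of a distribution given as {outcome: probability}."""
    return -sum(p * math.log2(p) for p in dist.values() if p > 0)

# Assumed grounding distributions; the objects and numbers are illustrative only.
beliefs = {
    "the metal crate": {"box_pallet": 0.4, "ammo_pallet": 0.35, "tire_pallet": 0.25},
    "the truck":       {"truck_1": 0.9, "truck_2": 0.1},
}

# Ask a targeted question about the most uncertain phrase.
phrase = max(beliefs, key=lambda ph: entropy(beliefs[ph]))
print(f"What does '{phrase}' refer to?")  # -> What does 'the metal crate' refer to?
```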


Help me!


Please hand me the white table leg.


Solution Overview

1. Detect the failure.

2. Infer an action to fix the problem.

3. Infer a natural language sentence describing the action.

4. Replan from the updated state after the human has provided help.

Prior Work

Understanding Language: Matuszek et al., 2012; MacMahon et al., 2006; Dzifcak et al., 2009; Kollar et al., 2010; Tellex et al., 2011.

Generating Language: Jurafsky and Martin, 2008; Reiter and Dale, 2000; Striegnitz et al., 2011; Garoufi and Koller, 2011; Chen and Mooney, 2011; Golland et al., 2010; Krahmer et al., 2012.

This work: both understanding and generation.

Prior Work: Unifying Generation and Understanding

● Goodman and Stuhlmueller (2013)
– Bayesian approach to generate and understand language.
– Bag-of-words models of semantics demonstrated in simulation.

● Vogel et al. (2013)
– DEC-POMDP showing that Gricean maxims emerge from multiagent interaction.
– Bag-of-words models of semantics demonstrated in simulation.

● This work
– Bayesian approach to generate and understand grounded language for robots.
– Compositional grounded semantics demonstrated on an end-to-end robotic system.

● Dragan and Srinivasa (2012)
– Analogous mathematical framework for gesture interpretation and production.


Assembly System

● STRIPS-style symbolic planner to assemble a piece of furniture (Knepper et al., 2013).

● Pre- and post-conditions for each action:

action attach_leg_to_top(robot(Robot), leg(Leg), table_top(TableTop)) {
    pre {
        robot.arm.holding == leg;
        table_top.hole[0].attached_to == None;
    }
    post {
        robot.arm.holding = None;
        table_top.hole[0].attached_to = leg.hole;
        leg.hole.attached_to = table_top.hole[0];
    }
}
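To make the planner's use of these conditions concrete, here is a minimal Python sketch, not the authors' implementation, of how a STRIPS-style planner might test an action's preconditions against a symbolic state and apply its postconditions; the flat dict-based state encoding of attach_leg_to_top is an assumption for illustration.

```python
# Minimal STRIPS-style action application over a flat dict-based state.
def applicable(state, pre):
    """An action is applicable when every precondition holds in the state."""
    return all(state.get(var) == val for var, val in pre.items())

def apply_action(state, post):
    """Applying an action overwrites state variables with the postconditions."""
    new_state = dict(state)
    new_state.update(post)
    return new_state

# Hypothetical encoding of attach_leg_to_top from the slide.
pre  = {"robot.arm.holding": "leg", "table_top.hole[0].attached_to": None}
post = {"robot.arm.holding": None,
        "table_top.hole[0].attached_to": "leg.hole",
        "leg.hole.attached_to": "table_top.hole[0]"}

state = {"robot.arm.holding": "leg",
         "table_top.hole[0].attached_to": None,
         "leg.hole.attached_to": None}
if applicable(state, pre):
    state = apply_action(state, post)
print(state)
```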


Infer an Action

● Rule-based heuristic to generate the symbolic request from the failed symbolic condition (a lookup sketch follows the table):

Failed symbolic condition | Symbolic request
part.visible == True; | locate_part(robot, part)
robot.arm.holding == leg; | give_part(robot, part)
leg.aligned == top.hole[0]; | align_with_hole(leg, top, hole)
leg.hole.attached == top.hole[0]; | screw_in_leg(leg, top, hole)
top.upside_down == True; | flip(top)
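A minimal sketch of how such a rule table might be realized, assuming the failed condition arrives as a string; the table contents mirror the slide, everything else is illustrative.

```python
# Hypothetical rule table: failed symbolic condition -> symbolic request.
REPAIR_RULES = {
    "part.visible == True":             "locate_part(robot, part)",
    "robot.arm.holding == leg":         "give_part(robot, part)",
    "leg.aligned == top.hole[0]":       "align_with_hole(leg, top, hole)",
    "leg.hole.attached == top.hole[0]": "screw_in_leg(leg, top, hole)",
    "top.upside_down == True":          "flip(top)",
}

def symbolic_request(failed_condition):
    """Look up the help request for a failed condition, or None if unknown."""
    return REPAIR_RULES.get(failed_condition)

print(symbolic_request("robot.arm.holding == leg"))  # give_part(robot, part)
```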


Infer a Natural Language Sentence

● Input: symbolic request.
● Output: natural language request.

Template baseline:

Symbolic request | Natural language request
locate_part(robot, part) | Find the part.
give_part(robot, part) | Give me the part.
align_with_hole(leg, top, hole) | Align the part with the hole.
screw_in_leg(leg, top, hole) | Screw in the leg.
flip(top) | Flip the table.

Hand me the part.

Hand me the white leg.

Hand me the white leg that is on the table.

[Figure: the scene for “Hand me the white leg that is on the table,” with groundings γ1, γ2, γ3 labeled.]

γk are groundings: objects, places, paths, and events in the external world. Each γk corresponds to a constituent phrase in the language input.

Semantics

Understanding (forward semantics): what groundings match the language?

$\arg\max_{\gamma_1 \ldots \gamma_N} f(\gamma_1 \ldots \gamma_N, \text{language})$

In the G3 framework (Tellex et al., 2011), this is probabilistic inference over a factored model:

$\arg\max_{\gamma_1 \ldots \gamma_N} p(\gamma_1 \ldots \gamma_N \mid \text{language}) = \arg\max_{\gamma_1 \ldots \gamma_N} \frac{1}{Z} \prod_i g_i(\gamma_1 \ldots \gamma_N, \text{language})$

Searching for Groundings

“Hand me the white leg on the black table.”

$\arg\max_{\gamma_1 \ldots \gamma_N} \frac{1}{Z} \prod_i g_i(\gamma_1 \ldots \gamma_N, \text{language})$

The search scores candidate groundings for each constituent, working up from phrases to the whole command (a brute-force sketch follows). For example:

g1(γ, “the black table”) = 0.1 for one candidate table, 0.9 for another.

g1(γ, “the white leg on the black table”) = 0.1 for one candidate leg, 0.9 for another.

g1(γ, “Hand me the white leg on the black table”) = 0.1 for one candidate action, 0.9 for another.
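A minimal brute-force sketch of this grounding search; the candidate objects and factor scores are made up, standing in for learned G3 factors over a perceived scene.

```python
import math
from itertools import product

# Made-up candidate groundings per phrase (stand-ins for perceived objects).
CANDIDATES = {
    "the black table": ["table_black", "table_white"],
    "the white leg":   ["leg_white_1", "leg_black_1"],
}
# Made-up factor scores (stand-ins for learned G3 factors).
SCORES = {
    ("the black table", "table_black"): 0.9,
    ("the black table", "table_white"): 0.1,
    ("the white leg", "leg_white_1"):   0.9,
    ("the white leg", "leg_black_1"):   0.1,
}

def g(phrase, grounding):
    """Stand-in factor: how well does this grounding match this phrase?"""
    return SCORES.get((phrase, grounding), 0.1)

def best_groundings():
    """Brute-force argmax over joint assignments of the factor product."""
    phrases = list(CANDIDATES)
    assignments = product(*CANDIDATES.values())
    score = lambda gs: math.prod(g(ph, gr) for ph, gr in zip(phrases, gs))
    return dict(zip(phrases, max(assignments, key=score)))

print(best_groundings())  # {'the black table': 'table_black', 'the white leg': 'leg_white_1'}
```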

Training the Semantics Model

Prompt: “Type the words you would use to ask a person to carry out the action you see in this video.”

Example responses:
– Pick up a black table leg off of the floor.
– Pick up the black table leg.
– Pick up the black table leg.
– Walk over to the white table.
– Place black leg on white table bottom.
– Locate the black table leg on the floor by the white table.
– Find the black table leg and attach it to the white table.
– Hand me the black table leg.

Robotic Demonstrations of G3

Tellex et al., AAAI 2011; Kollar, Tellex et al., HRI 2010; Huang, Tellex et al., IROS 2010; Tellex et al., JHRI 2013; Tellex et al., MLJ 2013.

Semantics

Understanding (forward semantics): what groundings match the language?

$\arg\max_{\gamma_1 \ldots \gamma_N} f(\gamma_1 \ldots \gamma_N, \text{language})$

Generation (inverse semantics): what language specifies the groundings?

$\arg\max_{\text{language}} f(\gamma_1 \ldots \gamma_N, \text{language})$

Searching for Sentences

Context-Free Grammar:

S  → VP NP
S  → VP NP PP
NP → NP PP
NP → the white leg | the black leg | me | the white table | the black table
PP → TO NP
TO → under | on | near
VP → flip | give | pick up | place
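A minimal sketch of enumerating candidate sentences from this grammar; the recursive rule NP → NP PP is omitted here so the expansion stays finite, whereas the real search bounds candidates with beam search and scores each one with the semantics model.

```python
from itertools import product

# The slide's grammar; NP -> NP PP is omitted to keep the expansion finite.
GRAMMAR = {
    "S":  [["VP", "NP"], ["VP", "NP", "PP"]],
    "NP": [["the white leg"], ["the black leg"], ["me"],
           ["the white table"], ["the black table"]],
    "PP": [["TO", "NP"]],
    "TO": [["under"], ["on"], ["near"]],
    "VP": [["flip"], ["give"], ["pick up"], ["place"]],
}

def expand(symbol):
    """Yield every string derivable from symbol; non-keys are terminals."""
    if symbol not in GRAMMAR:
        yield symbol
        return
    for rhs in GRAMMAR[symbol]:
        for parts in product(*(list(expand(s)) for s in rhs)):
            yield " ".join(parts)

sentences = list(expand("S"))
print(len(sentences), "candidates; e.g.:", sentences[0])  # 320 candidates; e.g.: flip the white leg
```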

Example candidate: “Pick up the white leg near the black table,” with parse

(S (VP Pick up) (NP (NP the white leg) (PP (TO near) (NP the black table))))

Each constituent is grounded and scored, e.g. g1(γ1, “the black table”) = 0.1 for one candidate table and 0.9 for another. Beam search keeps the highest-scoring candidates, which are evaluated against the target action, give_part(robot, part), with groundings γ1, γ2, γ3 for the constituent phrases.

Forward Semantics

Understanding: what groundings match the language?

$\arg\max_{\gamma_1 \ldots \gamma_N} \frac{1}{Z} \prod_i g_i(\gamma_1 \ldots \gamma_N, \text{language})$

Generation: what language specifies the groundings?

$\arg\max_{\text{language},\, \gamma_1 \ldots \gamma_N} f(\gamma_1 \ldots \gamma_N, \text{language})$

Inverse Semantics

$\arg\max_{\text{language},\, \gamma_1 \ldots \gamma_N} p(\gamma_1 \ldots \gamma_N \mid \text{language}) = \arg\max_{\text{language},\, \gamma_1 \ldots \gamma_N} \frac{\prod_i g_i(\gamma_1 \ldots \gamma_N, \text{language})}{\sum_{\Gamma'} \prod_i g_i(\gamma_1' \ldots \gamma_N', \text{language})}$

Evaluating the normalizer, which sums over all grounding assignments Γ′, is equivalent to the problem of language understanding!

Inverse Semantics

For the candidate “Give the robot the white leg.”:

$\frac{0.9}{0.9 + 0.9 + 0.9 + K} \approx 0.33$

The intended grounding scores 0.9, but competing groundings score just as well, so the normalized probability is low.

For the candidate “Give the robot the white leg that is on the black table.”:

$\frac{0.7}{0.7 + 0.1 + 0.1 + K} \approx 0.78$

The more specific sentence scores lower on the intended grounding but much higher after normalization, so it is the better request.
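A minimal sketch of the inverse-semantics criterion, reusing the slide's numbers; the candidate sentences and raw factor products are assumptions, and a real system would search sentences with the CFG and score them with learned G3 factors.

```python
# Unnormalized factor products for each candidate sentence over all grounding
# assignments; index 0 is the intended grounding. Numbers mirror the slide.
CANDIDATES = {
    "Give the robot the white leg.":                            [0.9, 0.9, 0.9],
    "Give the robot the white leg that is on the black table.": [0.7, 0.1, 0.1],
}

def inverse_semantics_score(products, intended=0):
    """p(intended grounding | sentence): intended product over the normalizer."""
    return products[intended] / sum(products)

for sentence, products in CANDIDATES.items():
    print(f"{inverse_semantics_score(products):.2f}  {sentence}")  # 0.33, then 0.78

best = max(CANDIDATES, key=lambda s: inverse_semantics_score(CANDIDATES[s]))
print("Chosen request:", best)
```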


Replan From Current State

● The human may have
– helped differently than expected.
– failed to help in time.
– caused side-effects.

A sketch of such a replanning loop follows.
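A minimal sketch of the closed loop; observe_state, plan, execute, and request_help are hypothetical stand-ins for the system's perception, symbolic planner, controller, and request generator.

```python
def run(goal, observe_state, plan, execute, request_help):
    """Replan from the observed state after every step, including after help."""
    state = observe_state()
    while not goal(state):
        step = plan(state, goal)      # symbolic planner over the current state
        if step is None:              # no applicable action: ask a human for help
            request_help(state)
        else:
            execute(step)
        state = observe_state()       # never assume the help happened as requested
```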

Evaluation Overview

● Does the inverse-semantics method generate requests that are easier to understand than other methods?
– Online corpus-based evaluation.

● Does our approach work in an end-to-end system?
– User study in a real-world furniture assembly system.

Corpus-Based Evaluation

“Give me the white leg that is on the black table.”

Which video is the best response to the natural language request?

Corpus-Based Evaluation: Results

Generation Algorithm | Example | Success Rate (%)
Chance | – | 20
“Help me” | “Help me.” | 21 ± 8.0
Templates | “Hand me part 2.” | 47 ± 5.7
Inverse Semantics w/o normalizer (Approach 1) | “Give me the white leg.” | 52.3 ± 5.7
Inverse Semantics (Approach 2) | “Give me the white leg that is on the black table.” | 64.3 ± 5.4
Hand-written Request | “Take the table leg that is on the table and place it in the robot’s hand.” | 94 ± 4.7

Evaluation Overview

● Does the inverse-semantics method generate requests that are easier to understand than other methods?
– Corpus-based evaluation on AMT.

● Does our approach work in an end-to-end system?
– User study in a real-world furniture assembly system.

User Study

● Human-robot team assembled IKEA furniture in parallel for 15 minutes.

● Robots asked for help when they encountered failures.
– Three staged failures (e.g., a part out of reach on the table).
– Many unstaged failures (e.g., a part slipped out of the robot's grasp).

● The human provided whatever help they felt was appropriate.

● Robots continued operating autonomously.

User Study Results: Objective Metrics

[Bar chart: % error-free interaction, Baseline vs. Inverse Semantics, axis 0–90%; higher is better.]

User Study Results: Objective Metrics

[Bar chart: % overall success rate, Baseline vs. Inverse Semantics, axis 0–90%; higher is better.]

User Study Results: Subjective Metrics

[Bar chart: “prefer parallelism” rating, Baseline vs. Inverse Semantics, axis 0–0.9; higher is better.]


Future Work

● Planning in very large state spaces.
● Grounded dialog.

Affordance-Aware Planning

Cooking Game

Multimodal POMDPs for Collaborative Robots

● Estimate the human's mental state from language, gesture, and perceptual observations.

● Solve POMDPs with very large observation spaces.

Contributions

● Defined a Bayesian algorithm for generating natural language requests for help.

● Demonstrated in a corpus-based evaluation that people understand the generated requests better than baseline methods.

● Assessed strengths and limitations in an end-to-end system with a real-world user study.

Asking for Help Using Inverse Semantics

Stefanie Tellex, Ross Knepper, Adrian Li, Daniela Rus, Nicholas Roy

User Study Results: Subjective Metrics

[Bar chart: “effective communication” rating, Baseline vs. Inverse Semantics, axis 0–4; higher is better.]

User Study Results: Objective Metrics

[Bar chart: seconds elapsed without handoffs, Baseline vs. Inverse Semantics, axis 0–35 s.]

Training

● Collect a parallel corpus of language paired with groundings.

[Figure: “Hand me the white leg that is on the table,” with groundings γ1, γ2, γ3 labeled in the scene; each γk corresponds to a constituent phrase in the language input.]

Corpus-Based Evaluation

● Each user responded to 20 help requests.

● Five algorithms for generating requests:
– “Help me.”
– Templates
– Approach 1
– Approach 2
– Hand-written requests

● Total of 900 trials.

Limitations of Corpus-Based Evaluation

● Ambiguity between “near” and “under.”
● Canned set of 5 choices.
● No indication of how language generation behaves in the context of the full system.

User Comments

“Help me” baseline:
– “I think if the robot was clearer or I saw it assemble the desk before, I would know more about what it was asking me.”
– “Did not really feel like ‘working together as a team.’”

Inverse semantics:
– “More fun than working alone.”
– “There was a sense of being much more productive than I would have been on my own.”