Download - Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Transcript
Page 1: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Mandy Korpusik, Jim GlassMIT Computer Science and Artificial Intelligence Laboratory

Cambridge, MA, USA

ICASSP, Brighton, UKMay 16, 2019

Dialogue State Tracking with Convolutional Semantic

Taggers

Page 2: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Motivation: Spoken Diet Tracking

2

Coco Nutritionist lets you record what you ate with everyday spoken natural language.

Page 3: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

3

Page 4: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Motivation: Nutrition Multi-turn Dialogue

Nutrition question answering– is grilled chicken or red meat better?– What should I eat for dinner?– What is a healthy breakfast– Which cereal is best to keep you satisfied?– How many calories in ## of food item– Is milk healthy? …

Personalized food recommendation(Korpusik et al., CBRecSys, 2016)

4

Page 5: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Nutrition question answering– is grilled chicken or red meat better?– What should I eat for dinner?– What is a healthy breakfast– Which cereal is best to keep you satisfied?– How many calories in ## of food item– Is milk healthy? …

Personalized food recommendation(Korpusik et al., CBRecSys, 2016)

4

Motivation: Nutrition Multi-turn Dialogue

Page 6: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Overview

• Motivation: Nutrition• Introduction• Our work in 3 state tracking challenges:

– DSTC7– DSTC6– DSTC2

• Conclusion

5

Page 7: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Introduction: Spoken Dialogue Systems

Semantic Tagging

Intent Detection Booking a restaurant

RestaurantKnowledge Base

can you book a table for two in bombay in a cheap price range

6

Database Retrieval

Page 8: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Introduction: Spoken Dialogue Systems

7

Our Goal: Develop ability to do dialogue state tracking.

Page 9: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Introduction: Dialogue State Tracking

8

System: any preference on a type of cuisine

Page 10: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Dialogue State Tracking Challenges (DSTC)

• DSTC1 (2013): human-computer bus timetables• DSTC2 and 3 (2014): human-computer restaurant info• DSTC4 (2015): human-human tourist info• DSTC5 (2016): multilingual tourist info• DSTC6 (2017): 3 tracks, end-to-end learning• DSTC7 (2019): 3 tracks (response selection, generation,

and audio-visual)

9

Page 11: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Dialogue State Tracking Challenges (DSTC)

• DSTC1 (2013): human-computer bus timetables• DSTC2 and 3 (2014): human-computer restaurant info• DSTC4 (2015): human-human tourist info• DSTC5 (2016): multilingual tourist info• DSTC6 (2017): 3 tracks, end-to-end learning• DSTC7 (2019): 3 tracks (response selection, generation,

and audio-visual)

9

Page 12: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

DSTC7

Student-Advisor Partial Dialogue:

10

(Korpusik et al., DSTC7 2019)

Page 13: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

DSTC7: Convolutional Neural Encoder

11

(Korpusik et al., DSTC7 2019)

Page 14: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

DSTC6

12

Goal: select the best system response.

(Bordes & Weston, 2016)

Page 15: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Task 1: API Call

13

api_call indian bombay two cheap casual

RestaurantKnowledge Base

Page 16: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Task 2: Updating API Call

14

Page 17: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

DSTC6 Data

15

• 10,000 simulated training dialogues per task• KB of restaurants

– 10 cuisines– 10 locations– 3 price ranges– 4 party sizes

Page 18: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

DSTC6: Related Work

• 2 challenge participants achieved 100% on all tasks:– Extended Hybrid Code Networks for DSTC6 (Ham et al., 2017)

– Modeling Conversations to Learn Responding Policies (Bai et al., 2017)

16

Page 19: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

DSTC6: Related Work

• 2 challenge participants achieved 100% on all tasks:– Extended Hybrid Code Networks for DSTC6 (Ham et al., 2017)– Modeling Conversations to Learn Responding Policies (Bai et

al., 2017)

16

Page 20: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

DSTC6: Our Binary CNN Baseline

17

Page 21: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

DSTC6: Our Full CNN Architecture

18

Approach: 1. Select action template with CNN.2. Populate action template with CNN-predicted semantic tags.

Page 22: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

DSTC6: Our Full CNN Architecture

19

Approach: 1. Select action template with CNN.2. Populate action template with CNN-predicted semantic tags.

Page 23: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Step 1: Semantic Tagging

20

(Korpusik et al., SLT 2014)(Korpusik et al., ICASSP 2016)(Korpusik et al., Trans. ASLP 2017)

Problem: prior state-of-the-art Conditional Random Field (CRF) model requires hand-crafted features.

Solution: use a neural network to automatically learn features during training.

Page 24: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Step 1: Semantic Tagging

21

Page 25: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Step 1: Generating Tagging Data

22

api_call indian bombay two cheap casual…

Page 26: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Step 1: Semantic Tagging Results

23

Page 27: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Step 2: Action Template Selection

24

Page 28: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Step 3: Final Response Generation1) Action mask

api_call: masked out if any slots are still unspecified.request_api_slot: masked out if all slots are specified.

25

2) Use dialogue stateapi_call: populate slots with values in dialogue state.

request_api_slot: select the next slot missing a value.

api_call indian bombay two casualcheap

Page 29: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

DSTC6: Test Results

26

100% precision on both tasks

Page 30: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

From Simulated to Real Data: WOZ 2.0Task: predict all the user’s requested and informableslots at each turn in a restaurant booking dialogue.

27

(Henderson et al., SLT 2014)Neural Belief Tracker: (Mrksic et al., ACL 2017)GLAD: (Zhong et al., ACL 2018)

(Korpusik et al., SLT 2018)

Page 31: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

From Simulated to Real Data: WOZ 2.0

Our Goal: Show our NN models generalize to real dialogues.

28

(Korpusik et al., SLT 2018)

Page 32: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

From Simulated to Real Data: WOZ 2.0

29

(Korpusik et al., SLT 2018)

Separately trained informable slot models.

Page 33: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

From Simulated to Real Data: WOZ 2.0

30

CNN is competitive with state-of-the-art, without requiring semantic dictionaries or pre-trained word vectors.

Page 34: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Conclusion

31

Demonstrated our neural network models’ ability to do dialogue state tracking in several domains.

Future Work:• Experiment on the remaining DSTC6 subtasks.• Jointly train tagger and action selector as end-to-end model.• Automatically learn action mask by adding a feature to action

selector model indicating whether all slots have values.• Apply these techniques to the nutrition domain!