Anjuli Kannan, Software Engineer, Google at MLconf SF 2016
Transcript of Anjuli Kannan, Software Engineer, Google at MLconf SF 2016
Confidential + Proprietary
Smart Reply: Learning a Model of Conversation from Data
Anjuli Kannan, Software Engineer, Google Brain
Problem
Can you do Tuesday or Wednesday?
Phil Sharp
Suggested replies: Tuesday / Wednesday
Smart Reply feature
● Provide text assistance for email reply composition
● Targeted at mobile
● Responses can be sent on their own or extended
Smart Reply feature predicts email responses
Incoming email → Smart Reply → Response email
Why is this task hard?
● extracting meaning from previous message
● generating language
● grammatical transformations between call and response
● matching style/tone
Why is this solution interesting?
● Model is learned fully from data
Model
Neural network
[Figure: a network classifying a handwritten digit, with outputs "Is a 4" and "Is a 5". Image: Wikipedia]
Basic building block is the neuron
[Figure: a single neuron within the network. Diagram: Greg Corrado]
Learn a function from one space to another
f(·): x ∈ Rⁿ → y ∈ Rᵐ
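As a minimal sketch of this idea, a one-layer network is just a parametric function from Rⁿ to Rᵐ. The weights below are random stand-ins (in practice they are learned from data); the dimensions n and m are arbitrary choices for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# A one-layer network as a function f: R^n -> R^m.
# W and b are stand-in random parameters; the real ones are learned.
n, m = 8, 2
W = rng.normal(scale=0.1, size=(m, n))
b = np.zeros(m)

def f(x):
    """Map a point in R^n to a point in R^m through one nonlinear layer."""
    return np.tanh(W @ x + b)

y = f(rng.normal(size=n))
print(y.shape)  # (2,)
```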
Smart Reply feature predicts email responses
Incoming email → Smart Reply → Response email
Recurrent neural networks handle sequences of input
Diagram by Felix Gers
Reading a word into a feed-forward neural network
"cat" → network → output
Reading a sequence of words into an RNN
That → is → good → ! → output
(words are fed in one at a time; the network's state carries the sequence so far)
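The word-at-a-time reading above can be sketched as a vanilla RNN in a few lines of numpy. The vocabulary, dimensions, and random parameters here are illustrative stand-ins, not the talk's actual model.

```python
import numpy as np

rng = np.random.default_rng(0)

vocab = {"That": 0, "is": 1, "good": 2, "!": 3}
d_emb, d_hid = 4, 5

# Stand-in random parameters; the real ones are learned from data.
E   = rng.normal(scale=0.1, size=(len(vocab), d_emb))  # word embeddings
W_x = rng.normal(scale=0.1, size=(d_hid, d_emb))       # input weights
W_h = rng.normal(scale=0.1, size=(d_hid, d_hid))       # recurrent weights

def read_sequence(words):
    """Feed words into the RNN one at a time; the state carries the sequence."""
    h = np.zeros(d_hid)
    for w in words:
        h = np.tanh(W_x @ E[vocab[w]] + W_h @ h)
    return h

state = read_sequence(["That", "is", "good", "!"])
print(state.shape)  # (5,)
```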
Sequence-to-sequence model
Sutskever et al, NIPS 2014
Encoder: ingests incoming message
Decoder: generates reply message
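The encoder/decoder pairing can be sketched structurally as two RNNs sharing a hidden state, the decoder starting from the encoder's final state. All parameters, sizes, and token ids below are hypothetical; in the real model everything is learned end-to-end.

```python
import numpy as np

rng = np.random.default_rng(0)
V, d = 6, 5  # toy vocabulary size and hidden/embedding size

# Hypothetical stand-in parameters; in practice these are learned.
E    = rng.normal(scale=0.1, size=(V, d))  # shared word embeddings
W_xe = rng.normal(scale=0.1, size=(d, d))  # encoder input weights
W_he = rng.normal(scale=0.1, size=(d, d))  # encoder recurrent weights
W_xd = rng.normal(scale=0.1, size=(d, d))  # decoder input weights
W_hd = rng.normal(scale=0.1, size=(d, d))  # decoder recurrent weights
W_o  = rng.normal(scale=0.1, size=(V, d))  # output projection to vocabulary

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def encode(token_ids):
    """Encoder ingests the message; final state is a fixed-length encoding."""
    h = np.zeros(d)
    for t in token_ids:
        h = np.tanh(W_xe @ E[t] + W_he @ h)
    return h

def decode_step(h, prev_token):
    """Decoder advances one step and emits a distribution over next words."""
    h = np.tanh(W_xd @ E[prev_token] + W_hd @ h)
    return h, softmax(W_o @ h)

h = encode([0, 1, 2, 3])   # e.g. "How are you ?"
h, p = decode_step(h, 4)   # 4 = hypothetical start-of-reply token
print(p.shape)             # distribution over the vocabulary
```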
Inference
Encoder ingests the incoming message: How → are → you → ?
Internal state is a fixed-length encoding of the message
Decoder is initialized with final state of encoder: How are you ? __
Decoder predicts next word: How are you ? __ → I
Smart Reply model
Message (read by encoder): How are you ?
Response (generated by decoder, one word at a time): I → I am → I am great → I am great !
Vinyals & Le, ICML DL 2015
Inference
● Resulting model is fully generative
● Output distribution can be used to determine the most likely responses using a beam search
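A beam search over the decoder's output distribution can be sketched as follows. The `step_fn` interface and the toy probability table are assumptions for illustration; in the real system the distribution comes from the trained decoder.

```python
import math

def beam_search(step_fn, start, eos, beam_width=3, max_len=10):
    """Keep the `beam_width` best partial replies by total log-probability.

    `step_fn(prefix)` returns a dict of next-token -> probability; in the
    real model this would be the decoder's output distribution.
    """
    beams = [([start], 0.0)]
    finished = []
    for _ in range(max_len):
        candidates = []
        for seq, score in beams:
            for tok, p in step_fn(seq).items():
                candidates.append((seq + [tok], score + math.log(p)))
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = []
        for seq, score in candidates[:beam_width]:
            (finished if seq[-1] == eos else beams).append((seq, score))
        if not beams:
            break
    return sorted(finished + beams, key=lambda c: c[1], reverse=True)

# Toy distribution standing in for the decoder's output.
probs = {
    ("<s>",): {"I": 0.6, "Sure": 0.4},
    ("<s>", "I"): {"am": 1.0},
    ("<s>", "I", "am"): {"great": 0.7, "good": 0.3},
}
step_fn = lambda seq: probs.get(tuple(seq), {"</s>": 1.0})

best = beam_search(step_fn, "<s>", "</s>")[0][0]
print(" ".join(best[1:-1]))  # I am great
```

Note that the highest-probability full reply ("I am great", 0.42) beats the shorter "Sure" (0.4), which greedy one-word-at-a-time decoding could miss.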
Training
● Training data is a corpus of email-reply pairs
● Both encoder and decoder are trained together (end-to-end)
Key points about model
● Everything is learned from data, even features
● Neural network smooths across language variation
Smart Reply in Production
Deployment & coverage
● Deployed in Inbox by Gmail
● Used to assist with more than 10% of all mobile replies
Examples
Quality
● How do we ensure that the response options are always high quality in content and language?
○ Avoid incorrect grammar, mechanics, and misspellings, e.g., "your the best"
○ Avoid inappropriate, offensive responses, e.g., "Leave me alone."
○ Deal with wide variability and informal language, e.g., "got it thx"
● Restricting model vocabulary is not sufficient!
Solution: Restrict to a fixed set of valid responses, derived automatically from data.
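Given such a fixed set of valid responses, the final step is just a filter over the model's scored candidates. The whitelist contents and candidate scores below are hypothetical; in the real system the set is derived automatically from data and the scores come from the beam search.

```python
def top_suggestions(scored_responses, valid_responses, k=3):
    """Keep only responses in a fixed, data-derived valid set,
    then return the k highest-scoring ones."""
    kept = [(r, s) for r, s in scored_responses if r in valid_responses]
    kept.sort(key=lambda rs: rs[1], reverse=True)
    return [r for r, _ in kept[:k]]

# Hypothetical whitelist and beam-search candidates (scores are log-probs).
valid_responses = {"Tuesday works for me.", "Wednesday works for me.", "I can do either."}
candidates = [
    ("Tuesday works for me.", -0.4),
    ("tuesady wrks", -0.6),        # malformed model output is filtered out
    ("I can do either.", -0.9),
]
print(top_suggestions(candidates, valid_responses))
# ['Tuesday works for me.', 'I can do either.']
```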
Most frequently used clusters
What the model can do
What the model can't do
● Match every user's tone and style
● Ensure diverse options
● Access and update any kind of state or knowledge base
Conclusions
● Sequence-to-sequence produces plausible email replies in many common scenarios, when trained on an email corpus
● Smart Reply is deployed in Inbox by Gmail and generates more than 10% of mobile replies
● A conversation model learned entirely from data is very powerful
● A data-driven approach can be complementary to hand-crafted rules and scenarios
Collaborators
- Greg Corrado, Oriol Vinyals (Google Brain)
- Balint Miklos, Tobias Kaufman, Laszlo Lukacs, and Karol Kurach (Gmail)
- Sujith Ravi (Google Research)
Thank you!
Extra slides
Example
Unique cluster and suggestion usage
Ranking experiments