Amazon AI (March 2017)

40
Hands-on with Amazon AI Julien Simon Principal Technical Evangelist [email protected] @julsimon

Transcript of Amazon AI (March 2017)

Page 1: Amazon AI (March 2017)

Hands-on with Amazon AI Julien Simon"Principal Technical Evangelist [email protected] @julsimon

Page 2: Amazon AI (March 2017)

Artificial Intelligence At Amazon

Thousands Of Employees Across The Company Focused on AI

Discovery & Search

Fulfilment & Logistics

Enhance Existing Products

Define New Categories Of

Products

Bring Machine Learning To All

Page 3: Amazon AI (March 2017)
Page 4: Amazon AI (March 2017)

Amazon AI: Three New Deep Learning Services

Polly Rekognition Lex

Life-like Speech Image Analysis Conversational Engine

Page 5: Amazon AI (March 2017)

Amazon Polly

Page 6: Amazon AI (March 2017)

What is Amazon Polly

•  A service that converts text into lifelike speech

•  Offers 47 lifelike voices across 24 languages

•  Low latency responses enable developers to build real-time systems

•  Developers can store, replay and distribute generated speech

Page 7: Amazon AI (March 2017)

Amazon Polly: Quality

Natural sounding speech A subjective measure of how close TTS output is to human speech.

Accurate text processing Ability of the system to interpret common text formats such as abbreviations, numerical sequences, homographs etc.

Today in Las Vegas, NV it's 54°F. "We live for the music", live from the Madison Square Garden. Highly intelligibile

A measure of how comprehensible speech is. ”Peter Piper picked a peck of pickled peppers.”

Page 8: Amazon AI (March 2017)

Amazon Polly: Language Portfolio Americas: •  Brazilian Portuguese •  Canadian French •  English (US) •  Spanish (US)

A-PAC: •  Australian English •  Indian English •  Japanese

EMEA: •  British English •  Danish •  Dutch •  French •  German •  Icelandic •  Italian •  Norwegian •  Polish •  Portuguese •  Romanian •  Russian •  Spanish •  Swedish •  Turkish •  Welsh •  Welsh English

Page 9: Amazon AI (March 2017)

Amazon Polly features: SSML

Speech Synthesis Markup Language is a W3C recommendation, an XML-based markup language for speech synthesis applications

<speak>

My name is Kuklinski. It is spelled

<prosody rate='x-slow'>

<say-as interpret-as="characters">Kuklinski</say-as>

</prosody>

</speak>

Page 10: Amazon AI (March 2017)

Amazon Polly features: Lexicons

Enables developers to customize the pronunciation of words or phrases My daughter’s name is Kaja.

<lexeme> <grapheme>Kaja</grapheme> <grapheme>kaja</grapheme> <grapheme>KAJA</grapheme> <phoneme>"kaI.@</phoneme>

</lexeme>

Page 11: Amazon AI (March 2017)

TEXT

Market grew by > 20%. WORDS PHONEMES

{ { { { {ˈtwɛn.ti pɚ.ˈsɛnt ˈmɑɹ.kət ˈgɹu baɪ ˈmoʊɹ

ˈðæn

PROSODY CONTOUR UNIT SELECTION AND ADAPTATION

TEXT PROCESSING

PROSODY MODIFICATION STREAMING

Market grew by more than

twenty percent

Speech units inventory

Page 12: Amazon AI (March 2017)

Polly Demo

Page 13: Amazon AI (March 2017)

Amazon Lex

Page 14: Amazon AI (March 2017)

Developer Challenges

Speech Recognition Language

Understanding

Business Logic

Disparate Systems

Authentication

Messaging platforms

Scale Testing

Security

Availability

Mobile

Conversational interfaces need to combine a large number of sophisticated algorithms and technologies

Page 15: Amazon AI (March 2017)

Text and Speech Language Understanding

Speech Recognition

Natural Language Understanding

Powered by the same Deep Learning technology as Alexa

Page 16: Amazon AI (March 2017)

Deployment to Chat Services

Amazon Lex

Facebook Messenger

Card Description Button 1 Button 2 Button 3

Card Description

Option 1

Option 2

Authentication

Rich Formatting One-Click Deployment

Mobile

Page 17: Amazon AI (March 2017)

Lex Bot Structure

Utterances Spoken or typed phrases that invoke your intent

BookHotel Intents An Intent performs an action in response to natural language user input

Slots Slots are input data required to fulfill the intent

Fulfillment Fulfillment mechanism for your intent

Page 18: Amazon AI (March 2017)

Utterances

I’d like to book a hotel

I want to make my hotel reservations

I want to book a hotel in New York City

Can you help me book my hotel?

Page 19: Amazon AI (March 2017)

Slots

Destination City New York City, Seattle, London, …

Slot Type Values

Check In Date Valid dates

Check Out Date Valid dates

Page 20: Amazon AI (March 2017)

Slot Elicitation

I’d like to book a hotel

What date do you check in?

New York City

Sure what city do you want to book?

Nov 30th Check In 11/30/2016

City New York City

Page 21: Amazon AI (March 2017)

Fulfillment

AWS Lambda Integration Return to Client

User input parsed to derive intents and slot values. Output

returned to client for further processing.

Intents and slots passed to AWS Lambda function for

business logic implementation.

Page 22: Amazon AI (March 2017)

“Book a Hotel”

Book Hotel NYC

“Book a Hotel in NYC”

Automatic Speech Recognition

Hotel Booking

New York City

Natural Language Understanding

Intent/Slot Model

Utterances Hotel Booking City New York City Check In Nov 30th Check Out Dec 2nd

“Your hotel is booked for Nov 30th”

Polly Confirmation: “Your hotel is booked for Nov 30th”

a in

“Can I go ahead with the booking?

Page 23: Amazon AI (March 2017)

Amazon Lex - Technology

Amazon Lex

Automatic Speech Recognition (ASR)

Natural Language Understanding (NLU)

Same technology that powers Alexa

Cognito CloudTrail CloudWatch

AWS Services

Action AWS Lambda

Authentication & Visibility

Speech API

Language API

Fulfillment

End-Users

Developers

Console

SDK

Intents, Slots, Prompts, Utterances

Input: Speech or Text

Multi-Platform Clients: Mobile, IoT, Web,

Chat

API

Response: Speech (via Polly TTS) or Text

Page 24: Amazon AI (March 2017)

Lex Demo

Page 25: Amazon AI (March 2017)

Amazon Rekognition

Page 26: Amazon AI (March 2017)

Amazon Rekognition Deep learning-based image recognition service Search, verify, and organize millions of images

Object and Scene Detection

Facial Analysis

Face Comparison

Facial Recognition

Page 27: Amazon AI (March 2017)

Amazon Rekognition API

DetectLabels

Object and Scene Detection Detect objects, scenes, and concepts in images

Page 28: Amazon AI (March 2017)

Amazon Rekognition API

DetectLabels

{ "Confidence": 94.62968444824219, "Name": "adventure" }, { "Confidence": 94.62968444824219, "Name": "boat" }, { "Confidence": 94.62968444824219, "Name": "rafting" }, . . .

Page 29: Amazon AI (March 2017)

Amazon Rekognition API Facial Analysis

Detect face and key facial characteristics

DetectFaces

Page 30: Amazon AI (March 2017)

Amazon Rekognition API

[ {

"BoundingBox": { "Height": 0.3449999988079071, "Left": 0.09666666388511658, "Top": 0.27166667580604553, "Width": 0.23000000417232513 }, "Confidence": 100, "Emotions": [ {"Confidence": 99.1335220336914, "Type": "HAPPY" }, {"Confidence": 3.3275485038757324, "Type": "CALM"}, {"Confidence": 0.31517744064331055, "Type": "SAD"} ], "Eyeglasses": {"Confidence": 99.8050537109375, "Value": false}, "EyesOpen": {Confidence": 99.99979400634766, "Value": true}, "Gender": {"Confidence": 100, "Value": "Female”}

DetectFaces

Page 31: Amazon AI (March 2017)

Demographic Data

Facial Landmarks

Sentiment Expressed

Image Quality

Facial Analysis

Brightness: 25.84 Sharpness: 160

General Attributes

Page 32: Amazon AI (March 2017)

CompareFaces

Amazon Rekognition API Face Comparison

Face-based user verification

Page 33: Amazon AI (March 2017)

Amazon Rekognition API

CompareFaces

{ "FaceMatches": [ {"Face": {"BoundingBox": { "Height": 0.2683333456516266, "Left": 0.5099999904632568, "Top": 0.1783333271741867, "Width": 0.17888888716697693}, "Confidence": 99.99845123291016}, "Similarity": 96 }, {"Face": {"BoundingBox": { "Height": 0.2383333295583725, "Left": 0.6233333349227905, "Top": 0.3016666769981384, "Width": 0.15888889133930206}, "Confidence": 99.71249389648438}, "Similarity": 0 } ], "SourceImageFace": {"BoundingBox": { "Height": 0.23983436822891235, "Left": 0.28333333134651184, "Top": 0.351423978805542, "Width": 0.1599999964237213}, "Confidence": 99.99344635009766} }

Page 34: Amazon AI (March 2017)

Face Comparison

Page 35: Amazon AI (March 2017)

Amazon Rekognition API Face Recognition

Index and Search faces in a collection

Index

Search

Collection

IndexFaces

SearchFacesByImage

Page 36: Amazon AI (March 2017)

Amazon Rekognition API

f7a3a278-2a59-5102-a549-a12ab1a8cae8 & v1

02e56305-1579-5b39-ba57-9afb0fd8782d & v2

Face ID & vector<float> Face

4c55926e-69b3-5c80-8c9b-78ea01d30690 & v3

tran

sfor

med

stor

ed

{ f7a3a278-2a59-5102-a549-a12ab1a8cae8, 02e56305-1579-5b39-ba57-9afb0fd8782d, 4c55926e-69b3-5c80-8c9b-78ea01d30690 }

IndexFace Collection

Page 37: Amazon AI (March 2017)

Amazon Rekognition API

Face

{ f7a3a278-2a59-5102-a549-a12ab1a8cae8, 02e56305-1579-5b39-ba57-9afb0fd8782d, 4c55926e-69b3-5c80-8c9b-78ea01d30690 }

SearchFacebyImage Collection Nearest neighbor

search

Face ID

Page 38: Amazon AI (March 2017)

Face Recognition

Page 39: Amazon AI (March 2017)

Rekognition Demo

Page 40: Amazon AI (March 2017)

Julien Simon [email protected] @julsimon

Your feedback is important to us!