Breaking language barriers with AI - Berlin Summit Marketing... · Bill payments Service updates...

60
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. SUMMIT Breaking Language Barriers with AI Boaz Ziniman Technical Evangelist Amazon Web Service @Ziniman ziniman

Transcript of Breaking language barriers with AI - Berlin Summit Marketing... · Bill payments Service updates...

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Breaking Language Barriers with AI

Boaz ZinimanTechnical EvangelistAmazon Web Service

@Ziniman ziniman

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Natural language processing (NLP)• Automatic Speech Recognition (ASR)• Natural Language Understanding (NLU)• Text to Speech• Translation

1970 1980 1990 2000 2010 2020

HUMAN ACCURACY

50% 55%60% 62%

70%

95%

Source: MindMeld

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Common Language Use Cases

Information Bots

Education

Accessibility

Knowledge Management Voice of Customer Applications

Customer Service/Call Centers

Enterprise Digital Assistant

Semantic Search

Captioning Workflows

LocalizationPersonalization

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

M L F R A M E W O R K S &

I N F R A S T R U C T U R E

The Amazon ML Stack: Broadest & Deepest Set of Capabilities

A I S E R V I C E SR E K O G N I T I O N

I M A G EP O L L Y T R A N S C R I B E T R A N S L A T E C O M P R E H E N D

& C O M P R E H E N D M E D I C A L

L E XR E K O G N I T I O NV I D E O

Vision Speech Chatbots

A M A Z O N S A G E M A K E R

B U I L D T R A I N

F O R E C A S TT E X T R A C T P E R S O N A L I Z E

D E P L O Y

Pre-built algorithms & notebooks

Data labeling (G R O U N D T R U T H )

One-cl ick model training & tuning

Optimization (N E O )

One-cl ick deployment & hostingM L S E R V I C E S

F r a m e w o r k s I n t e r f a c e s I n f r a s t r u c t u r e

E C 2 P 3 & P 3 d n

E C 2 C 5 F P G A s G R E E N G R A S S E L A S T I CI N F E R E N C E

Reinforcement learningAlgorithms & models ( A W S M A R K E T P L A C E F O R M A C H I N E L E A R N I N G )

Language Forecasting Recommendations

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

AI Services

Pre-trained AI services that require no ML skills or training

Easily add intelligence to your existing apps and workflows

Quality and accuracy from continuously-learning APIs

A I S E R V I C E SR E K O G N I T I O N

I M A G EP O L L Y T R A N S C R I B E T R A N S L A T E C O M P R E H E N D

& C O M P R E H E N D M E D I C A L

L E XR E K O G N I T I O NV I D E O

Vision Speech Chatbots

F O R E C A S TT E X T R A C T P E R S O N A L I Z E

Language Forecasting Recommendations

SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.

Turn text into lifelike speech using deep learning

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Amazon Polly – Use Cases

Contact Centers

Special NeedsAI Assistant

Voiced videosand presentations

Languagelearning

Amazon Polly

Navigation

Podcasting,Voiced blogs

and news articles

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

“Today in Seattle, WA it’s 11°F”

Amazon Polly: Text In, Life-like Speech Out

52 voices across 25 languages

“Today in Seattle Washington it’s 11 degrees Fahrenheit”

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Amazon Polly: Text In, Life-like Speech Out

52 voices across 25 languages

“Today in Mumbai, India it’s 32°C”

“Today in Mumbai, India it’s 32 degrees Celcius”

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Amazon Polly: Text In, Life-like Speech Out

52 voices across 25 languages

“Esta é a minha 2ª visita ao WebSummit

Lisboa”

“Esta é a minha segundavisita ao WebSummit

Lisboa”

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

A Focus On Voice Quality & Pronunciation1. Automatic, Accurate Text Processing

“Today in Seattle, WA, it’s 11°F”

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

A Focus On Voice Quality & Pronunciation1. Automatic, Accurate Text Processing2. Intelligible and Easy to Understand

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

A Focus On Voice Quality & Pronunciation1. Automatic, Accurate Text Processing2. Intelligible and Easy to Understand3. Add Semantic Meaning to Text

SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.

A Focus On Voice Quality & Pronunciation

https://www.w3.org/TR/speech-synthesis/

<speak>

The spelling of my name is

<prosody rate='x-slow'>

<say-as interpret-as="characters">Boaz</say-as>

</prosody>

</speak>

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

A Focus On Voice Quality & Pronunciation

1. Automatic, Accurate Text Processing2. Intelligible and Easy to Understand3. Add Semantic Meaning to Text

“Richard’s number is 2122341237“

<say-as interpret-as="telephone">

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Voice Modification: Vocal Tract Length<speak>

This is Brian without any voice modifications.

<amazon:effect vocal-tract-length="+15%"> Imagine now that I got bigger… </amazon:effect>

<amazon:effect vocal-tract-length="+25%"> Suppose that I got even bigger still… </amazon:effect>

Now let's go back and hear the effect when I go in the opposite direction.

<amazon:effect vocal-tract-length="-15%"> Can you tell that I'm getting smaller? </amazon:effect>

<amazon:effect vocal-tract-length="-25%"> Now I'm even smaller than before. </amazon:effect>

</speak>

“This is a new technology that can give usersmore choice and better accessibility to our

content, so we wanted to create an experiment to dive deeper into the user

experience. After a month, we’ll take whatwe’ve learned about how users engage withthis feature to develop our first iteration of a

product with Amazon Polly.“

Joseph Price, Product ManagerThe Washington Post

https://www.washingtonpost.com/pr/wp/2017/06/09/the-washington-post-to-start-

experimenting-with-audio-articles-using-amazon-polly

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Navigation

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

AI Radio Announcer

• FM Wakayama: Japanese radio station

• Amazon Polly Japanese voices are used for

regularly scheduled broadcasts throughout

the week

• Emergency broadcast during 12/09 Typhoon

SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.

Polly API exampleaws polly synthesize-speech \--output-format mp3 --voice-id Matthew --text-type ssml \--text '<speak>

<amazon:auto-breaths><prosody rate="x-slow" pitch="low">Here is my little secret.</prosody><amazon:breath duration="long" volume="x-loud"/><amazon:effect name="whispered">

<prosody rate="x-slow"><prosody pitch="x-low">I</prosody>killed Mufasa!

</prosody></amazon:effect>

</amazon:auto-breaths></speak>' \mufasa.mp3

SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.

Automatic speech recognition

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Automatic speech recognition service

“Hello, this is Allan speaking”

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Amazon Transcribe – Key Features

Channel Identification

Custom vocabulary

Speaker Identification

Word-level time stamps

Punctuation and capitalization

Word-level confidence scores

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

ringDNARingDNA is an end-to-end communications platform for sales teams. Hundreds of enterprise organizations use RingDNA to increase productivity, engage in smarter sales conversations, gain predictive sales insights and improve their win rate.

Speech to Text"A critical component of RingDNA’s Conversation AI requires best of breed speech-to-text to deliver transcriptions of every phone call. RingDNA is excited about Amazon Transcribe since it provides high-quality speech recognition at scale, helping us to better transcribe every call to text"

Howard Brown, CEO & Founder, RingDNAhttps://www.youtube.com/watch?v=1ZJ_f1bDdog

SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.

Natural and accurate language translation

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

21 Languages417 Combinations

Key Features

Real-time < 500ms / sentence on average< 150ms / conversational / short form

Tag HandlingXML tags placement maintains styling and formatting through translation

< / >

Data SecurityData ownershipEncryptionAccess Management

Ease of UseSimple API calls and partner solutions

$15/1M charactersOr $0.000075 per word;Pay as you go, 2M characters monthly free tier

HIPAA Eligible

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Amazon Translate Natural and fluent language translation

“Hello, what’s up? Do you want to go see a

movie tonight?”

"Olá, quer ir ver um filmehoje à noite?"

SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.

Translate API example

boazz: ~$ aws translate translate-text \

--text "Hello, what’s up? Do you want to go see a movie tonight?" \

--source-language-code auto --target-language-code pt

{

"TargetLanguageCode": "pt",

"TranslatedText": "Olá, quer ir ver um filme hoje à noite?",

"SourceLanguageCode": "en"

}

SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.

Translate API exampleimport boto3translate = boto3.client("translate")lang_flag_pairs = [("fr", "!"), ("de", """),

("es", "#"), ("pt", "$"),("zh", "%"), ("ar", "&"),("ja", "'"), ("ru", "("),("it", ")"), ("zh-TW", "*"),("tr", "+"), ("cs", ",")]

for lang, flag in lang_flag_pairs:print(flag)print(translate.translate_text(

Text="Hello, World",SourceLanguageCode="en",TargetLanguageCode=lang

)['TranslatedText'])

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Translate API example!

Bonjour, Monde"

Hallo, Welt#

Hola, Mundo$

Olá, Mundo%

�� ��

&

ملاعلا ،ابحرم

'

�����

(

Привет, Мир)

Ciao, Mondo*

���,��+

Merhaba, Dünya.,

Ahoj, světe.

https://github.com/ziniman/aws-translate-demo

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Hotels.com

Matt Fryer, VP and Chief Data Science Officer, Hotels.com

Machine Translation

At Hotels.com, we are committed to offering all of our customers the most relevant and up to date information about their destination. To achieve that, we operate 90 localized websites in 41 languages. We have more than 25M customer reviews and more are coming in every day, making a great candidate for machine translation. Having evaluated Amazon Translate and several other solutions, we believe that Amazon Translate presents a quick, efficient and most importantly, accurate solution.

SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.

https://github.com/ziniman/aws-translate-demo

SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.

Discover insights and relationships in text

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Amazon ComprehendD i s c o v e r i n s i g h t s a n d r e l a t i o n s h i p s i n t e x t

Entities

Key Phrases

Language

Sentiment

Syntax

Grouping

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Topic modeling

STORM

WORLD SERIES

AUSTRALIASTOCK MARKET

WASHINGTON

HEALTH

CRISIS MACHINE LEARNING

LIBRARY OFNEWS ARTICLES *

SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.

Accurately extract health information from patient notes, clinical trial reports, and other electronic health records using Amazon Comprehend

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Amazon Comprehend Medical

Entities

Medication

Medical condition

Test, treatments, and procedures anatomy

Protected Health Information (PHI)

Relationship extraction

Medication

Test, treatments, and procedures

Entity traits

Negation

Diagnosis signs and symptom

SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.

Run Amazon Comprehend on S3 Bucketimport boto3

import json

s3 = boto3.resource('s3’)

bucket_name = ‘my_bucket’

region_name = ‘us-east-1’

bucket = s3.Bucket(bucket_name)

comprehend = boto3.client(service_name='comprehend', region_name=region)

for obj in bucket.objects.all():

body = obj.get()['Body'].read()

text = body

sentiment_response = comprehend.detect_sentiment(Text=text, LanguageCode='en’)

print(json.dumps(sentiment_response, sort_keys=True, indent=4))

SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.

https://github.com/ziniman/aws-comprehend-demo

SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.

Conversational interfaces for your applications powered by the same deep learning technologies as Alexa

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Advent of conversational interactions

2nd gen: Pointers & sliders

3nd gen: Conversational

interfaces

1st gen: Punch cards & memory

registers

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Amazon Lex – use cases

CONTACT CENTER BOTS

Customer service IVR

Account inquiriesBill payments

Service updates

Single Sign OnUsers / Roles

GroupsAuditing / MonitoringRisk & Compliancy

Insights

SECURITYINFORMATIONAL BOTS

Answer questions

News updatesWeather information

Game scores

APPLICATION BOTS

Conversational interfaces

Book ticketsOrder food

Manage bank accounts

Single Sign OnUsers / Roles

GroupsAuditing / MonitoringRisk & Compliancy

Insights

SECURITYPRODUCTIVITY BOTS

Enterprise efficiencies

Check sales numbersInventory statusExpense reports

IoT BOTS

Device interactions

KiosksAppliances

Auto

A service for building conversational interfaces into your applications using voice and text

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Lex Use Case: Digital Assistant to Book a HotelBook hotel

NYC

“Book a hotel in NYC”

Automatic speech recognition

Hotel booking

New York City

Natural language understanding

Intent/slotmodel

UtterancesHotel booking

City New York City

Check in November 30

Check out December 2

“Your hotel is booked for November 30.”

Amazon Polly Confirmation: “Your hotel is booked for November 30.”

“Can I go aheadwith the booking?”

a

in

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Put AI to work for your business

Modernize your contact center to improve customer service

conversational chat bots | call transcription | intelligent routing | sentiment analysis | VoC analytics text-to speech | multilingual omni-channel communication

P O L L Y T R A N S C R I B E T R A N S L A T E C O M P R E H E N D L E X

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Amazon Connect

Real time andhistorical analytics

Skills-based routing[Automatic Call Distribution (ACD)]

Callrecording

High-qualityvoice capability

Easy to use, cloud-based contact center solution that scales to support businesses of any size

With tools that grow with your needs

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Improving Contact Centers With Artificial Intelligence

A m a z o n L e x

A m a z o n T r a n s c r i b e

A m a z o n C o m p r e h e n d

T R A N S C R I P T

A m a z o n C o n n e c t

Analytics

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Improving Contact Centers With Artificial Intelligence

A m a z o n L e x

A m a z o n T r a n s c r i b e

A m a z o n C o m p r e h e n d

T R A N S C R I P T

A m a z o n C o n n e c t

Analytics

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Connect Demo+1-979-335-5593

+1-979-335-5593

Next session for Boaz Ziniman is…

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Babel fish

"The Babel fish is small, yellow, leech-like - and probably the oddest thing in the universe. It

feeds on brain wave energy, absorbing all unconscious frequencies and then excreting

telepathically a matrix formed from the conscious frequencies and nerve signals picked up from

the speech centres of the brain, the practical upshot of which is that if you stick one in your ear,

you can instantly understand anything said to you in any form of language: the speech you hear

decodes the brain wave matrix."

The Hitchhiker's Guide to the Galaxy, Douglas Adams

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Build your own Babel fish

Person Speaks in English

Transcribe voice to text in English

Translate text to German

Speak in German

SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.

chat.boaz.cloud

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Babel Chatchat.boaz.cloud

{"room":"/","message":{"user":"Boaz","text":"I'm doing great","timestamp":1540978160578

},"clientId":"07958771059731382"

}

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Put AI to work for your business

Reduce localization costs & improve accuracy

custom vocabulary | timestamp generation | secure real-time translation | language identification

P O L L Y T R A N S C R I B E T R A N S L A T E C O M P R E H E N D

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Scaling real-time translationUsing Amazon Translate, Lionbridge is able to scale machine translation in order to localize content faster and in more languages. Using Translate, Lionbridge was able to reduce translation costs by 20 percent.

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Put AI to work for your business

Understand the voice of your customer

sentiment analysis | app localization | translation services | transcription services | cataloging media | accessibility

T R A N S C R I B E T R A N S L A T E C O M P R E H E N DR E K O G N I T I O NI M A G E

R E K O G N I T I O NV I D E O

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT

Targeted customer acquisitionVidMob uses Amazon Rekognition and Amazon Transcribe for metadata extraction and sentiment analysis, to help marketers understand which videos resonate with audiences. This allows marketers to promote targeted content to acquire new customers.

Thank you!

SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.

Boaz ZinimanTechnical Evangelist, Amazon Web Service

@ziniman