Breaking language barriers with AI - Berlin Summit Marketing... · Bill payments Service updates...
Transcript of Breaking language barriers with AI - Berlin Summit Marketing... · Bill payments Service updates...
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Breaking Language Barriers with AI
Boaz ZinimanTechnical EvangelistAmazon Web Service
@Ziniman ziniman
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Natural language processing (NLP)• Automatic Speech Recognition (ASR)• Natural Language Understanding (NLU)• Text to Speech• Translation
1970 1980 1990 2000 2010 2020
HUMAN ACCURACY
50% 55%60% 62%
70%
95%
Source: MindMeld
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Common Language Use Cases
Information Bots
Education
Accessibility
Knowledge Management Voice of Customer Applications
Customer Service/Call Centers
Enterprise Digital Assistant
Semantic Search
Captioning Workflows
LocalizationPersonalization
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
M L F R A M E W O R K S &
I N F R A S T R U C T U R E
The Amazon ML Stack: Broadest & Deepest Set of Capabilities
A I S E R V I C E SR E K O G N I T I O N
I M A G EP O L L Y T R A N S C R I B E T R A N S L A T E C O M P R E H E N D
& C O M P R E H E N D M E D I C A L
L E XR E K O G N I T I O NV I D E O
Vision Speech Chatbots
A M A Z O N S A G E M A K E R
B U I L D T R A I N
F O R E C A S TT E X T R A C T P E R S O N A L I Z E
D E P L O Y
Pre-built algorithms & notebooks
Data labeling (G R O U N D T R U T H )
One-cl ick model training & tuning
Optimization (N E O )
One-cl ick deployment & hostingM L S E R V I C E S
F r a m e w o r k s I n t e r f a c e s I n f r a s t r u c t u r e
E C 2 P 3 & P 3 d n
E C 2 C 5 F P G A s G R E E N G R A S S E L A S T I CI N F E R E N C E
Reinforcement learningAlgorithms & models ( A W S M A R K E T P L A C E F O R M A C H I N E L E A R N I N G )
Language Forecasting Recommendations
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
AI Services
Pre-trained AI services that require no ML skills or training
Easily add intelligence to your existing apps and workflows
Quality and accuracy from continuously-learning APIs
A I S E R V I C E SR E K O G N I T I O N
I M A G EP O L L Y T R A N S C R I B E T R A N S L A T E C O M P R E H E N D
& C O M P R E H E N D M E D I C A L
L E XR E K O G N I T I O NV I D E O
Vision Speech Chatbots
F O R E C A S TT E X T R A C T P E R S O N A L I Z E
Language Forecasting Recommendations
SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Turn text into lifelike speech using deep learning
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Amazon Polly – Use Cases
Contact Centers
Special NeedsAI Assistant
Voiced videosand presentations
Languagelearning
Amazon Polly
Navigation
Podcasting,Voiced blogs
and news articles
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
“Today in Seattle, WA it’s 11°F”
Amazon Polly: Text In, Life-like Speech Out
52 voices across 25 languages
“Today in Seattle Washington it’s 11 degrees Fahrenheit”
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Amazon Polly: Text In, Life-like Speech Out
52 voices across 25 languages
“Today in Mumbai, India it’s 32°C”
“Today in Mumbai, India it’s 32 degrees Celcius”
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Amazon Polly: Text In, Life-like Speech Out
52 voices across 25 languages
“Esta é a minha 2ª visita ao WebSummit
Lisboa”
“Esta é a minha segundavisita ao WebSummit
Lisboa”
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
A Focus On Voice Quality & Pronunciation1. Automatic, Accurate Text Processing
“Today in Seattle, WA, it’s 11°F”
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
A Focus On Voice Quality & Pronunciation1. Automatic, Accurate Text Processing2. Intelligible and Easy to Understand
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
A Focus On Voice Quality & Pronunciation1. Automatic, Accurate Text Processing2. Intelligible and Easy to Understand3. Add Semantic Meaning to Text
SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
A Focus On Voice Quality & Pronunciation
https://www.w3.org/TR/speech-synthesis/
<speak>
The spelling of my name is
<prosody rate='x-slow'>
<say-as interpret-as="characters">Boaz</say-as>
</prosody>
</speak>
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
A Focus On Voice Quality & Pronunciation
1. Automatic, Accurate Text Processing2. Intelligible and Easy to Understand3. Add Semantic Meaning to Text
“Richard’s number is 2122341237“
<say-as interpret-as="telephone">
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Voice Modification: Vocal Tract Length<speak>
This is Brian without any voice modifications.
<amazon:effect vocal-tract-length="+15%"> Imagine now that I got bigger… </amazon:effect>
<amazon:effect vocal-tract-length="+25%"> Suppose that I got even bigger still… </amazon:effect>
Now let's go back and hear the effect when I go in the opposite direction.
<amazon:effect vocal-tract-length="-15%"> Can you tell that I'm getting smaller? </amazon:effect>
<amazon:effect vocal-tract-length="-25%"> Now I'm even smaller than before. </amazon:effect>
</speak>
“This is a new technology that can give usersmore choice and better accessibility to our
content, so we wanted to create an experiment to dive deeper into the user
experience. After a month, we’ll take whatwe’ve learned about how users engage withthis feature to develop our first iteration of a
product with Amazon Polly.“
Joseph Price, Product ManagerThe Washington Post
https://www.washingtonpost.com/pr/wp/2017/06/09/the-washington-post-to-start-
experimenting-with-audio-articles-using-amazon-polly
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
AI Radio Announcer
• FM Wakayama: Japanese radio station
• Amazon Polly Japanese voices are used for
regularly scheduled broadcasts throughout
the week
• Emergency broadcast during 12/09 Typhoon
SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Polly API exampleaws polly synthesize-speech \--output-format mp3 --voice-id Matthew --text-type ssml \--text '<speak>
<amazon:auto-breaths><prosody rate="x-slow" pitch="low">Here is my little secret.</prosody><amazon:breath duration="long" volume="x-loud"/><amazon:effect name="whispered">
<prosody rate="x-slow"><prosody pitch="x-low">I</prosody>killed Mufasa!
</prosody></amazon:effect>
</amazon:auto-breaths></speak>' \mufasa.mp3
SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Automatic speech recognition
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Automatic speech recognition service
“Hello, this is Allan speaking”
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Amazon Transcribe – Key Features
Channel Identification
Custom vocabulary
Speaker Identification
Word-level time stamps
Punctuation and capitalization
Word-level confidence scores
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
ringDNARingDNA is an end-to-end communications platform for sales teams. Hundreds of enterprise organizations use RingDNA to increase productivity, engage in smarter sales conversations, gain predictive sales insights and improve their win rate.
Speech to Text"A critical component of RingDNA’s Conversation AI requires best of breed speech-to-text to deliver transcriptions of every phone call. RingDNA is excited about Amazon Transcribe since it provides high-quality speech recognition at scale, helping us to better transcribe every call to text"
Howard Brown, CEO & Founder, RingDNAhttps://www.youtube.com/watch?v=1ZJ_f1bDdog
SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Natural and accurate language translation
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
21 Languages417 Combinations
Key Features
Real-time < 500ms / sentence on average< 150ms / conversational / short form
Tag HandlingXML tags placement maintains styling and formatting through translation
< / >
Data SecurityData ownershipEncryptionAccess Management
Ease of UseSimple API calls and partner solutions
$15/1M charactersOr $0.000075 per word;Pay as you go, 2M characters monthly free tier
HIPAA Eligible
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Amazon Translate Natural and fluent language translation
“Hello, what’s up? Do you want to go see a
movie tonight?”
"Olá, quer ir ver um filmehoje à noite?"
SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Translate API example
boazz: ~$ aws translate translate-text \
--text "Hello, what’s up? Do you want to go see a movie tonight?" \
--source-language-code auto --target-language-code pt
{
"TargetLanguageCode": "pt",
"TranslatedText": "Olá, quer ir ver um filme hoje à noite?",
"SourceLanguageCode": "en"
}
SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Translate API exampleimport boto3translate = boto3.client("translate")lang_flag_pairs = [("fr", "!"), ("de", """),
("es", "#"), ("pt", "$"),("zh", "%"), ("ar", "&"),("ja", "'"), ("ru", "("),("it", ")"), ("zh-TW", "*"),("tr", "+"), ("cs", ",")]
for lang, flag in lang_flag_pairs:print(flag)print(translate.translate_text(
Text="Hello, World",SourceLanguageCode="en",TargetLanguageCode=lang
)['TranslatedText'])
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Translate API example!
Bonjour, Monde"
Hallo, Welt#
Hola, Mundo$
Olá, Mundo%
�� ��
&
ملاعلا ،ابحرم
'
�����
(
Привет, Мир)
Ciao, Mondo*
���,��+
Merhaba, Dünya.,
Ahoj, světe.
https://github.com/ziniman/aws-translate-demo
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Hotels.com
Matt Fryer, VP and Chief Data Science Officer, Hotels.com
Machine Translation
At Hotels.com, we are committed to offering all of our customers the most relevant and up to date information about their destination. To achieve that, we operate 90 localized websites in 41 languages. We have more than 25M customer reviews and more are coming in every day, making a great candidate for machine translation. Having evaluated Amazon Translate and several other solutions, we believe that Amazon Translate presents a quick, efficient and most importantly, accurate solution.
SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
https://github.com/ziniman/aws-translate-demo
SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Discover insights and relationships in text
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Amazon ComprehendD i s c o v e r i n s i g h t s a n d r e l a t i o n s h i p s i n t e x t
Entities
Key Phrases
Language
Sentiment
Syntax
Grouping
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Topic modeling
STORM
WORLD SERIES
AUSTRALIASTOCK MARKET
WASHINGTON
HEALTH
CRISIS MACHINE LEARNING
LIBRARY OFNEWS ARTICLES *
SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Accurately extract health information from patient notes, clinical trial reports, and other electronic health records using Amazon Comprehend
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Amazon Comprehend Medical
Entities
Medication
Medical condition
Test, treatments, and procedures anatomy
Protected Health Information (PHI)
Relationship extraction
Medication
Test, treatments, and procedures
Entity traits
Negation
Diagnosis signs and symptom
SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Run Amazon Comprehend on S3 Bucketimport boto3
import json
s3 = boto3.resource('s3’)
bucket_name = ‘my_bucket’
region_name = ‘us-east-1’
bucket = s3.Bucket(bucket_name)
comprehend = boto3.client(service_name='comprehend', region_name=region)
for obj in bucket.objects.all():
body = obj.get()['Body'].read()
text = body
sentiment_response = comprehend.detect_sentiment(Text=text, LanguageCode='en’)
print(json.dumps(sentiment_response, sort_keys=True, indent=4))
SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
https://github.com/ziniman/aws-comprehend-demo
SUMM IT © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Conversational interfaces for your applications powered by the same deep learning technologies as Alexa
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Advent of conversational interactions
2nd gen: Pointers & sliders
3nd gen: Conversational
interfaces
1st gen: Punch cards & memory
registers
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Amazon Lex – use cases
CONTACT CENTER BOTS
Customer service IVR
Account inquiriesBill payments
Service updates
Single Sign OnUsers / Roles
GroupsAuditing / MonitoringRisk & Compliancy
Insights
SECURITYINFORMATIONAL BOTS
Answer questions
News updatesWeather information
Game scores
APPLICATION BOTS
Conversational interfaces
Book ticketsOrder food
Manage bank accounts
Single Sign OnUsers / Roles
GroupsAuditing / MonitoringRisk & Compliancy
Insights
SECURITYPRODUCTIVITY BOTS
Enterprise efficiencies
Check sales numbersInventory statusExpense reports
IoT BOTS
Device interactions
KiosksAppliances
Auto
A service for building conversational interfaces into your applications using voice and text
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Lex Use Case: Digital Assistant to Book a HotelBook hotel
NYC
“Book a hotel in NYC”
Automatic speech recognition
Hotel booking
New York City
Natural language understanding
Intent/slotmodel
UtterancesHotel booking
City New York City
Check in November 30
Check out December 2
“Your hotel is booked for November 30.”
Amazon Polly Confirmation: “Your hotel is booked for November 30.”
“Can I go aheadwith the booking?”
a
in
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Put AI to work for your business
Modernize your contact center to improve customer service
conversational chat bots | call transcription | intelligent routing | sentiment analysis | VoC analytics text-to speech | multilingual omni-channel communication
P O L L Y T R A N S C R I B E T R A N S L A T E C O M P R E H E N D L E X
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Amazon Connect
Real time andhistorical analytics
Skills-based routing[Automatic Call Distribution (ACD)]
Callrecording
High-qualityvoice capability
Easy to use, cloud-based contact center solution that scales to support businesses of any size
With tools that grow with your needs
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Improving Contact Centers With Artificial Intelligence
A m a z o n L e x
A m a z o n T r a n s c r i b e
A m a z o n C o m p r e h e n d
T R A N S C R I P T
A m a z o n C o n n e c t
Analytics
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Improving Contact Centers With Artificial Intelligence
A m a z o n L e x
A m a z o n T r a n s c r i b e
A m a z o n C o m p r e h e n d
T R A N S C R I P T
A m a z o n C o n n e c t
Analytics
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Connect Demo+1-979-335-5593
+1-979-335-5593
Next session for Boaz Ziniman is…
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Babel fish
"The Babel fish is small, yellow, leech-like - and probably the oddest thing in the universe. It
feeds on brain wave energy, absorbing all unconscious frequencies and then excreting
telepathically a matrix formed from the conscious frequencies and nerve signals picked up from
the speech centres of the brain, the practical upshot of which is that if you stick one in your ear,
you can instantly understand anything said to you in any form of language: the speech you hear
decodes the brain wave matrix."
The Hitchhiker's Guide to the Galaxy, Douglas Adams
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Build your own Babel fish
Person Speaks in English
Transcribe voice to text in English
Translate text to German
Speak in German
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Babel Chatchat.boaz.cloud
{"room":"/","message":{"user":"Boaz","text":"I'm doing great","timestamp":1540978160578
},"clientId":"07958771059731382"
}
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Put AI to work for your business
Reduce localization costs & improve accuracy
custom vocabulary | timestamp generation | secure real-time translation | language identification
P O L L Y T R A N S C R I B E T R A N S L A T E C O M P R E H E N D
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Scaling real-time translationUsing Amazon Translate, Lionbridge is able to scale machine translation in order to localize content faster and in more languages. Using Translate, Lionbridge was able to reduce translation costs by 20 percent.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Put AI to work for your business
Understand the voice of your customer
sentiment analysis | app localization | translation services | transcription services | cataloging media | accessibility
T R A N S C R I B E T R A N S L A T E C O M P R E H E N DR E K O G N I T I O NI M A G E
R E K O G N I T I O NV I D E O
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.SUMM IT
Targeted customer acquisitionVidMob uses Amazon Rekognition and Amazon Transcribe for metadata extraction and sentiment analysis, to help marketers understand which videos resonate with audiences. This allows marketers to promote targeted content to acquire new customers.