Transforming Contact Centers with Speech and IP

Jack Chase, Director of Product Management, NMS
Rob Kassel, Senior Manager, Network Speech Products, Nuance

Agenda
The Evolution of Contact Centers
  Business trends
  Architectures
Speech Technology Update (Rob Kassel, Nuance)
MRCP-enabled speech

www.nmscommunications.com

Contact Center Evolution

Evolution of Contact Centers: Business Trends

First Generation: Hardware-based Cost Center
  Stand-alone sites
  Limited PBX routing
  Customer talks into phone; agent types into computer

Second Generation: Integration and Technology
  Single and distributed sites
  Some use of IVRU and ACD
  Screen pops
  Some call routing via ACD

Third Generation: Solving Business Problems: Profit Center
  Virtual Call Center
  IVRU & ACD integration
  Multi-media access: email, fax, web
  Integrated ERP/CRM
  Skills-based routing

The Obvious Cost Savings Target

Agent Costs: 66%
Telecom Costs: 15%
Outsourced Calls: 7%
Technology

Source: Benchmark Portal, 2002

The Cost of Customer Interaction is Reduced with Self Service
[Chart: cost per customer interaction, Assisted Service versus Self-Service. Surviving channel labels: Chat, Phone. Surviving data values: $0.24, $0.45, $5.50, $7.00.]
Source: Gartner Group, 2002
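The arithmetic behind the self-service argument can be sketched as follows. The chart's pairing of values to channels did not survive, so the inputs here are assumptions: $5.50 per assisted call and $0.45 per self-service call, with a hypothetical volume of one million calls and a hypothetical 20% shift to self-service. The function name is illustrative only.

```python
def self_service_savings(calls: int, shifted_share: float,
                         assisted_cents: int, self_service_cents: int) -> float:
    """Dollars saved when a share of calls moves from agents to self-service.

    Costs are passed in cents so the per-call difference stays an exact integer.
    """
    shifted_calls = round(calls * shifted_share)
    saved_cents = shifted_calls * (assisted_cents - self_service_cents)
    return saved_cents / 100


# Assumed inputs: 1,000,000 calls, 20% shifted, $5.50 assisted, $0.45 self-service.
savings = self_service_savings(1_000_000, 0.20, 550, 45)
print(f"${savings:,.2f}")
```

Under these assumed inputs every shifted call saves $5.05, so the result scales linearly with both call volume and the shifted share.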

Evolution of Contact Centers: Technology Trends
Self-service using web, ASR and TTS is reducing dependency on live agents and their costs
Web, email, and messaging are freely mixed with phone calls in a single queue
Network-based contact centers are becoming a significant phenomenon
VoIP is lowering system costs at the agent and between system components
By 2007, 30% of contact center agents will be on VoIP

Circuit-Based Contact Center
[Diagram. Legend: Circuit, Data]

VoIP in an IP Contact Center
[Diagram. Components: Site A, Site B, IP-PBX, Contact Center (ACD + CTI + IVR + Speech), Self-Service, Operations Center. Legend: Circuit, Data, VoIP]

Upgrading with MRCP and VXML
[Diagram. Components: Site A, Site B, IP-PBX, Operations Center, Media Server, Application Server, VXML Server, Speech Server. Signaling: SIP, CCXML. Legend: Circuit, Data, VoIP]

Speech Technology Update

Rob Kassel, Senior Manager, Network Speech Products, Nuance

www.nuance.com

The Need For Speech Recognition
DTMF is often used for customer self-service
  Numeric entry is easy… unless you are reading
  Spelling entry is more difficult
  Menus need to be enumerated and can’t be too long
  Deep menu structures become tiresome
  Key assignments are inconsistent between vendors (e.g., voicemail)
  How do you enter “5 ½%” or “Albuquerque”?
With speech, questions are answered naturally
  Caller satisfaction is higher
  Fewer zero-outs lead to additional cost savings
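A small sketch of why spelling over DTMF is awkward: each key covers three or four letters, so a one-key-per-letter entry is ambiguous. The code assumes the standard telephone keypad letter layout; the function name is illustrative.

```python
# Standard telephone keypad letter groups.
KEYPAD = {
    "2": "abc", "3": "def", "4": "ghi", "5": "jkl",
    "6": "mno", "7": "pqrs", "8": "tuv", "9": "wxyz",
}
LETTER_TO_DIGIT = {ch: digit for digit, letters in KEYPAD.items() for ch in letters}


def to_dtmf(word: str) -> str:
    """Return the one-key-per-letter digit string for a word (non-letters are skipped)."""
    return "".join(LETTER_TO_DIGIT[ch] for ch in word.lower() if ch in LETTER_TO_DIGIT)


print(to_dtmf("Albuquerque"))           # eleven key presses for one city name
print(to_dtmf("home"), to_dtmf("good"))  # two different words, same key sequence
```

Different words collapsing onto one digit string is exactly the case where a DTMF application needs an extra disambiguation menu and a speech application does not.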


Speech Recognition Process
[Diagram. Signal path components: Speech, Speech Detector, Feature Extraction, Phoneme Classifier (with Acoustic Models), Search, Confidence Scoring, Results. Grammar path components: Grammar, Grammar Compiler, System Dictionary, Pronunciation Rules]


Speech Recognition Challenges
Processor and memory demands
Speech can be difficult to decode, even for humans
  Fixed, confusable vocabularies: “B-C-D-E-G-P-T-V-Z”
  Ambiguous boundaries: “It’s hard to wreck a nice beach!”
Speaker variability: dialect, volume, gender, etc.
Noise rejection: hands-free, mobile, telematics
Out-of-vocabulary rejection & confidence measures
Callers don’t always say what you might expect…
  Yes or no?
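One common way an application uses confidence measures is a three-way decision: accept, confirm, or reprompt. The sketch below is an illustration only; the 0.0 to 1.0 score scale, the two thresholds, and all names are assumptions, not values from any particular recognizer.

```python
from dataclasses import dataclass


@dataclass
class RecognitionResult:
    text: str
    confidence: float  # assumed scale: 0.0 (no confidence) to 1.0 (certain)


# Hypothetical thresholds; real deployments tune these per grammar.
ACCEPT_THRESHOLD = 0.80
REJECT_THRESHOLD = 0.40


def next_action(result: RecognitionResult) -> str:
    """Decide what the dialog should do with a recognition result."""
    if result.confidence >= ACCEPT_THRESHOLD:
        return "accept"    # use the answer without bothering the caller
    if result.confidence >= REJECT_THRESHOLD:
        return "confirm"   # ask "Did you say ...?"
    return "reprompt"      # treat as out-of-vocabulary or noise


for result in (RecognitionResult("yes", 0.93),
               RecognitionResult("yeah I guess", 0.55),
               RecognitionResult("(cough)", 0.12)):
    print(result.text, "->", next_action(result))
```

Raising the accept threshold trades more confirmations for fewer silent errors.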


Speech Recognition: State of the Art
Callers speak naturally in directed dialogs
High accuracy, infrequent confirmation
Million-word vocabularies: stocks, proper names, street addresses
Scripting to control values returned to application: “half past three” can return “1530” or “afternoon”
Open-ended responses, especially for call routing
  Allows for questions like “How may I help you?”
  Based on statistical methods trained from examples
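The “half past three” example can be sketched as a small interpretation script that turns recognized words into the value the application wants. This is an illustration, not a vendor scripting API: the phrase table, the function names, and the default of reading a bare hour as PM are all assumptions.

```python
from typing import Tuple

HOURS = {"one": 1, "two": 2, "three": 3, "four": 4, "five": 5, "six": 6,
         "seven": 7, "eight": 8, "nine": 9, "ten": 10, "eleven": 11, "twelve": 12}
# Supported lead-in phrases; the empty string covers a bare hour such as "three".
MINUTE_PHRASES = {"": 0, "quarter past": 15, "half past": 30}


def parse_time(utterance: str, assume_pm: bool = True) -> Tuple[int, int]:
    """Map e.g. 'half past three' to (hour, minute) on a 24-hour clock."""
    *prefix_words, hour_word = utterance.lower().split()
    prefix = " ".join(prefix_words)
    if hour_word not in HOURS or prefix not in MINUTE_PHRASES:
        raise ValueError(f"unsupported time phrase: {utterance!r}")
    hour = HOURS[hour_word] % 12          # "twelve" becomes 0 before the PM shift
    if assume_pm:
        hour += 12
    return hour, MINUTE_PHRASES[prefix]


def as_clock(utterance: str) -> str:
    hour, minute = parse_time(utterance)
    return f"{hour:02d}{minute:02d}"


def as_day_part(utterance: str) -> str:
    hour, _ = parse_time(utterance)
    if hour < 12:
        return "morning"
    return "afternoon" if hour < 18 else "evening"


print(as_clock("half past three"), as_day_part("half past three"))
```

The same utterance yields a different return value depending on which function the application binds to the grammar, which is the point of the slide.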


The Need For Text-To-Speech
Professional recordings are best for fixed content
Word concatenation is difficult to do well
  Often used for numeric output
  Can sound mechanical; irritating when frequent
Large output vocabularies are fairly common (e.g., city names)
Some applications defy recordings (e.g., messaging)


TTS Text Analysis
[Diagram. Stages: Source Text, Text Normalization, Homograph Disambiguation, Pronunciation Generation (with System Dictionary and Pronunciation Rules), Prosody Generation, Annotated Text]

Text Normalization
  “Are you there?” → are + you + there + <question>
  $31 → thirty one dollars
  ATM → eh tee em
  NATO → nay-toh
  A.M. → eh em
  CUL8R → see you later

Homograph Disambiguation
  minute = 60 seconds; minute = tiny
  Dr. Jones → doctor jones
  Jones Dr. → jones drive
  11210 → eleven thousand two hundred ten (number)
  11210 → one one two one oh (ZIP code)

Prosody Generation
  Determine which words require emphasis
  Insert pauses based on phrase boundaries, lung capacity
  Assign duration, pitch, and volume to each phoneme
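A few of the text-analysis examples on this slide can be reproduced with a minimal sketch. It covers only whole-dollar amounts, numbers below one million, digit-by-digit reading, and a crude position rule for “Dr.”; every function name and rule here is an assumption made for illustration, and a production TTS front end is far more thorough.

```python
ONES = ["zero", "one", "two", "three", "four", "five", "six", "seven", "eight",
        "nine", "ten", "eleven", "twelve", "thirteen", "fourteen", "fifteen",
        "sixteen", "seventeen", "eighteen", "nineteen"]
TENS = ["", "", "twenty", "thirty", "forty", "fifty", "sixty", "seventy",
        "eighty", "ninety"]


def number_to_words(n: int) -> str:
    """Read an integer in 0..999999 as a quantity."""
    if not 0 <= n < 1_000_000:
        raise ValueError("only 0..999999 is supported in this sketch")
    if n < 20:
        return ONES[n]
    if n < 100:
        tens, ones = divmod(n, 10)
        return TENS[tens] + (" " + ONES[ones] if ones else "")
    if n < 1000:
        hundreds, rest = divmod(n, 100)
        return ONES[hundreds] + " hundred" + (" " + number_to_words(rest) if rest else "")
    thousands, rest = divmod(n, 1000)
    return number_to_words(thousands) + " thousand" + (" " + number_to_words(rest) if rest else "")


def read_digits(token: str) -> str:
    """Read a digit string one digit at a time, as for a ZIP code."""
    return " ".join("oh" if ch == "0" else ONES[int(ch)] for ch in token)


def expand_currency(token: str) -> str:
    """Expand a whole-dollar amount such as '$31'."""
    amount = int(token.lstrip("$"))
    return f"{number_to_words(amount)} {'dollar' if amount == 1 else 'dollars'}"


def expand_dr(text: str) -> str:
    """'Dr.' before a capitalized word is read as 'doctor', otherwise as 'drive'."""
    tokens = text.split()
    out = []
    for i, tok in enumerate(tokens):
        if tok == "Dr.":
            followed_by_name = i + 1 < len(tokens) and tokens[i + 1][:1].isupper()
            out.append("doctor" if followed_by_name else "drive")
        else:
            out.append(tok.lower())
    return " ".join(out)


print(expand_currency("$31"))
print(number_to_words(11210), "|", read_digits("11210"))
print(expand_dr("Dr. Jones"), "|", expand_dr("Jones Dr."))
```

Deciding whether “11210” is a quantity or a ZIP code still needs context from the application or the surrounding text; the sketch only shows the two readings.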


TTS Waveform Generation

Parametric
  [Diagram: Annotated Text, Parameter Generation, Vocal Tract Model, Speech]
  Can mimic natural speech if parameters are set by hand
  In practice sounds somewhat robotic, the “drunken Swede”
  Can produce a variety of voices
  Extremely compact

Concatenative
  [Diagram: Annotated Text, Unit Selection (with Voice Database), Concatenate and Smooth, Speech]
  Units can be smaller or larger than a phoneme
  Database tends to be very large
  Preserves speaker characteristics and speaking style of voice talent

[Audio samples on slide: FEMALE, FEMALE, CHILD]
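The Unit Selection step in the concatenative pipeline is often framed as a shortest-path search: pick one recorded unit per position so that the sum of target costs (how well a unit fits its slot) and join costs (how smoothly neighbours connect) is minimal. The sketch below shows that framing with dynamic programming. The slide does not say how any product implements it, and the cost functions and toy units here are invented for illustration.

```python
from typing import Callable, List, Sequence, TypeVar

U = TypeVar("U")


def select_units(candidates: Sequence[Sequence[U]],
                 target_cost: Callable[[int, U], float],
                 join_cost: Callable[[U, U], float]) -> List[U]:
    """Choose one unit per position, minimizing total target + join cost."""
    if not candidates or any(len(options) == 0 for options in candidates):
        raise ValueError("every position needs at least one candidate unit")
    # best holds, for each candidate at the current position,
    # (cost of the cheapest path ending in that candidate, the path itself).
    best = [(target_cost(0, unit), [unit]) for unit in candidates[0]]
    for position in range(1, len(candidates)):
        new_best = []
        for unit in candidates[position]:
            cost, path = min(
                ((prev_cost + join_cost(prev_path[-1], unit), prev_path)
                 for prev_cost, prev_path in best),
                key=lambda item: item[0],
            )
            new_best.append((cost + target_cost(position, unit), path + [unit]))
        best = new_best
    return min(best, key=lambda item: item[0])[1]


# Toy units: (phoneme label, pitch in Hz). Join cost is the pitch jump.
toy_candidates = [
    [("a", 100), ("a", 140)],
    [("b", 135), ("b", 90)],
    [("c", 130)],
]
chosen = select_units(toy_candidates,
                      target_cost=lambda position, unit: 0.0,
                      join_cost=lambda left, right: abs(left[1] - right[1]))
print(chosen)
```

A larger voice database gives the search more candidates per position, which is why database size and naturalness tend to rise together.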

Text-to-Speech: State of the Art
Naturalness of concatenative TTS is generally preferred for call center applications
  …but voice talent takes direction, more expressive
Custom voices to maintain brand identity
Use one voice talent for both recordings and TTS
  Seamlessly mix dynamic data with static prompts
  Apply prompt “patches” rapidly until cost of recording session can be justified


Designing Speech Applications
Observe & interview call center agents
Listen to calls, develop caller profiles
  Who are they?
  What do they know?
  Where are they calling from?
  What are their goals?
  What are their priorities?
Determine business objectives & rules
Define speech user interface
  Call flows
  Prompt wording
  Error recovery; help and instructions
  Anthropomorphism and persona


MRCP and Natural Access

What is MRCP v1?

Speech servers are connected by VoIP to IVR servers
Standard API for ASR and TTS
Easy to reconfigure system as needs change
Easy to implement redundancy

[Diagram. Components: PSTN, IVR Servers, IP, Speech Servers (MRCP Server)]
Control: MRCP / RTSP / TCP / IP
Speech: G.711 / RTP / UDP / IP
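To make the control path concrete, the sketch below builds and parses MRCP v1 text messages, following the request-line and status-line layout described in RFC 4463 (method, request ID, version for requests; version, request ID, status code, request state for responses). It is a minimal illustration with assumed helper names. It does not wrap the message in RTSP, open a connection, or cover event messages.

```python
from typing import Dict, Tuple

MRCP_V1 = "MRCP/1.0"


def build_mrcp_v1_request(method: str, request_id: int,
                          headers: Dict[str, str], body: str = "") -> bytes:
    """Serialize an MRCP v1 request; Content-Length is derived from the body."""
    body_bytes = body.encode("utf-8")
    lines = [f"{method} {request_id} {MRCP_V1}"]
    lines += [f"{name}: {value}" for name, value in headers.items()]
    if body_bytes:
        lines.append(f"Content-Length: {len(body_bytes)}")
    head = "\r\n".join(lines) + "\r\n\r\n"   # blank line separates headers from body
    return head.encode("utf-8") + body_bytes


def parse_mrcp_v1_status_line(line: str) -> Tuple[int, int, str]:
    """Parse e.g. 'MRCP/1.0 1 200 IN-PROGRESS' into (request_id, status, state)."""
    version, request_id, status_code, request_state = line.strip().split(" ")
    if version != MRCP_V1:
        raise ValueError(f"unexpected protocol version: {version}")
    return int(request_id), int(status_code), request_state


# Hypothetical TTS request: ask the speech server to speak a short prompt.
message = build_mrcp_v1_request("SPEAK", 1, {"Content-Type": "text/plain"}, "Hello")
print(message.decode("utf-8"))
print(parse_mrcp_v1_status_line("MRCP/1.0 1 200 IN-PROGRESS"))
```

In a deployment these control messages travel over the RTSP/TCP path named on the slide, while the audio itself flows separately as G.711 over RTP.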

Natural Access and MRCP
[Diagram: Natural Access software stack.
  Services: Call Control, IVR Services, PSTN Trunking, VoIP (Fusion), Conferencing, Fax Services, USAI (MRCP), Video Access
  Middle layer: Service Managers, Libraries
  Drivers: Driver, Driver, Driver, IPC
  Platforms: CX Boards (PCI), AG Boards (PCI), CG Boards (PCI), Packet Media HMP (IP)]

Universal Speech Access Makes Speech Integration Easy

Current Support for Universal Speech Access

Vendor            | Type | Universal Speech Access 1.0 | Universal Speech Access 1.1
Nuance            | ASR  | MRCP Server SP5, Nuance 8.5 | MRCP Server SP7, Nuance 8.5
Nuance (ScanSoft) | ASR  | OSMS 2.0.1, OSR 2.0         | SWMS 3.1, OSR 3.0
Nuance            | TTS  | Vocalizer 3.0               | Vocalizer 3.0.8
Nuance (ScanSoft) | TTS  | OSMS 2.0.1, Speechify 2.0   | SWMS 3.1, RealSpeak 4.0
Telisma           | ASR  | Philsoft 3.2                | teliSpeech 1.0 SP4
Loquendo          | ASR  | N/A                         | Loquendo ASR LSS 6.0

What’s Next for MRCP?
MRCP v2
  draft-ietf-speechsc-mrcpv2-06, Feb 20, 2005
  Adds SIP/SDP for session setup
    Replaces RTSP
  Adds support for speaker verification
  Little deployment yet
NMS will update USAI when deployments occur

Questions?

Contact Info:
jack_chase@nmss.com
rob.kassel@nuance.com
