83150286 Voice Browser

30
WELCOME

Transcript of 83150286 Voice Browser

Page 1: 83150286 Voice Browser

WELCOME

Page 2: 83150286 Voice Browser

Voice Browser

Presented BySharmin Sirajudeen

S7 CSReg No : 07412017

Page 3: 83150286 Voice Browser

What is a Voice Browser?

A voice browser is a device :

that interprets voice input and interprets voice markup languages to generate voice output.

that interprets a script which specifies exactly what to verbally present to the user as well as when to present each piece of information.

Page 4: 83150286 Voice Browser

MotivationThere are 10 times as many telephones as

connected PCs.Cell phones usage is growing dramatically. Speaking and listening are the natural usage

modes for modes.

Page 5: 83150286 Voice Browser

OverviewTime frame: 1998 to ??Hands-free accessing of web.Pragmatic interface for functionally blind

users.

Page 6: 83150286 Voice Browser

Key TechnologiesSpeech Recognition Speech Synthesis

Page 7: 83150286 Voice Browser

Speech Recognition

Voice input VoXML file Text

Page 8: 83150286 Voice Browser

Speech Synthesis

Text VoXML file Output(Pre-recorded)

Page 9: 83150286 Voice Browser

Standardization World Wide Web Consortium(W3C)

Voice Browser Working GroupSpeech Interface Framework

Page 10: 83150286 Voice Browser

W3C Voice Browser Working Group

Established on 26 March 1999.Re-chartered through 31 January 2009.W3C Team Contacts are Kazuyuki Ashimura

and Matt Womer.Co-chaired by Jim Larson and Scott

McGlashan .

Page 11: 83150286 Voice Browser

Speech Interface FrameworkVoiceXML 1.0VoiceXML 2.0VoiceXML 2.1Voice XML 3.0Speech Recognition Grammar Specification (SRGS) 1.0Speech Synthesis Markup Language (SSML) 1.0Speech Synthesis Markup Language (SSML) 1.1Call Control XML (CCXML)State Chart XML (SCXML)Semantic Interpretation (SISR) 1.0Pronunciation Lexicon Specification (PLS) 1.0

Page 12: 83150286 Voice Browser

Voice XML(VoXML)Version 1.0 - designed for creating audio dialogs

.Version 2.0 - uses form interpretation

algorithm(FIA).Version 2.1 - 8 additional elements.Version 3.0 - relationship between

semantics and (31 August 2010) syntax.

Page 13: 83150286 Voice Browser

What about HTML ?HTML don’t have

Tampered promptsGrammar specifying alternative words that the

user can speak in response to the question.Instructions to the text-to-speech synthesizer

about how to say words and phrases.

Adding these capabilities would complicate HTML,a language developed just for visual UI.

Page 14: 83150286 Voice Browser
Page 15: 83150286 Voice Browser

Speech Recognition Grammar Specification(SRGS)Version 1.0 -for specifying grammars of each

user input to a speech application.

Page 16: 83150286 Voice Browser

Speech Synthesis Markup Language(SSML)Version 1.0 -for specifying the rendering of

synthesized speech to the user.Version 1.1 - enhancement of SSML 1.0 for

better support of the world's languages including Asian, Eastern European, and Middle Eastern languages.

Page 17: 83150286 Voice Browser

Call Control XML(CCXML) For specifying call control functions

State Chart XML(SCXML) Execution environment based on CCXML

and Harel State Tables.

Page 18: 83150286 Voice Browser

Semantic Interpretation Speech Recognizer(SISR) Version 1.0 - For specifying possible

translation of text from the output of a speech recognizer.

Pronunciation Lexicon Specification (PIS)

Version 1.0 - Syntax for specifying pronunciation lexicons to be used by Speech Recognition and Speech Synthesis.

Page 19: 83150286 Voice Browser
Page 20: 83150286 Voice Browser

Model Architecture

Page 21: 83150286 Voice Browser

ApplicationsIt can be divided into three categories :

Web BrowsingLimited information AccessSpoken Dialog Systems

Page 22: 83150286 Voice Browser

Web BrowsingBrowse any web pages using speech input.Parsing for the purpose of voice recognition

done when the page is accessed.May or may not produce a voice feed back.

Page 23: 83150286 Voice Browser

Limited Information AccessUseful information in limited domains like

weather in a city, checking stock updates etc.Audio feed back

Page 24: 83150286 Voice Browser

Spoken Dialog SystemsClient-server architecture is usedUsed for connecting to a remote server by a

Java applet(client).Examples are connecting to email servers

Page 25: 83150286 Voice Browser

BenefitsVoice is a very natural user interface which

speeds up browsing.Less space requirements.Portable voice browsers can also be

implemented.Practical interface for functionally blind

users.Users can browse web while keeping there

hands and eyes for other jobs

Page 26: 83150286 Voice Browser

FutureVoice browsing will become visual(Multi-

modal)Can be integrated to an OSIntegrated to every application.

Page 27: 83150286 Voice Browser

ConclusionsBrowser technology is changing very fast

these days and we are moving from the visual paradigm to the voice paradigm.

Voice browser is the technology to enter this paradigm.

Voice browser is a device which interpret voice input and generate voice output.

Page 28: 83150286 Voice Browser

Referenceshttp://www.w3.org/standards/webofdevices/

voicehttp://xml.coverpages.org/ccxml.htmlhttp://reactos.ccp14.ac.uk/Voice/http://www.w3.org/Voice/1998/Workshop/

PhilJenkins.html (for IBM)

Page 29: 83150286 Voice Browser

THANK YOU

Page 30: 83150286 Voice Browser

?