Selected topics from

4
elected topics from 40 years of research on speech and speaker recognition Sadaoki Furui Tokyo Institute of Technology Department of Computer Science [email protected]

description

Selected topics from. 40 years of research on speech and speaker recognition. Sadaoki Furui. Tokyo Institute of Technology Department of Computer Science [email protected]. Generations of ASR technology. 1950. 1960. 1970. 1980. 1990. 2000. 2010. 1952. 1968. 1G. - PowerPoint PPT Presentation

Transcript of Selected topics from

Page 1: Selected topics from

Selected topics from 40 years of research on

speech and speaker recognition

Sadaoki Furui Tokyo Institute of Technology Department of Computer Science [email protected]

Page 2: Selected topics from

Generations of ASR technology

1950 1960 1970 1980 1990 2000 2010

1952 1968 1G Heuristic approaches

(analog filter bank + logic circuits)

1968 2G 1980

Pattern matching (LPC, FFT, DTW)

1980 3G 1990 Statistical framework

(HMM, n-gram, neural net)

1990 3.5G Discriminative approaches, robust training,

Prehistory normalization, adaptation, spontaneous speech, rich transcription

? 4G Extended knowledge processing

Our research NTT Labs (+Bell Labs), Tokyo Tech Collaboration with other labs

Page 3: Selected topics from

Japanese traditional cuisine “Kaiseki-ryori”

Page 4: Selected topics from

ATTENTION!

TRIAL LIMITATION - ONLY 3 SELECTED PAGES MAY BE CONVERTED PER CONVERSION.

PURCHASING A LICENSE REMOVES THIS LIMITATION. TO DO SO, PLEASE CLICK ON THE FOLLOWING LINK:

https://www.pdfconverter.com/purchase/