ECE 598: The Speech Chain
description
Transcript of ECE 598: The Speech Chain
![Page 1: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/1.jpg)
ECE 598: The Speech ECE 598: The Speech ChainChain
Lecture 10: Auditory Lecture 10: Auditory PhysiologyPhysiology
![Page 2: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/2.jpg)
TodayToday Outer Ear: Sound LocalizationOuter Ear: Sound Localization Middle Ear: Impedance MatchingMiddle Ear: Impedance Matching Basilar Membrane: Frequency AnalysisBasilar Membrane: Frequency Analysis
Mechanical PrinciplesMechanical Principles Frequency Response of Auditory FiltersFrequency Response of Auditory Filters Nonlinearity of Basilar Membrane Nonlinearity of Basilar Membrane
ResponseResponse Mechano-Electric TransductionMechano-Electric Transduction
Inner and Outer Hair CellsInner and Outer Hair Cells Neuro-transmitter Uptake ModelsNeuro-transmitter Uptake Models Neural Activation ThresholdsNeural Activation Thresholds
![Page 3: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/3.jpg)
Auditory Anatomy: Auditory Anatomy: OverviewOverview
![Page 4: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/4.jpg)
Localization of Sound: Inter-Localization of Sound: Inter-Aural Time Delay (ITD)Aural Time Delay (ITD)
r
rcosr(/2-)
ITD = (r/c)(/2-+cos)
Wavefronts (lines of constant pressure)
Wave traveling direction
Diffusion of sound around the head
![Page 5: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/5.jpg)
Localization of Sound: Inter-Localization of Sound: Inter-Aural Amplitude DifferenceAural Amplitude Difference
f < c/4r ~ 1kHz: head << f < c/4r ~ 1kHz: head << =c/f; sound diffuses around head=c/f; sound diffuses around head f > c/2r ~ 2kHz: head > f > c/2r ~ 2kHz: head > , so sound is blocked by the head, so sound is blocked by the head
Low frequency, Long wavelength(shown: wave “troughs”, i.e., pressure minima)
High frequency, Short wavelength(shown: wave “troughs”, i.e., pressure minima)
Sound diffuses around the head; no shadow
Head shadow: Sound unable to diffuse around large obstacle
![Page 6: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/6.jpg)
Localization of Sound: Echoes Localization of Sound: Echoes from the Pinna and Shouldersfrom the Pinna and Shoulders
Direct Sound
Pinna Echo
Shoulder Echo
![Page 7: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/7.jpg)
Localization of Sound, Localization of Sound, Summary: Head-Related Summary: Head-Related
Transfer FunctionTransfer Function Source: Source:
s(t) = cos(s(t) = cos(t)t) Received at near ear:Received at near ear:
xxRR(t) = A(t) = ARR(() cos() cos(t+t+RR(()))) Received at far ear: Received at far ear:
xxLL(t) = A(t) = ALL(() cos() cos(t+t+LL(())ITDITD)) Near ear frequency response:Near ear frequency response:
HHRR(() = A) = ARR(()e)ejjRR(())
Far ear frequency response:Far ear frequency response:
HHLL(() = A) = ALL(()e)ej(j(LL(()-)-ITDITD))
![Page 8: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/8.jpg)
Middle Ear FunctionsMiddle Ear Functions Impedance Matching: Impedance Matching:
Sound transmission in water (inner ear) requires Sound transmission in water (inner ear) requires much higher pressure than sound transmission in air much higher pressure than sound transmission in air (outer ear).(outer ear).
Without middle ear, sound incident on oval window Without middle ear, sound incident on oval window would bounce away (would bounce away (=1) =1)
Middle ear reduces g so that not all sound is Middle ear reduces g so that not all sound is reflectedreflected
Reduce Exposure to Loud EnvironmentsReduce Exposure to Loud Environments Strap muscles loosen in loud environments, reducing Strap muscles loosen in loud environments, reducing
the amplitude of sound transmitted to inner earthe amplitude of sound transmitted to inner ear Effect is relatively slow (hundreds of milliseconds), Effect is relatively slow (hundreds of milliseconds),
so not useful for adaptation to rapid sounds so not useful for adaptation to rapid sounds (gunshots)(gunshots)
![Page 9: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/9.jpg)
Impedance Mis-Match Between Water Impedance Mis-Match Between Water and Air: Without a Middle Ear, What and Air: Without a Middle Ear, What
Would You Hear?Would You Hear?
Continuity of pressure at the boundary:Continuity of pressure at the boundary:
(p(pa+a++p+pa-a-) = (p) = (pw+w++p+pw-w-)) Continuity of volume velocity at the boundary:Continuity of volume velocity at the boundary:
(A/(A/aaccaa) (p) (pa+a+ppa-a-) = (A/) = (A/wwccww) (p) (pw+w+ppw-w-)) Densities: Densities: a a = 0.001 g/cc, = 0.001 g/cc, w w = 1 g/cc= 1 g/cc Speeds of Sound: cSpeeds of Sound: caa = 354m/s, c = 354m/s, cww=1000m/s=1000m/s Suppose pSuppose pw-w-=0, meaning that the only input sound is p=0, meaning that the only input sound is pa+a+
Then…Then…
The reflected sound is: pThe reflected sound is: pa-a- = ( = (wwccwwaaccaa)/()/(wwccww++aaccaa) p) pa+a+ = 0.9994 p = 0.9994 pa+a+
The transmitted sound is: pThe transmitted sound is: pw+w+ = 2 = 2aaccaa/(/(wwccww++aaccaa) p) pa+a+ = 0.0006 p = 0.0006 pa+a+
pa+ pa- pw-
pw+
Scala Vestibuli, contents: perilymph ≈ sodium water
Auditory Canal, contents: air
Oval Window
A
![Page 10: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/10.jpg)
Hammer-Anvil-Stirrup = Lever-Hammer-Anvil-Stirrup = Lever-Based Impedance Matching Based Impedance Matching
SystemSystem
Lever system reduces the effective input Lever system reduces the effective input impedance (z=p/v) of water by a factor of Limpedance (z=p/v) of water by a factor of L22
Resulting reflection coefficientResulting reflection coefficient
= (= (wwccwwLL22aaccaa)/()/(wwccww+L+L22aaccaa) ~ 0.98-0.99 < 1) ~ 0.98-0.99 < 1
L unitslength 1 unit length
Eardrum Velocity = (1/aca)(pa+pa-)
Pressure = (pa++pa-) Oval Window Velocity = (1/Laca)(pa+pa-)Pressure = L (pa++pa-)
![Page 11: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/11.jpg)
Acoustic Impedance of Ear Canal Acoustic Impedance of Ear Canal Informative About Middle Ear Informative About Middle Ear
FunctionFunction
Remember how to calculate impedance?Remember how to calculate impedance?1.1. Impose a Boundary Condition at Far EndImpose a Boundary Condition at Far End
ppa-a-==ppa+a+
2.2. Calculate z=p/v at Near EndCalculate z=p/v at Near Endz = p/v = z = p/v = c (pc (pa+a+ee-jkx-jkx+p+pa-a-eejkxjkx)/(p)/(pa+a+ee-jkx-jkx-p-pa-a-eejkxjkx))
= = c (ec (e2jkL 2jkL + + )/(e)/(e-2jkL -2jkL )) So by measuring the acoustic input impedance So by measuring the acoustic input impedance
of the auditory canal very precisely, it’s possible of the auditory canal very precisely, it’s possible to deduce to deduce at different frequencies, and thus to at different frequencies, and thus to learn something about health of the middle ear learn something about health of the middle ear (product: Mimosa Acoustics)(product: Mimosa Acoustics)
pa+ pa-
Outer Ear, contents: air
Eardrum
-L 0x
![Page 12: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/12.jpg)
Inner Ear AnatomyInner Ear Anatomy(image courtesy Alec Salt, Otolaryngology, Washington University)(image courtesy Alec Salt, Otolaryngology, Washington University)
![Page 13: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/13.jpg)
Inner Ear Anatomy: Charged FluidsInner Ear Anatomy: Charged Fluids(image courtesy of Alec Salt, Otolaryngology, Washington University)(image courtesy of Alec Salt, Otolaryngology, Washington University)
![Page 14: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/14.jpg)
Cross-Section of the Basilar Cross-Section of the Basilar MembraneMembrane
(image courtesy wikipedia)(image courtesy wikipedia)
![Page 15: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/15.jpg)
Frequency Selectivity of Places on Frequency Selectivity of Places on the Basilar Membranethe Basilar Membrane
Unroll
Basilar Membrane (separates scala media & scala tympani)
Base, x=0mm:High StiffnessLow Massfc = (k/m)1/2/2 ~ 16000Hz
Apex, x~3cm:Low Stiffness
High Massfc = (k/m)1/2/2
~ 40Hz
Oval Window
In between: Each position, x, is tuned to a different mechanical resonancex(fc) ~ 30mm – (11mm) ln(1 + 46fc/(fc+14700))
![Page 16: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/16.jpg)
Frequency Selectivity of Places on Frequency Selectivity of Places on the Basilar Membranethe Basilar Membrane
Wave pw+e-jx/c propagates forward at c=1000m/s until…
Wave energy is absorbed by oscillation of the basilar membrane at x(fc=/2)
Scala Vestibuli
Scala Tympani
Oval
Win
dow
Rou
nd
Win
dow
![Page 17: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/17.jpg)
Frequency Selectivity of Places on Frequency Selectivity of Places on the Basilar Membranethe Basilar Membrane
Traveling waves in the Traveling waves in the cochlea.cochlea.
““Concerning the Concerning the pleasures of pleasures of observing, and the observing, and the mechanics of the mechanics of the inner ear,” inner ear,”
Nobel Lecture, 1961, Nobel Lecture, 1961, Georg von Békésy Georg von Békésy
(courtesy of Pacific (courtesy of Pacific Biosciences Research Biosciences Research Center Hawaii)Center Hawaii)
![Page 18: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/18.jpg)
Bandwidth of the Auditory Filters: Bandwidth of the Auditory Filters: 100Hz at f100Hz at fcc<500Hz, 0.2f<500Hz, 0.2fcc at at
ffcc>500Hz>500Hz(image courtesy Julius Smith and Jonathan Abel, CCRMA, Stanford)(image courtesy Julius Smith and Jonathan Abel, CCRMA, Stanford)
Equivalent Equivalent RectangulRectangular ar Bandwidth Bandwidth (ERB) = (ERB) =
Bandwidth of Bandwidth of an ideal an ideal BPF that BPF that passes the passes the same total same total energy as energy as the basilar the basilar membrane membrane section at section at the same the same ffcc
![Page 19: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/19.jpg)
Velocity of Basilar Membrane Velocity of Basilar Membrane Causes Inner Hair Cell Follicles to Causes Inner Hair Cell Follicles to
BendBend
![Page 20: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/20.jpg)
Velocity of Basilar Membrane Velocity of Basilar Membrane Causes Inner Hair Cell Follicles to Causes Inner Hair Cell Follicles to
BendBend
![Page 21: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/21.jpg)
Bending of Follicles Causes Bending of Follicles Causes Depolarization of IHCDepolarization of IHC
Scala Media (+80mV)
Organ of Corti (0mV)
Cations enter through follicle tips when follicles bend
![Page 22: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/22.jpg)
Depolarization of IHC Causes Depolarization of IHC Causes Release of NeurotransmitterRelease of Neurotransmitter
Cations enter through follicle tips when follicles bend
Synapse, afferent neuron
Neurotransmitter released
![Page 23: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/23.jpg)
Neurotransmitter DynamicsNeurotransmitter Dynamics(Three-Store Model: Meddis, JASA 1986)(Three-Store Model: Meddis, JASA 1986)
Result: probability of neuron firing is a smoothed (lowpass filtered) version of the IHC voltage
Neurotransmitter release (instant)
Neurotransmitter Re-uptake (several ms)
Neurotransmitter binding (several ms)
![Page 24: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/24.jpg)
Signal Processing in the Inner Ear Signal Processing in the Inner Ear (Simulated)(Simulated)
![Page 25: ECE 598: The Speech Chain](https://reader036.fdocuments.us/reader036/viewer/2022062314/56813e61550346895da86b0c/html5/thumbnails/25.jpg)
Neural Response to a Synthetic Neural Response to a Synthetic VowelVowel
(Cariani, 2000)(Cariani, 2000)