Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming...
-
Upload
monica-young -
Category
Documents
-
view
236 -
download
1
Transcript of Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming...
![Page 1: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/1.jpg)
Speech CodingUsing LPC
![Page 2: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/2.jpg)
What is Speech Coding
Speech coding is the procedure of transforming speech signal into more compact form for
Transmission Available Bandwidth
Encryption
![Page 3: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/3.jpg)
Uncompressed Speech signal
Analog speech is a bandpassed signal between 200 and 3400 Hz.
Uncompressed digital speech is a bit stream at 64kB/s.
Transmission technology must transmit the signals from point A to point B:
with minimum degradationusing minimum bandwidth
![Page 4: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/4.jpg)
Speech coding
By coding we mean an efficient representation of the signal
– COMPRESSION
The main approaches: waveform coding transform coding Parametric / hybrid coding
} smart quantizers
![Page 5: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/5.jpg)
{
How each of these works:
Waveform coders: try to find an efficient representation of the waveform, directly.
Transform coders: try to find an efficient representation in the frequency domain.
Parametric coders: try to find a small set of parameters that are an efficient representation of the signal.
FFT, etc.
)(Hexc. speech
![Page 6: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/6.jpg)
Comparison of Comparison of speech coders
![Page 7: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/7.jpg)
LPC (Linear Predictive coding)
LPC is a model for signal production: it is based on the assumption that the speech signal is produced by a very specific model.
![Page 8: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/8.jpg)
Speech Production in HumaSpeech Production in Humans
The speech signal is created by: A pressure source (lungs),
exciting ... A Filter (Vocal tract:
pharynx - mouth [soft palate, tongue] - nasal cavity)
![Page 9: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/9.jpg)
For DSP Engineer For DSP Engineer
An excitation source A time varying filter
H(t, )
filter:Excitation speech
![Page 10: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/10.jpg)
The model and its representationThe model and its representation
The LPC model looks at speech as: Excitation:
periodic (voiced) - originating in the larynx
noise (unvoiced) - fricative, produced in the mouth
An all-pole filter representing the vocal tract
H()
all polefilter:.. ..
![Page 11: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/11.jpg)
Block Diagram
![Page 12: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/12.jpg)
Why the name “Linear Predictive Coding”
It is assumed that the new sample is the weighted linear combination of previous samples
p
inGeins
ians
1)()()(
![Page 13: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/13.jpg)
Z-Plane RepresentationZ-Plane Representation
In the z-plane we can write the model as a transfer function:
H zG
a zii
i
p( )
1
1
• Clearly this transfer function has only poles - which is why it represents an all pole filter.
![Page 14: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/14.jpg)
Mathematical analysisMathematical analysis
Reminder: our problem is to find the LPC parameters, for a given speech signal. This is called the Inverse Problem.
How do we find the set of parameters that gives the best match to the signal?
![Page 15: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/15.jpg)
What are these Parameters
The Coefficients of the All Pole Filter Pitch of the speech
![Page 16: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/16.jpg)
How do we find the Coefficients: least squares
Formulation: Given a signal s(n); Defining an error as:
Find the set of that will minize the mean square error:
p
ii insansne
1)()()(
ai
E e nn
2( )
![Page 17: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/17.jpg)
Solution:Solution:
Simply equate the derivative of E to zero:
E
ai p
i
0 1, ...
• Which gives us the Normal Equations:
piinsnsinsknsan
p
k nk ...1,)()()()(
1
• These are no more than p linear equations in p unknowns...
![Page 18: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/18.jpg)
Or in matricial form:
n
n
n
p
nnn
nnn
nnn
nspns
nsns
nsns
a
a
a
pnspnsnspnsnspns
pnsnsnsnsnsns
pnsnsnsnsnsns
)()(
)()2(
)()1(
)()()2()()1()(
)()2()2()2()1()2(
)()1()2()1()1()1(
2
1
![Page 19: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/19.jpg)
A correlation; in other words: take the signal, multiply it by a shifted version, and sum.
Since our signal is long and time varying- we did it on short windows
Two variants: autocorrelation method covariance method
?)()( n
inskns
What is each element of the form-
![Page 20: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/20.jpg)
Solving the Matrix
Found the Coefficients a(i) by Using the
Levinson-Durbin recursion method
![Page 21: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/21.jpg)
Second Parameter
Pitch was found by the finding the correlation of the signal window with itself
Then these parameters were transmitted
![Page 22: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/22.jpg)
Predictor coefficients 18 * 8 = 144
Gain 5
Pitch period 6
Voiced/unvoiced switch
1
Total 156
Overall bit rate50 * 156 = 7800
bits / second
Bit rate for plain LPC vocoder
![Page 23: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/23.jpg)
Predictor coefficients
18 * 8 = 144
Gain 5
DCT coefficients
40 * 4 = 160
Total 309
Overall bit rate
50 * 309 = 15450 bits /
second
Bit rate for voice-excited LPC vocoder with DCT
![Page 24: Speech Coding Using LPC. What is Speech Coding Speech coding is the procedure of transforming speech signal into more compact form for Transmission.](https://reader036.fdocuments.us/reader036/viewer/2022062802/56649e865503460f94b895e4/html5/thumbnails/24.jpg)
Conclusion Sound produced through LPC method is
not exactly the real sound but it sounds intelligibly understandable
LPC can be used in Speech recognition systems
LPC was widely used in Military because of low bit rate in transmission
There are many variants over the basic scheme: LPC-10, CELP, MELP, RELP, VSELP, ASELP, LD-CELP...