Multiple Pitch Tracking for Blind Source Separation Using a Single Microphone

Joseph TabrikianDept. of Electrical and Computer Engineering

Ben-Gurion University of the Negev

Workshop on:Speech Enhancement and Multichannel Audio Processing

Technion 22.2.2007

Outline

Motivation Single source pitch estimation and tracking Multiple source pitch estimation and tracking Experiments Conclusion

Motivation Speech enhancement Sensitivity of many audio processing

algorithms to interference. For example: Automatic speech/speaker recognition Speech/music compression

Single microphone blind source separation (BSS)

Karaoke

Single Source - Modeling Voice frames - harmonic model:

additive Gaussian noise In matrix notation:

( ) cos( ) ( ), 1, ,K

n k n k nk

y t b t v t n N

( ) - nv t

1 1 1 1

2 2 2 2

1 cos cos sin sin

1 cos cos sin sin( )

1 cos cos sin sinN N N N

t K t t K t

( ) , ~ (0, )N vy A b v v R

0 1 1 T

c c cK s sKb b b b b b

Single Source – Pitch Tracking Maximum Likelihood (ML) estimator:

Pitch tracking: The data vector at the mth frame:

- first-order Markov process: Maximum A-posteriori Probability (MAP) pitch tracking

via the Viterbi algorithm.(Tabrikian-Dubnov-Dickalov 2004)

( ) , 1, ,m m m m m M y A b v

11/ 2 1 1/ 2

ˆ arg max ( )

( ) ( ) ( ) ( ) ( )H H

v v vR A

P R A A R A A R

( , , ) ( | )M

M m mm

Single Source - Voicing Decision Unvoiced model

Colored Gaussian noise model:

Voiced/unvoiced decision by the

Generalized Likelihood Ratio Test (GLRT):

~ ( , )N yy 0 R

2 2, ,

max ( | , , ; )GLRT=

max ( | ; ) ( )

voiced

unvoiced

Hv voiced

unvoicedH

y R I P y

(Fisher-Tabrikian-Dubnov 2006)

Multiple Sources ML estimator of from under the

model: with unknown signal and unknown (Gaussian) noise covariance:

j j js y a v

ˆ arg max log max( , )max( , )

, ( ), : ( 1)

ML ll l

T Tsvd L L

A y A A AG T R T T I a a T1

ˆ0 arg max logL

1ˆ arg max MVDRT

ya R a

(Harmanci-Tabrikian-Krolik 2000)

Multiple Sources Voiced model:

v includes other interferences. is unknown. Using J overlapping subframes of size Ls

(2K+1<J< Ls):

jth column of :

ˆ arg max log ( ) ,

1( ) ( ) ( ) , ( ) ( ) ( ) ( )

A A A A AG Y I U U Y A U Λ V

JyR YY

Y 1 1, , ,T

j j j N Jy y y

( ) , ~ (0, )N vy A b v v R

Multiple Sources Pitch tracking:

The data vector at the mth frame:

- first-order Markov process

Maximum A-posteriori Probability (MAP) pitch tracking via the Viterbi algorithm

( ) , 1, ,m m m m m M y A b v

Multiple Sources - Voicing Decision Unvoiced model

Colored Gaussian noise model:

Voiced/unvoiced decision by the GLRT:

~ ( , )N yy 0 R

max ( | , , ; )GLRT=

max ( | ; )

voiced

unvoiced

HJvoiced j

junvoiced jH

v Rb R

(Fisher-Tabrikian-Dubnov 2007)

Multiple Source Models Exact ML for the strongest voiced signal, and

“locally ML” for other voiced signals

ˆ ˆML LML 2,

Experiments – Single Source

Experiments - Two Sources

150 200 250 300 350-90

Frequency [Hz]

Two voiced sources

Experiments – Voicing Decision

Experiments - – Voicing Decision

Conclusions ML pitch estimation for single and multiple sources

have been developed under the harmonic model for voiced frames.

The derived likelihood functions under the two models allow implementation of the Viterbi algorithm for MAP pitch tracking.

The GLRT for voicing decision is derived under the two models.

Future work: development of multiple hypothesis tracking methods for

single microphone BSS. Adaptive estimation of the number of harmonics

Multiple Pitch Tracking for Blind Source Separation Using a Single Microphone

Documents

Transcript of Multiple Pitch Tracking for Blind Source Separation Using a Single Microphone

Using Blind Source Separation and a Compact Microphone ...

Blind Source Separation Using Hessian Evaluation

Blind Separation of Speech Mixtures

One Microphone Source Separation - NYU Computer Scienceroweis/papers/onemic.pdf · One Microphone Source Separation Sam T. Roweis Gatsby Unit, University College London roweis@gatsby.ucl.ac.uk

An Information-Maximization Approach to Blind Separation ...

ADAPTIVE APPROACH FOR BLIND SOURCE SEPARATION OF …

Compressive Blind Source Separation

Introduction to blind source separation - UU

Blind source separation, wavelet denoising and ...w3.cran.univ-lorraine.fr/perso/radu.ranta/pdf/RomoVazquezBSPC2.pdf · Blind source separation, wavelet denoising and discriminant

Introduction to blind source separation - Uppsala University · April - 2006 Signaler & System Uppsala universitet 3 Blind source separation Blind source separation • A number,

Algoritmi per la Blind Audio Source Separation

Blind Signal Separation: Statistical Principlesread.pudn.com/downloads165/doc/752204/cardoso_introBSS.pdf · JEAN-FRAN¸COIS CARDOSO, MEMBER, IEEE Invited Paper Blind signal separation

Audio object separation using microphone array beamformingepubs.surrey.ac.uk/808158/14/Audio object separation using microph… · Audio object separation using microphone array beamforming

Blind Source Separation : from source separation to pixel classication

Blind Separation of Radio Signals in Fading Channelspapers.nips.cc/paper/1366-blind-separation-of-radio-signals-in... · Blind Separation of Radio Signals in Fading Channels 757 into

Blind Source Separation using Dictionary Learning

Blind source separation techniques for the decomposition ...

Blind Source Separation by Independent Components Analysis

Blind Source Separation for Speech Application Under …cdn.intechopen.com/pdfs/39841/InTech-Blind_source_separation_for... · 0 Blind Source Separation for Speech Application Under

Diploma thesis Online Signal Separation Based on Microphone Arrays