Machine Learning in Voice Biometrics

27
Machine Learning in Voice Biometrics Michał Dankiewicz DataKRK Meetup 30.01.2017 basic concepts

Transcript of Machine Learning in Voice Biometrics

Page 1: Machine Learning in Voice Biometrics

Machine Learningin Voice Biometrics

Michał DankiewiczDataKRK Meetup30.01.2017

basic concepts

Page 2: Machine Learning in Voice Biometrics

Agenda

● Biometrics in general

● R&D @ VoicePIN.com

● Machine learning techniques in voice biometrics

● Features extraction

● Gaussian Mixture Models

● i-vectors

● Gotchas

● Challenge

Page 3: Machine Learning in Voice Biometrics

Biometrics

sources: commons.wikimedia.orgclipartfest.compixabay.com

Page 4: Machine Learning in Voice Biometrics

Voice Biometrics

= excitation + filter

sources: synthschool.comcommons.wikimedia.org https://youtu.be/ZQcEyXI1OGM?t=54s

Page 5: Machine Learning in Voice Biometrics

Voice Biometrics

Access control

Transaction authentication

Internet of Things

Law enforcement

sources: giphy.com

Page 6: Machine Learning in Voice Biometrics

Enrollment & verification

Enrollment

Bob

Verification

sources: pixabay.com

Bob

Page 7: Machine Learning in Voice Biometrics

First attempts

Enrollment

Bob

Bob

Verification

cross-correlationsources: pixabay.comcommons.wikimedia.org

Page 8: Machine Learning in Voice Biometrics

R&D @ VoicePIN.com

Page 9: Machine Learning in Voice Biometrics

R&D @ VoicePIN.com

Page 10: Machine Learning in Voice Biometrics

Features extraction

framing: waveform → matrix

MFCC – mel-frequency cepstral coefficients

● psychoacoustics

bottleneck features

● ANN with a narrow layer

sources: Deep neural networks in speaker recognition – K. Odrzywołek, 2016

Page 11: Machine Learning in Voice Biometrics

Gaussian Mixture Models

sources: commons.wikimedia.org

λ – model x – observation (1 point)X – sequence of observations

Page 12: Machine Learning in Voice Biometrics

Identification

Charlie

p(X|λ)

sources: pixabay.comcommons.wikimedia.org

Alice

Bob

argmax

Bob

Page 13: Machine Learning in Voice Biometrics

Verification, UBM & LLR

p(X|λ)Bob

P(λ|X )=p(X|λ)P (λ )

P (X )Bayes’ theorem

we want this

GMM formula same for every speaker

- negligible

P(X )≈p(X|λ?)

P(λ|X)

LLR = log( p(X|λ)

p(X|λUBM))

Totally Bob

UBM – Universal Background Model

LLR – log-likelihood ratiosources: pixabay.comcommons.wikimedia.org

Page 14: Machine Learning in Voice Biometrics

Verification, UBM & LLR

Clear conditions Noisy conditions-2

0

2

4

6

8

10

LLR is a solution

Alice's voice Bob's voice

log(p(X | λ))- log(p(X | UBM)

Alice's modelin relation to

UBM

Clear conditions Noisy conditions

-39

-37

-35

-33

-31

-29

-27

-25

Problem with p(X|λ) - where should we set a threshold?

Alice's voice Bob's voice

log(p(X|λ))

Alice's model

Page 15: Machine Learning in Voice Biometrics

i-vectors

● D-dimensional GMM with C components has D*C mean values● Concatenation of them is a mean supervector

M = m + T*w

speaker supervector[D*C × 1]

UBM supervector[D*C × 1]

total variabilitymatrix

[D*C × W]

speaker i-vector[W × 1]

Page 16: Machine Learning in Voice Biometrics

source: Low-dimensional speech representation based on Factor Analysis and its applications - Najim Dehak and Stephen Shum

Page 17: Machine Learning in Voice Biometrics

Gotchas

● quality of recordings

Page 18: Machine Learning in Voice Biometrics

Gotchas

● quality of recordings

train test

Page 19: Machine Learning in Voice Biometrics

Gotchas

● quality of recordings

train test

Page 20: Machine Learning in Voice Biometrics

Gotchas

● quality of recordings● gender

train test

Page 21: Machine Learning in Voice Biometrics

Gotchas

● quality of recordings● gender

train test

Page 22: Machine Learning in Voice Biometrics

Gotchas

● quality of recordings

● gender

● device/channel

train test

Page 23: Machine Learning in Voice Biometrics

Gotchas

● quality of recordings

● gender

● device/channel

● real case (conditions)

train test

Page 24: Machine Learning in Voice Biometrics

Gotchas

● quality of recordings

● gender

● device/channel

● real case (conditions)

● session variability

train test

Page 25: Machine Learning in Voice Biometrics

Challenge Sneakers, 1992

https://www.youtube.com/watch?v=-zVgWpVXb64

Page 26: Machine Learning in Voice Biometrics

Challenge

„What if someone records my voice?”

www.spoofingchallenge.org

Page 27: Machine Learning in Voice Biometrics

@VoicePINcom

Thanks!

VoicePIN.com