
8D-L3 Hidden Markov Models

CS4495/6495 Introduction to Computer Vision

Outline

• Time Series

•Markov Models

•Hidden Markov Models

•3 computational problems of HMMs

•Applying HMMs in vision – Gesture Recognition

[Figure: audio spectrum of the song of the Prothonotary Warbler, shown alongside the Chestnut-sided Warbler]

Questions One Could Ask

• What bird is this?

• How will the song continue?

• Is this bird sick?

• What phases does this song have?

Time series classification

Time series prediction

Outlier detection

Time series segmentation

[Figure: stock price time series for Intel, Cisco, General Electric, and Microsoft]

Questions One Could Ask

•Will the stock go up or down?

• What type of stock is this (e.g., risky)?

• Is the behavior abnormal?

Time series prediction

Time series classification

Outlier detection

Music Analysis

Questions One Could Ask

• Is this Beethoven or Bach?

•Can we compose more of that?

•Can we segment the piece into themes?

Time series classification

Time series prediction/generation

Time series segmentation

For vision: Waving, pointing, controlling?

The Real Question

• How do we model these problems?

• How do we formulate these questions as inference/learning problems?

Outline For Today

• Time Series

•Markov Models

•Hidden Markov Models

•3 computational problems of HMMs

Weather: A Markov Model (maybe?)

[Figure: three-state weather Markov chain. Sunny stays Sunny with probability 80%, goes to Rainy with 15% and to Snowy with 5%; Rainy goes to Sunny with 38%, stays Rainy with 60%, goes to Snowy with 2%; Snowy goes to Sunny with 75%, to Rainy with 5%, and stays Snowy with 20%.]

The probability of moving to a given state depends only on the current state: 1st-order Markovian.

Ingredients of a Markov Model

• States: S = {S_1, S_2, …, S_N}

• State transition probabilities: a_ij = P(q_{t+1} = S_j | q_t = S_i)

• Initial state distribution: π_i = P(q_1 = S_i)


Ingredients of a Markov Model

• States: S = {S_sunny, S_rainy, S_snowy}

• State transition probabilities:

  A = | 0.80  0.15  0.05 |
      | 0.38  0.60  0.02 |
      | 0.75  0.05  0.20 |

• Initial state distribution: π = (0.7, 0.25, 0.05)


Probability of a Time Series

• Given:

  π = (0.7, 0.25, 0.05)

  A = | 0.80  0.15  0.05 |
      | 0.38  0.60  0.02 |
      | 0.75  0.05  0.20 |

• What is the probability of this series: Sunny, Rainy, Rainy, Rainy, Snowy, Snowy?

  P(O) = P(S_sunny) P(S_rainy | S_sunny) P(S_rainy | S_rainy) P(S_rainy | S_rainy) P(S_snowy | S_rainy) P(S_snowy | S_snowy)
       = 0.7 × 0.15 × 0.6 × 0.6 × 0.02 × 0.2
       = 0.0001512
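A quick way to sanity-check this number is to evaluate the chain probability in code. A minimal Python/NumPy sketch (the state ordering Sunny, Rainy, Snowy and the function name are my own conventions):

```python
import numpy as np

# Transition matrix and initial distribution from the weather example
# (rows/columns ordered Sunny, Rainy, Snowy).
A = np.array([[0.80, 0.15, 0.05],
              [0.38, 0.60, 0.02],
              [0.75, 0.05, 0.20]])
pi = np.array([0.70, 0.25, 0.05])

SUNNY, RAINY, SNOWY = 0, 1, 2
sequence = [SUNNY, RAINY, RAINY, RAINY, SNOWY, SNOWY]

def markov_sequence_prob(seq, pi, A):
    """P(q_1, ..., q_T) = pi[q_1] * prod_t A[q_{t-1}, q_t] for a 1st-order chain."""
    p = pi[seq[0]]
    for prev, cur in zip(seq[:-1], seq[1:]):
        p *= A[prev, cur]
    return p

print(markov_sequence_prob(sequence, pi, A))  # -> 0.0001512 (up to float rounding)
```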

Outline For Today

• Time Series

•Markov Models

•Hidden Markov Models

•3 computational problems of HMMs

Hidden Markov Models: Intuition

• Suppose you can’t observe the state

• You can only observe some evidence…

Hidden Markov Models: Weather Example

Observables:

Emission probabilities: b_j(k) = P(o_t = k | q_t = S_j)

Hidden Markov Models: Weather Example

[Figure: the Sunny/Rainy/Snowy weather chain with the transition probabilities above is NOT OBSERVABLE; each hidden state emits OBSERVABLE evidence according to emission probabilities (Sunny: 60%, 30%, 10%; Rainy: 5%, 30%, 65%; Snowy: 0%, 50%, 50%).]

Probability of a Time Series

• Given:

  B = | 0.60  0.30  0.10 |
      | 0.05  0.30  0.65 |
      | 0     0.50  0.50 |

  A = | 0.80  0.15  0.05 |
      | 0.38  0.60  0.02 |
      | 0.75  0.05  0.20 |

  π = (0.7, 0.25, 0.05)

• What is the probability of this series?

Probability of a Time Series

•Given:

  B = | 0.60  0.30  0.10 |
      | 0.05  0.30  0.65 |
      | 0     0.50  0.50 |

  A = | 0.80  0.15  0.05 |
      | 0.38  0.60  0.02 |
      | 0.75  0.05  0.20 |

  π = (0.7, 0.25, 0.05)

  P(O) = P(O_coat, O_coat, O_umbrella, …, O_umbrella)

  P(O | λ) = Σ_{all Q = q_1, …, q_7} P(O | q_1, …, q_7) P(q_1, …, q_7)

           = (0.3² · 0.1⁴ · 0.6) · (0.7 · 0.8⁶) + …   (first term: all sun!)
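Brute force makes the structure of this sum explicit: enumerate every hidden-state path Q, multiply P(O | Q) by P(Q), and add the results. A minimal Python/NumPy sketch (the 7-symbol observation sequence below is a hypothetical stand-in, since the slide shows only part of the real one):

```python
import itertools
import numpy as np

# Weather example parameters (state order: Sunny, Rainy, Snowy).
A = np.array([[0.80, 0.15, 0.05],
              [0.38, 0.60, 0.02],
              [0.75, 0.05, 0.20]])
B = np.array([[0.60, 0.30, 0.10],
              [0.05, 0.30, 0.65],
              [0.00, 0.50, 0.50]])
pi = np.array([0.70, 0.25, 0.05])

# Hypothetical 7-step observation sequence (column indices into B).
observations = [1, 1, 2, 2, 2, 2, 0]

def naive_likelihood(obs, pi, A, B):
    """P(O | lambda) by summing P(O | Q) P(Q) over all N**T state paths."""
    n_states, total = A.shape[0], 0.0
    for path in itertools.product(range(n_states), repeat=len(obs)):
        p = pi[path[0]] * B[path[0], obs[0]]              # start state + first emission
        for t in range(1, len(obs)):
            p *= A[path[t - 1], path[t]] * B[path[t], obs[t]]
        total += p
    return total

print(naive_likelihood(observations, pi, A, B))
```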

Specification of an HMM

N - number of states

• S = {𝑆1, 𝑆2, … 𝑆𝑁} – set of states

• 𝑄 = {𝑞1; 𝑞2; … ; 𝑞𝑇} – sequence of states

Specification of an HMM: 𝜆= (A,B,π)

A - the state transition probability matrix 𝑎𝑖𝑗 = 𝑃(𝑞𝑡+1 = 𝑗|𝑞𝑡 = 𝑖)

B- observation probability distribution

Discrete: 𝑏𝑗(𝑘) = 𝑃(𝑜𝑡 = 𝑘 | 𝑞𝑡 = 𝑗), 1 ≤ 𝑘 ≤ 𝑀

Continuous: 𝑏𝑗(𝑥) = 𝑝(𝑜𝑡 = 𝑥 | 𝑞𝑡 = 𝑗)

π - the initial state distribution

𝜋 (𝑗) = 𝑃(𝑞1 = 𝑗)

[Figure: three-state HMM transition diagram over states S1, S2, S3]

Specification of an HMM

Some form of output symbols

• Discrete – finite vocabulary of symbols of size M. One symbol is “emitted” each time a state is visited

• Continuous – an output density in some feature space associated with each state, where an output is emitted with each visit
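To make the generative reading concrete, here is a minimal sampling sketch (Python/NumPy; the function name, seed, and symbol indexing are my own choices, while the probabilities are the weather example from these slides). Each visit to a hidden state emits one discrete symbol:

```python
import numpy as np

rng = np.random.default_rng(0)

# Weather example lambda = (A, B, pi): states Sunny/Rainy/Snowy,
# three discrete observable symbols (the columns of B).
A  = np.array([[0.80, 0.15, 0.05],
               [0.38, 0.60, 0.02],
               [0.75, 0.05, 0.20]])
B  = np.array([[0.60, 0.30, 0.10],
               [0.05, 0.30, 0.65],
               [0.00, 0.50, 0.50]])
pi = np.array([0.70, 0.25, 0.05])

def sample(T):
    """Generate (hidden states, observations); one symbol emitted per state visit."""
    states, obs = [], []
    q = rng.choice(3, p=pi)                      # q_1 ~ pi
    for _ in range(T):
        states.append(int(q))
        obs.append(int(rng.choice(3, p=B[q])))   # o_t ~ b_q(.)
        q = rng.choice(3, p=A[q])                # q_{t+1} ~ a_q,.
    return states, obs

print(sample(7))
```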

Specification of an HMM

Considering a given observation sequence O

• 𝑂 = {𝑜1; 𝑜2; … ; 𝑜𝑇} – 𝑜𝑖 is the observed symbol or feature at time i (sometimes a set of them)

Specification of an HMM: 𝜆= (𝐴, 𝐵, 𝜋)

𝐴 - the state transition probability matrix 𝑎𝑖𝑗 = 𝑃(𝑞𝑡+1 = 𝑗|𝑞𝑡 = 𝑖)


Specification of an HMM: 𝜆= (𝐴, 𝐵, 𝜋)

𝐵- observation probability distribution

Discrete: 𝑏𝑗(𝑘) = 𝑃(𝑜𝑡 = 𝑘 | 𝑞𝑡 = 𝑗), 1 ≤ 𝑘 ≤ 𝑀

Continuous: 𝑏𝑗(𝑥) = 𝑝(𝑜𝑡 = 𝑥 | 𝑞𝑡 = 𝑗)


Specification of an HMM: 𝜆= (𝐴, 𝐵, 𝜋)

𝜋 - the initial state distribution

𝜋 (𝑗) = 𝑃(𝑞1 = 𝑗)


What does this have to do with Vision?

• Given some sequence of observations, what "model" generated those?

• Using the previous example: given some observation sequence of clothing:

• Is this Philadelphia, Boston or Newark?

• Notice that for Boston vs. Arizona we would not need the sequence – a single observation would suffice!

Outline For Today

• Time Series

•Markov Models

•Hidden Markov Models

•3 computational problems of HMMs

The 3 great problems in HMM modelling

1. Evaluating 𝑃(𝑂|𝜆): Given the model 𝜆 = (𝐴, 𝐵, 𝜋) what is the probability of occurrence of a particular observation sequence

𝑂 = 𝑜1, … , 𝑜𝑇

• Classification/recognition problem: I have a trained model for each of a set of classes; which one would most likely have generated what I saw?

The 3 great problems in HMM modelling

2. Decoding: Optimal state sequence to produce an observation sequence

𝑂 = {𝑜1, … , 𝑜𝑇}

• Useful in recognition problems – helps give meaning to states.
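The standard dynamic-programming solution to this decoding problem is the Viterbi algorithm (the slides name only the problem, not the algorithm). A minimal sketch, with my own function signature and obs as a list of discrete symbol indices:

```python
import numpy as np

def viterbi(obs, pi, A, B):
    """Most likely hidden-state path for a discrete-emission HMM."""
    T, N = len(obs), A.shape[0]
    delta = np.zeros((T, N))           # best path probability ending in each state
    back = np.zeros((T, N), dtype=int)
    delta[0] = pi * B[:, obs[0]]
    for t in range(1, T):
        scores = delta[t - 1][:, None] * A       # candidate transitions i -> j
        back[t] = scores.argmax(axis=0)
        delta[t] = scores.max(axis=0) * B[:, obs[t]]
    path = [int(delta[-1].argmax())]             # best final state
    for t in range(T - 1, 0, -1):                # trace back-pointers
        path.append(int(back[t, path[-1]]))
    return path[::-1], float(delta[-1].max())
```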

The 3 great problems in HMM modelling

3. Learning: Determine model λ, given a training set of observations

• Find λ, such that 𝑃(𝑂|𝜆) is maximal

Problem 1 𝑃(𝑂|𝜆) : Naïve solution

Assume

• We know state sequence 𝑄 = 𝑞1, … 𝑞T

• Independent observations:

then

P(O | q, λ) = ∏_{t=1..T} P(o_t | q_t, λ) = b_{q_1}(o_1) · b_{q_2}(o_2) · … · b_{q_T}(o_T)

Problem 1 𝑃(𝑂|𝜆) : Naïve solution

•But we know the probability of any given sequence of states:

P(q | λ) = π_{q_1} · a_{q_1 q_2} · a_{q_2 q_3} · … · a_{q_{T-1} q_T}

Problem 1 𝑃(𝑂|𝜆) : Naïve solution

• Given:

  P(O | q, λ) = ∏_{t=1..T} P(o_t | q_t, λ) = b_{q_1}(o_1) · b_{q_2}(o_2) · … · b_{q_T}(o_T)

  P(q | λ) = π_{q_1} · a_{q_1 q_2} · a_{q_2 q_3} · … · a_{q_{T-1} q_T}

• We get:

  P(O | λ) = Σ_q P(O | q, λ) P(q | λ)

But this is summed over all paths. There are N^T state paths, each "costing" O(T) calculations, leading to O(T·N^T) time complexity.

Problem 1 𝑃(𝑂|𝜆) : Efficient solution

Define the auxiliary forward variable α:

α_t(i) is the probability of observing the partial sequence of observables o_1, …, o_t AND being in state i at time t:

  α_t(i) = P(o_1, …, o_t, q_t = i | λ)

Problem 1 𝑃(𝑂|𝜆) : Efficient solution

Forward recursive algorithm:

• Initialise: α_1(i) = π_i b_i(o_1)

• Each time step: α_{t+1}(j) = [Σ_{i=1..N} α_t(i) a_ij] · b_j(o_{t+1})
  (state j can be reached from any preceding state i)

• Conclude: P(O | λ) = Σ_{i=1..N} α_T(i)
  (the probability of the entire observation sequence is the sum, over all i, of the probability of observing the whole sequence and ending in state i)

• Complexity: O(N²T)
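In code the forward recursion is only a few lines. A minimal Python/NumPy sketch (the function name and the convention that obs holds symbol indices into the columns of B are my own choices):

```python
import numpy as np

def forward(obs, pi, A, B):
    """alpha[t, i] = P(o_1..o_t, q_t = i | lambda); returns (alpha, P(O | lambda))."""
    T, N = len(obs), A.shape[0]
    alpha = np.zeros((T, N))
    alpha[0] = pi * B[:, obs[0]]                      # initialise
    for t in range(1, T):
        alpha[t] = (alpha[t - 1] @ A) * B[:, obs[t]]  # recursion: O(N^2) per time step
    return alpha, alpha[-1].sum()                     # conclude: sum over final states
```

On the weather example this returns the same P(O | λ) as brute-force enumeration over all state paths, but in O(N²T) rather than O(T·N^T) time.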

Rest of HMMs (in brief)

• The forward recursive algorithm could compute the likelihood of being in a state i at time t and having observed the sequence from the start until t, given HMM λ

Rest of HMMs (in brief)

•A backward recursive algorithm could compute the likelihood of being in a state i at time t and observing the remainder of the observed sequence, given HMM λ

So… or hmmmm…

1. If we know HMM 𝜆, then with the forward and backward algorithms we can get an estimate of the distribution over which state the system is in at time t.

2. With those distributions, and having actually observed output data, I can determine the emission probabilities 𝑏𝑗(𝑘) that would maximize the probability of the sequence.

So… or hmmmm…

3. Given the distributions over states, we can also determine the transition probabilities 𝑎𝑖𝑗 that maximize the probability.

4. With the new 𝑎𝑖𝑗 and 𝑏𝑗(𝑘), I can get a new estimate of the state distributions at all times. (Go to 1)
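Steps 1–4 are one iteration of the Baum-Welch (EM) re-estimation procedure. A compact single-sequence sketch is below (Python/NumPy; the function names and array layout are my own, and it omits the numerical safeguards, e.g. scaling, that a real implementation needs):

```python
import numpy as np

def forward(obs, pi, A, B):
    """alpha[t, i] = P(o_1..o_t, q_t = i | lambda)."""
    alpha = np.zeros((len(obs), len(pi)))
    alpha[0] = pi * B[:, obs[0]]
    for t in range(1, len(obs)):
        alpha[t] = (alpha[t - 1] @ A) * B[:, obs[t]]
    return alpha

def backward(obs, A, B):
    """beta[t, i] = P(o_{t+1}..o_T | q_t = i, lambda)."""
    beta = np.ones((len(obs), A.shape[0]))
    for t in range(len(obs) - 2, -1, -1):
        beta[t] = A @ (B[:, obs[t + 1]] * beta[t + 1])
    return beta

def baum_welch_step(obs, pi, A, B):
    """One EM re-estimation of (pi, A, B) from a single discrete observation sequence."""
    obs = np.asarray(obs)
    alpha, beta = forward(obs, pi, A, B), backward(obs, A, B)
    likelihood = alpha[-1].sum()                       # P(O | lambda)
    gamma = alpha * beta / likelihood                  # P(q_t = i | O)       (step 1)
    # xi[t, i, j] = P(q_t = i, q_{t+1} = j | O)
    xi = (alpha[:-1, :, None] * A[None, :, :] *
          (B[:, obs[1:]].T * beta[1:])[:, None, :]) / likelihood
    new_pi = gamma[0]                                  # initial distribution
    new_A = xi.sum(axis=0) / gamma[:-1].sum(axis=0)[:, None]           # (step 3)
    new_B = np.stack([gamma[obs == k].sum(axis=0)                      # (step 2)
                      for k in range(B.shape[1])], axis=1)
    new_B /= gamma.sum(axis=0)[:, None]
    return new_pi, new_A, new_B                        # (step 4: repeat until converged)
```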

HMMs: General

• HMMs: Generative probabilistic models of time series (with hidden state)

• Forward-Backward: Algorithm for computing probabilities over hidden states

• Given the forward-backward algorithm, you can also train the models.

• Best-known methods in speech, computer vision, and robotics, though for really big data CRFs are winning.

Some thoughts about gestures

• There is a conference on Face and Gesture Recognition, so obviously gesture recognition is an important problem…

• Prototype scenario:

  • Subject does several examples of "each gesture"

  • System "learns" (or is trained) to have some sort of model for each

  • At run time, compare the input to the known models and pick one

New found life for Gesture Recognition:

Generic Gesture Recognition using HMMs

Nam, Y., & Wohn, K. (1996, July). Recognition of space-time hand-gestures using hidden Markov model. In ACM symposium on Virtual reality software and technology (pp. 51-58).

Generic Gesture Recognition using HMMs (slides 1–5)

[Figures only; the first shows a data glove used to capture hand gestures]

Pluses and minuses of HMMs in Gesture

Good points about HMMs:

• A learning paradigm that acquires spatial and temporal models and does some amount of feature selection.

• Recognition is fast; training is not so fast but not too bad.

Pluses and minuses of HMMs in Gesture

Not so good points:

• Not great for on-the-fly labeling, e.g. segmentation of input streams; requires lots of data to train for that – much like language.

• Works well when the problem is easy; less clear at other times.