2012 mdsp pr07 bayes decision


Course Calendar

Class  Date           Contents
1      Sep. 26        Course information & course overview
2      Oct. 4         Bayes Estimation
3      Oct. 11        Classical Bayes Estimation: Kalman Filter
4      Oct. 18        Simulation-based Bayesian Methods
5      Oct. 25        Modern Bayesian Estimation: Particle Filter
6      Nov. 1         HMM (Hidden Markov Model)
-      Nov. 8         No class
7      Nov. 15        Bayesian Decision
8      Nov. 29        Nonparametric Approaches
9      Dec. 6         PCA (Principal Component Analysis)
10     Dec. 13        ICA (Independent Component Analysis)
11     Dec. 20        Applications of PCA and ICA
12     Dec. 27        Clustering, k-means, et al.
13     Jan. 17        Other Topics 1: Kernel Machines
14     Jan. 22 (Tue)  Other Topics 2

Lecture Plan

Bayes Decision

1. Introduction
   1.1 Pattern Recognition
   1.2 An Example
   Classification/Decision Theory
2. Bayes Decision Theory
   2.1 Decision Using Posterior Probability
   2.2 Decision by Minimizing Risk
3. Discriminant Function
4. Gaussian Case

1. Introduction


1.1 Pattern Recognition

The second part of this course is concerned with pattern recognition. Pattern recognition (machine learning) aims to give machines the high-level skills humans use for sensing and taking action according to what they observe.

Definitions of pattern recognition as they appear in textbooks:

"The assignment of a physical object or event to one of several pre-specified categories" (Duda et al. [1])

"The science that concerns the description or classification (recognition) of measurements" (Schalkoff, Wiley Online Library)

1.2 An Example (Duda, Hart, & Stork, 2004 [1])

Automatic fish-sorting process: fish arriving on a belt conveyer are to be sorted into sea bass (鱸, action 1) or salmon (鮭, action 2).

[Figure: fish-sorting setup and scatter of samples in the 2-D feature space]

$x = (x_1, x_2)^T$: feature vector in a 2-D feature space, e.g., $x_1$ = lightness, $x_2$ = width
$\alpha$: action

A "correct decision" $\alpha$ should be an appropriate function of the data $x$.

Typical pattern recognition tasks:

■ Classification ■ Regression
■ Clustering ■ Dimension Reduction (Visualization)

Pattern Recognition System

data → Measurement/Preprocessing → Dimension Reduction/Feature Selection (PCA, ICA) → Recognition/Classification (clustering, PDF estimation) → Evaluation (cross-validation), with model changes fed back as needed → analysis results

PDF: Probability Density Function

Classification/Decision Theory

Suppose we observe fish image data $x$; we want to classify it as "sea bass" or "salmon" based on the joint probability distributions $p(x, \text{"sea bass"})$ and $p(x, \text{"salmon"})$. The classification problem is to answer: "How do we make the best decision?"

Classification: assign the input vector to one of two classes.

[Figure: decision boundary in the $(x_1, x_2)$ feature space, separating regions $R_1$ and $R_2$]

2. Bayes' Decision Theory

Framework: Two-Category Case (fish-sorting example)

■ State of nature (class): $\omega$, a discrete random variable; $\omega = \omega_1$ (sea bass) or $\omega = \omega_2$ (salmon)
■ Prior probabilities: $P(\omega_1)$ and $P(\omega_2)$, where $P(\omega_1) + P(\omega_2) = 1$
■ Class-conditional probabilities (likelihoods):
  $p(x|\omega_1)$: PDF for $x$ given that the state of nature is $\omega_1$
  $p(x|\omega_2)$: PDF for $x$ given that the state of nature is $\omega_2$
■ Measurement $x$: brightness of the fish (a scalar continuous variable)

Fig. 1 Class-conditional probability densities


2.1 Decision Using Posterior Probability

■ Posterior probabilities
■ Decision rule (1): minimizing error probability
■ Decision rule (2): likelihood ratio

Define the posterior $P(\omega_j|x)$: the probability of the state being $\omega_j$ given that $x$ has been measured. Bayes' rule gives

$$P(\omega_j|x) = \frac{p(x|\omega_j)\,P(\omega_j)}{p(x)} \tag{1}$$

Decision rule (1):

$$\text{Decide } \omega_1 \text{ if } P(\omega_1|x) > P(\omega_2|x); \quad \text{decide } \omega_2 \text{ if } P(\omega_1|x) < P(\omega_2|x) \tag{2}$$

Equivalently, decision rule (2) in likelihood-ratio form:

$$\text{Decide } \omega_1 \text{ if } \frac{p(x|\omega_1)}{p(x|\omega_2)} > \frac{P(\omega_2)}{P(\omega_1)} \tag{3}$$

The right-hand-side threshold is independent of the observation $x$.

Fig. 2 Decision: (a) posterior probabilities, (b) likelihood ratio
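As a concrete illustration of Eqs. (1)-(3), here is a minimal sketch in Python; the priors and the Gaussian class-conditional densities (and their parameters) are assumed values for illustration, not taken from the lecture.

```python
import numpy as np
from scipy.stats import norm

# Assumed example priors and class-conditional densities p(x|w_j);
# the numbers are illustrative only.
prior = {"sea bass": 2 / 3, "salmon": 1 / 3}
likelihood = {
    "sea bass": norm(loc=11.0, scale=1.5),  # p(x | w1)
    "salmon": norm(loc=8.0, scale=1.0),     # p(x | w2)
}

def posterior(x):
    """Bayes' rule, Eq. (1): P(w_j|x) = p(x|w_j) P(w_j) / p(x)."""
    joint = {w: likelihood[w].pdf(x) * prior[w] for w in prior}
    evidence = sum(joint.values())  # p(x)
    return {w: v / evidence for w, v in joint.items()}

def decide(x):
    """Decision rule, Eq. (2): choose the class with the larger posterior."""
    post = posterior(x)
    return max(post, key=post.get)

print(decide(9.0))  # classify a fish with brightness x = 9.0
```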

Probability of Error

■ Error probability for a measurement $x$ under the decision rule:

$$P(\text{error}|x) = \begin{cases} P(\omega_1|x) & \text{if we decide } \omega_2\ (x \in R_2) \\ P(\omega_2|x) & \text{if we decide } \omega_1\ (x \in R_1) \end{cases} \tag{4}$$

■ Average probability of error:

$$P(\text{error}) = E_x[P(\text{error}|x)] = \int P(\text{error}|x)\,p(x)\,dx = \int_{R_2} p(x|\omega_1)P(\omega_1)\,dx + \int_{R_1} p(x|\omega_2)P(\omega_2)\,dx \tag{5}$$

Fig. 3 P(error)
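A quick numeric check of Eq. (5), reusing the assumed prior/likelihood dictionaries from the sketch above: under the Bayes rule the conditional error is the smaller of the two posteriors, so the average error is the integral of the smaller joint density.

```python
# Under rule (2), P(error|x) = min_j P(w_j|x), so Eq. (5) becomes the
# integral of the smaller joint density p(x|w_j) P(w_j) over all x.
xs = np.linspace(0.0, 20.0, 4001)
joint1 = likelihood["sea bass"].pdf(xs) * prior["sea bass"]
joint2 = likelihood["salmon"].pdf(xs) * prior["salmon"]
print(np.trapz(np.minimum(joint1, joint2), xs))  # average error probability
```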

2.2 Decision by Minimizing Risk

■ An alternative Bayes decision is based on risk, which specifies how costly each action is. Suppose that on observing $x$ we take action $\alpha_i$ (deciding $\omega_i$) while the true state of nature is $\omega_j$; we introduce the loss function

$$\lambda(\alpha_i|\omega_j) \tag{6}$$

■ Example of a loss function: from a medical image we want to determine whether it contains cancer tissue or not. With states $\omega_1$ = cancer, $\omega_2$ = normal and actions $\alpha_1$ = decide "cancer", $\alpha_2$ = decide "normal":

Loss Function
                 ω₁ = cancer   ω₂ = normal
α₁ (cancer)           0             1
α₂ (normal)         100             0

Missing a cancer ($\alpha_2$ when $\omega_1$ is true) is penalized far more heavily than a false alarm.

Expected Loss

■ Conditional risk: the expected loss if we take action $\alpha_i$ for a measurement $x$:

$$R(\alpha_i|x) = E[\lambda(\alpha_i|\omega_j)] = \sum_{j=1}^{2} \lambda(\alpha_i|\omega_j)\,P(\omega_j|x) \tag{7}$$

■ Action: $\alpha_i$ = deciding $\omega_i$ (i = 1, 2)
■ Loss: $\lambda_{ij} := \lambda(\alpha_i|\omega_j)$
■ Conditional risks:

$$R(\alpha_1|x) = \lambda_{11}P(\omega_1|x) + \lambda_{12}P(\omega_2|x) \tag{8}$$

$$R(\alpha_2|x) = \lambda_{21}P(\omega_1|x) + \lambda_{22}P(\omega_2|x) \tag{9}$$

■ The overall risk, to be minimized (its minimum value $R^*$ is the Bayes risk):

$$R = \int R(\alpha(x)|x)\,p(x)\,dx \tag{10}$$
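The following sketch evaluates the conditional risks of Eqs. (8)-(9) for the cancer/normal loss table above and picks the least-risk action; the posterior values passed in are assumed for illustration.

```python
# Loss table from the medical-image example: rows = actions, cols = states.
LAMBDA = [[0.0, 1.0],    # decide "cancer": lambda_11, lambda_12
          [100.0, 0.0]]  # decide "normal": lambda_21, lambda_22

def min_risk_action(post):
    """post = [P(w1|x), P(w2|x)]; return the action index minimizing Eq. (7)."""
    risks = [sum(LAMBDA[i][j] * post[j] for j in range(2)) for i in range(2)]
    return min(range(2), key=lambda i: risks[i])

# Even a 5% posterior probability of cancer makes "decide normal" too risky:
# R(a2|x) = 100 * 0.05 = 5.0 > R(a1|x) = 1 * 0.95 = 0.95.
print(min_risk_action([0.05, 0.95]))  # -> 0, i.e. decide "cancer"
```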

Minimum Risk Decision Rule (1)

$$\text{Decide } \omega_1 \text{ if } R(\alpha_1|x) < R(\alpha_2|x); \quad \text{decide } \omega_2 \text{ if } R(\alpha_1|x) > R(\alpha_2|x) \tag{11}$$

Here, $R(\alpha_1|x) < R(\alpha_2|x)$ is equivalent to

$$(\lambda_{21} - \lambda_{11})\,P(\omega_1|x) > (\lambda_{12} - \lambda_{22})\,P(\omega_2|x) \tag{12}$$

Minimum Risk Decision Rule (2)

$$\text{Decide } \omega_1 \text{ if } \frac{p(x|\omega_1)}{p(x|\omega_2)} > \frac{\lambda_{12} - \lambda_{22}}{\lambda_{21} - \lambda_{11}} \cdot \frac{P(\omega_2)}{P(\omega_1)}; \quad \text{otherwise decide } \omega_2 \tag{13}$$

The right-hand side is a threshold independent of $x$.

Fig. 4 Likelihood ratio
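Eq. (13) can be packaged as a fixed-threshold test on the likelihood ratio; the prior P(ω₁) = 0.01 below is an assumed value for illustration.

```python
# Likelihood-ratio form of the minimum-risk rule, Eq. (13),
# using the cancer/normal losses and an assumed prior P(w1) = 0.01.
lam11, lam12, lam21, lam22 = 0.0, 1.0, 100.0, 0.0
P1, P2 = 0.01, 0.99

THRESHOLD = (lam12 - lam22) / (lam21 - lam11) * (P2 / P1)  # fixed, x-free

def decide_w1(p_x_w1, p_x_w2):
    """Decide w1 iff the likelihood ratio p(x|w1)/p(x|w2) exceeds the threshold."""
    return p_x_w1 / p_x_w2 > THRESHOLD
```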

Minimum Error Probability Decision

Minimizing the risk with the zero-one loss function is equivalent to minimizing the error probability.

Zero-one loss function:

$$\lambda(\alpha_i|\omega_j) = \lambda_{ij} = \begin{cases} 0 & i = j \\ 1 & i \neq j \end{cases} \qquad i, j = 1, 2 \tag{14}$$

With this loss, the likelihood-ratio decision rule (13) becomes the minimum-error decision:

$$\text{Decide } \omega_1 \text{ if } \frac{p(x|\omega_1)}{p(x|\omega_2)} > \frac{P(\omega_2)}{P(\omega_1)} \tag{15}$$
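A short check of this equivalence under assumed posterior values: with the zero-one loss, the conditional risk is $R(\alpha_i|x) = 1 - P(\omega_i|x)$, so the least-risk action coincides with the maximum-posterior decision of Eq. (2).

```python
# With zero-one loss, minimum risk = maximum posterior (Eq. (14) -> Eq. (2)).
ZERO_ONE = [[0.0, 1.0], [1.0, 0.0]]
post = [0.3, 0.7]  # assumed posteriors for the check
risks = [sum(ZERO_ONE[i][j] * post[j] for j in range(2)) for i in range(2)]
assert min(range(2), key=lambda i: risks[i]) == max(range(2), key=lambda i: post[i])
```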

Generalization

General framework:
■ Finite set of states of nature (c classes): $\omega_1, \omega_2, \dots, \omega_c$
■ Actions: $\alpha_1, \alpha_2, \dots, \alpha_a$
■ Loss: $\lambda_{ij} := \lambda(\alpha_i|\omega_j)$, $i = 1, \dots, a$, $j = 1, \dots, c$
■ Measurement: $x$, a d-dimensional vector (feature vector)

3. Discriminant Function

Classifiers can be represented by discriminant functions $g_i(x)$, $i = 1, \dots, c$: assign $x$ to class $\omega_i$, where $i = \arg\max_j g_j(x)$.

[Figure: classifier network structure; inputs $x_1, \dots, x_d$ feed the discriminant functions $g_1(x), g_2(x), \dots, g_c(x)$, and the maximum selects the action.]

Choices of discriminant function:
■ Classifier minimizing the conditional risk: $g_i(x) = -R(\alpha_i|x)$
■ Minimizing the error probability: $g_i(x) = P(\omega_i|x) \propto p(x|\omega_i)\,P(\omega_i)$
■ Alternate (equivalent) function: $g_i(x) = \ln p(x|\omega_i) + \ln P(\omega_i)$
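A minimal sketch of this network view, reusing the assumed `prior` and `likelihood` dictionaries from the earlier sketch: each class gets a discriminant $g_i(x) = \ln p(x|\omega_i) + \ln P(\omega_i)$, and arg max picks the class.

```python
import math

def g(w, x):
    """Log-form discriminant g_i(x) = ln p(x|w_i) + ln P(w_i)."""
    return math.log(likelihood[w].pdf(x)) + math.log(prior[w])

def classify(x):
    """arg max over the discriminant functions."""
    return max(prior, key=lambda w: g(w, x))

print(classify(9.0))  # same decision as the posterior rule above
```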

■ Two-category case: a single discriminant function suffices,

$$g(x) = g_1(x) - g_2(x) \tag{16}$$

Decide $\omega_1$ if $g(x) > 0$; decide $\omega_2$ if $g(x) < 0$. $g(x) = 0$ gives the decision boundary.

4. Gaussian Case

Multivariate Gaussian class-conditional densities:

$$p(x|\omega_i) = N(\mu_i, \Sigma_i) \tag{17}$$

Using the log-form discriminant,

$$g_i(x) = \ln p(x|\omega_i) + \ln P(\omega_i) = -\frac{1}{2}(x - \mu_i)^T \Sigma_i^{-1} (x - \mu_i) - \frac{d}{2}\ln 2\pi - \frac{1}{2}\ln|\Sigma_i| + \ln P(\omega_i) \tag{18}$$

Expanding the quadratic form and dropping the class-independent constant $-\frac{d}{2}\ln 2\pi$:

$$g_i(x) = x^T W_i x + w_i^T x + w_{i0} \tag{19}$$

where

$$W_i = -\frac{1}{2}\Sigma_i^{-1}, \qquad w_i = \Sigma_i^{-1}\mu_i, \qquad w_{i0} = -\frac{1}{2}\mu_i^T \Sigma_i^{-1}\mu_i - \frac{1}{2}\ln|\Sigma_i| + \ln P(\omega_i) \tag{20}$$

Case $\Sigma_i = \Sigma$ (i = 1, 2): the quadratic terms cancel in $g(x) = g_1(x) - g_2(x)$, so the decision boundary is a line (a hyperplane in d dimensions).

General case ($\Sigma_1 \neq \Sigma_2$): the decision boundary is a quadratic curve.

[Figure: linear and quadratic decision boundaries for the two cases.]
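A sketch of the quadratic discriminant of Eqs. (19)-(20); the means, covariances, and priors below are assumed example parameters.

```python
import numpy as np

def gaussian_discriminant(mu, Sigma, P):
    """Build g_i(x) = x^T W_i x + w_i^T x + w_i0 with coefficients from Eq. (20)."""
    Sinv = np.linalg.inv(Sigma)
    W = -0.5 * Sinv
    w = Sinv @ mu
    w0 = -0.5 * mu @ Sinv @ mu - 0.5 * np.log(np.linalg.det(Sigma)) + np.log(P)
    return lambda x: x @ W @ x + w @ x + w0

# Assumed example parameters for two classes:
g1 = gaussian_discriminant(np.array([0.0, 0.0]), np.eye(2), 0.5)
g2 = gaussian_discriminant(np.array([2.0, 1.0]), np.diag([2.0, 0.5]), 0.5)

x = np.array([1.0, 0.5])
print("decide w1" if g1(x) > g2(x) else "decide w2")  # sign of g(x), Eq. (16)
```

Because the two covariances here differ, the implied boundary is quadratic; with equal covariances the $x^T W_i x$ terms would cancel in $g_1(x) - g_2(x)$ and the boundary would be linear, as stated above.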

References:
1) R. O. Duda, P. E. Hart, and D. G. Stork, "Pattern Classification", 2nd ed., John Wiley & Sons, 2004
2) C. M. Bishop, "Pattern Recognition and Machine Learning", Springer, 2006
3) E. Alpaydin, "Introduction to Machine Learning", MIT Press, 2009
4) A. Hyvärinen et al., "Independent Component Analysis", Wiley-Interscience, 2001

Another action: Rejection

Make no classification when the degree of confidence is too low.

What next? In the discussion so far, all of the relevant probabilities were assumed to be known, but in practice this assumption is not assured. Hence Fukunaga's definition of pattern recognition: "A problem of estimating density functions in a high-dimensional space and dividing the space into the regions of categories or classes."

Appendix: Multivariate Gaussian Density Distribution

$$N(\mu, \Sigma): \quad p(x) = \frac{1}{(2\pi)^{d/2}\,|\Sigma|^{1/2}} \exp\left(-\frac{1}{2}(x - \mu)^T \Sigma^{-1} (x - \mu)\right)$$

$x = (x_1, \dots, x_d)^T$: d-dimensional random vector
$\mu = E[x]$: mean vector
$\Sigma = \mathrm{Cov}(x) = E[(x - \mu)(x - \mu)^T]$: covariance matrix
$|\Sigma|$: determinant of $\Sigma$
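A quick numeric check of the density formula against SciPy's implementation; the values of μ, Σ, and x below are assumptions for illustration.

```python
import numpy as np
from scipy.stats import multivariate_normal

mu = np.array([1.0, -1.0])
Sigma = np.array([[2.0, 0.3], [0.3, 0.5]])
x = np.array([0.5, 0.0])

d = len(mu)
diff = x - mu
# Evaluate the density formula directly...
dens = np.exp(-0.5 * diff @ np.linalg.inv(Sigma) @ diff) / (
    (2 * np.pi) ** (d / 2) * np.sqrt(np.linalg.det(Sigma))
)
# ...and compare with SciPy's multivariate normal PDF (should agree).
print(dens, multivariate_normal(mean=mu, cov=Sigma).pdf(x))
```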