Image Segmentation & Classification...

7
1 May 19, 2008 1 Image Segmentation & Classification APR Course, Delft, The Netherlands Marco Loog May 19, 2008 2 Outline Image segmentation Clustering Supervised Semi-supervised & active learning Image classification “Holistic” Combining classifiers Multiple instance learning May 19, 2008 3 Image Segmentation Label pixels as coming from two or more classes / divide images in two or more regions Core technique in image analysis Radiological images, computer aided diagnosis and detection, satellite images, automated sorting machines, image guided surgery, geological analyses, and so on… May 19, 2008 4 Image Segmentation via Clustering Possible preprocessing E.g. for patch-based supervised labeling Or interactive labeling Divide image in “homogeneous” patches Other unsupervised approaches may be rephrased in similar terms, or are at least related E.g. normalized cuts, [fuzzy] connectedness, region growing, watershed segmentation, etc. May 19, 2008 5 E.g. Color Image Segmentation PCA projection from RGB to 2D subspace image(y); z = im2feat(y); zz = gendat(z,0.01); figure; scatterd(zz*pca(z,2)); May 19, 2008 6 Clustering Result EM-clustering nmc model = k-means 4 clusters How to attach meaning to clusters? Colors are cluster means [lab,w] = emclust(zz,nmc,4); labim = classim(z*w); image(labim); m = meancov(dataset(z,z*w*labeld)); colormap(+m);

Transcript of Image Segmentation & Classification...

Page 1: Image Segmentation & Classification Outlinecic.puj.edu.co/wiki/lib/exe/fetch.php?media=grupos:...Image Segmentation & Classification APR Course, Delft, The Netherlands Marco Loog May

1

May 19, 2008

1

Image Segmentation & Classification

APR Course, Delft, The Netherlands

Marco Loog

May 19, 2008 2

Outline

• Image segmentation

• Clustering

• Supervised

• Semi-supervised & active learning

• Image classification

• “Holistic”

• Combining classifiers

• Multiple instance learning

May 19, 2008 3

Image Segmentation

• Label pixels as coming from two or more classes / divide images in two or more regions

• Core technique in image analysis

• Radiological images, computer aided diagnosis and detection, satellite images, automated sorting machines, image guided surgery, geological analyses, and so on…

May 19, 2008 4

Image Segmentation via Clustering

• Possible preprocessing

• E.g. for patch-based supervised labeling

• Or interactive labeling

• Divide image in “homogeneous” patches

• Other unsupervised approaches may be rephrased in similar terms, or are at least related

• E.g. normalized cuts, [fuzzy] connectedness, region growing, watershed segmentation, etc.

May 19, 2008 5

E.g. Color Image Segmentation

•PCA projection from RGB to 2D subspace

image(y);

z = im2feat(y);

zz = gendat(z,0.01);

figure; scatterd(zz*pca(z,2));

May 19, 2008 6

Clustering Result

•EM-clustering

•nmc model = k-means

•4 clusters

•How to attach meaning to clusters?

•Colors are cluster means

[lab,w] = emclust(zz,nmc,4);labim = classim(z*w);image(labim); m = meancov(dataset(z,z*w*labeld));colormap(+m);

Page 2: Image Segmentation & Classification Outlinecic.puj.edu.co/wiki/lib/exe/fetch.php?media=grupos:...Image Segmentation & Classification APR Course, Delft, The Netherlands Marco Loog May

2

May 19, 2008 7

From Clusters to Classes

• Class names [meaning] may be assigned to clusters based on our interpretation of classifier outputs

• Raw per-cluster outputs of trained nmc model

Fields Road

Sky Objects

May 19, 2008 8

Change in Number of Clusters

sky

road

fieldsobjects

sky

road

fieldsobjects

mountains

May 19, 2008 9

Some Remarks

• To find right objects they have to be rather homogeneous

• Other features can be used, but is often problematic

• Generally, over or under segmentations will occur

• Unsupervised segmentation may be useful if

• Expert labeling is very expensive, only coarse segmentation is necessary, too many possible objects, too much variability, as preprocessing…

May 19, 2008 10

E.g. Interactive Segmentation

• Liver segmentation

[Olsen and Nielsen, 1997]

May 19, 2008 11

Supervised Segmentation

• Potentially, considerably more powerful than unsupervised methods

• Obviously, need for examples / training data

• Many possibilities, many approaches, many published methods; all variations on few themes?

• Active shape models, active appearance models, random field, pixel-based classification, patch classification

May 19, 2008 12

Pixel-Based Methods

• Direct application of supervised pattern recognition techniques

• “Avoid” difficult modeling

• Simple idea

• Extract features per pixel [filter bank, raw intensities, etc.]

• Train classifier and classify pixels based on these features

• Performs state of the art on several problems

Page 3: Image Segmentation & Classification Outlinecic.puj.edu.co/wiki/lib/exe/fetch.php?media=grupos:...Image Segmentation & Classification APR Course, Delft, The Netherlands Marco Loog May

3

May 19, 2008 13

E.g. Retinal Vessel Segmentation

[Niemeijer et al, 2004]

May 19, 2008 14

Pixel Classification Setup

•Experiments on 40 images from DRIVE database [www.isi.uu.nl/Research/Databases]

•Features : Gaussian filters

•Derivatives up to order 2 at

scales 1, 2, 4, 8, 16 and original green plane value,

i.e., 31 features per pixel

•30NN classifier [also tested LDC and QDC]

[Niemeijer et al, 2004]

May 19, 2008 15

Results / ROC

[Niemeijer et al, 2004]

May 19, 2008 16

Another E.g.

• T->B, L->R : orig., man., ASM, AAM, PC, PC+

[Van Ginneken et al, 2006]

May 19, 2008 17

Another E.g.

•PC performs very well

•PC+ outperforms 2nd human observer on lungs

•Classifier combination performs best

•Which approach should be used where?

[Van Ginneken et al, 2006]

May 19, 2008 18

Some Remarks

• Supervised methods can perform very well

• How complex should segmentation approach be?

• Choice of features important and could [still] be difficult

• Solve through feature extraction and selection, dimensionality reduction

• Unclear how much training data needed / level of further improvement possible

• Training data remains serious bottleneck

Page 4: Image Segmentation & Classification Outlinecic.puj.edu.co/wiki/lib/exe/fetch.php?media=grupos:...Image Segmentation & Classification APR Course, Delft, The Netherlands Marco Loog May

4

May 19, 2008 19

Opportunities [and Challenges]

• Semi-supervised learning

• Possibly not hard to collect loads of image data

• Mostly used, however, in interactive setting

• Active learning

• Intelligent, adaptive querying

• May need significantly fewer labeled examples for training

May 19, 2008 20

Semi-Supervised Segmentation

• E.g. interactive, via graph cuts

[Boykov and Funka-Lea, 2006], cf. [Blum et al, 2004]

May 19, 2008 21

Remarks

• Semi-supervised learning employed in image segmentation, but not part of learning process

• Segmentations are done on per-image basis; no “bootstrapping” takes place

• Semi-supervised training / testing loop seems good idea

• How do we know problem is due to current labeling and not, say, choice of features?

• More problems?

• However, have not seen this fully functional yet…

May 19, 2008 22

Active Learning

• Equip learner with possibility to present one or more objects that operator should [re]label

• Preferably, objects picked such that they are the most informative to the learner

• Can e.g. be measured through uncertainty or stability

• Exploration and exploitation may play a role

• Seems very valuable technique, but needs even more research than semi-supervised…

May 19, 2008 23

Image Classification

• Every single image is being labeled as belonging to a certain class

• Possible approaches considered here

• “Holistic”

• Combining classifiers

• Multiple instance learning

May 19, 2008 24

“Holistic” Image Classification

• Consider every image as a feature vector on which classifiers can be trained

• Possible preprocessing

• Subsampling

• Histogram equalization

• [De]blurring

• Cropping

Page 5: Image Segmentation & Classification Outlinecic.puj.edu.co/wiki/lib/exe/fetch.php?media=grupos:...Image Segmentation & Classification APR Course, Delft, The Netherlands Marco Loog May

5

May 19, 2008 25

“Holistic” Image Classification

• Obviously, feature dimensionality very large, so often use feature extraction / reduction strategies

• Mostly linear

• LDA, PCA

• Other option : Regularization

• Little study into image-specific regularization techniques

May 19, 2008 26

Face Recognition

• Holistic approach popular and rather successful

• Lot of research in feature extraction / dimensionality reduction techniques

• Error on notorious “ORL database” [40 subjects with 10 92×112 face images each] close to zero

May 19, 2008 27

Face Recognition

• FERET database

• Combining LDA or PCA with nearest neighbor

• 93.2%vs.80.0%

[Roth]

May 19, 2008 28

Combining Classifiers Approach

• Useful if possible to identify object or person based on local [image] features f

• I being the image, c the class label, and assuming indepedence given c, we have

May 19, 2008 29

Combining Classifiers Approach

• Identification of I goes through fs

• So, if our classifier can provide posteriors for these local features…

May 19, 2008 30

Some Remarks

• Approach is product classifier combination

• May be problems with product

• Multiplication of a lot of small numbers

• E.g. if every pixel represented by feature vector f

• Obviously, we might

• Substitute product for other combiners

• Deal directly with local class labels, not probs

Page 6: Image Segmentation & Classification Outlinecic.puj.edu.co/wiki/lib/exe/fetch.php?media=grupos:...Image Segmentation & Classification APR Course, Delft, The Netherlands Marco Loog May

6

May 19, 2008 31

Detecting Changes in Mammograms

•Aims :

•Separation of hormone replacement therapy [HRT] group form placebo

•Detection of aging effects

•[Ultimately : utilization in cancer risk quantization]

[Raundahl et al, 2008]

May 19, 2008 32

Detecting Changes in Mammograms

•Hessian-based features

•Assign to every pixel posterior probability of belonging to a specific group using 100NN classifier

•Fuse local posteriors into single score simply by averaging

[Raundahl et al, 2008]

May 19, 2008 33

Main Results [for Completeness]

• Method can quantify both age-related effects and effects caused by HRT

• Age effects are significantly detected

• Standard methodologies fail

• Separation of HRT subpopulations is comparable to best methodology

• Latter is interactive

May 19, 2008 34

Also for Completeness

• I mention multiple instance learning [MIL]

• Also used for the classification of images

• Main assumption : image label is + iff there is at least 1 feature vector labeled +

• Seems reasonable, for example in detection of diseases

• “Combining with max operator”

• Seems sensitive to mistraining / mislabeling

May 19, 2008 35

Axis-Parallel Rectangles

•Classical approach to MIL [not necessarily for images]

•Problem on the right•Closed are +

•Open are –

•Construct box that at least includes one + from every positive instance and excludes as many –

[Dietterich et al, 1997]

May 19, 2008 36

Axis-Parallel Rectangles

•Construct box that at least includes one + from every positive instance and excludes as many –

•Difficult problem…

•Generally, don’t think it has been convincingly demonstrated that MIL is the way to approach image classification…

[Dietterich et al, 1997]

OK solution?

Page 7: Image Segmentation & Classification Outlinecic.puj.edu.co/wiki/lib/exe/fetch.php?media=grupos:...Image Segmentation & Classification APR Course, Delft, The Netherlands Marco Loog May

7

May 19, 2008 37

References

• Olsen and Nielsen, Multi-Scale Gradient Magnitude Watershed Segmentation, 1997

• Niemeijer et al, Comparative study of retinal vessel segmentation methods on a new publicly available database, 2004

• Van Ginneken et al, Segmentation of anatomical structures in chest radiographs using supervised methods : a comparative study on a public database, 2006

• Loog and Van Ginneken, Segmentation of the posterior ribs in chest radiographs using iterated contextual pixel classification, 2004

May 19, 2008 38

References

• Boykov and Funka-Lea, Graph Cuts and Efficient N-D Image Segmentation, 2006

• Roth, Face Recognition

• Raundahl et al, Automated Effect-Specific Mammographic Pattern Measures, 2008

• Loog and Van Ginneken, Static posterior probability fusion for signal detection, 2004

• Dietterich et al, Solving the multiple instance problem with axis-parallel rectangles, 1997