Recognition: A machine learning approach

Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, Kristen Grauman, and Derek Hoiem

The machine learning framework

• Apply a prediction function to a feature representation of the image to get the desired output:

f([image of an apple]) = “apple”

f([image of a tomato]) = “tomato”

f([image of a cow]) = “cow”

The machine learning framework

y = f(x)

• Training: given a training set of labeled examples {(x1,y1), …, (xN,yN)}, estimate the prediction function f by minimizing the prediction error on the training set

• Testing: apply f to a never before seen test example x and output the predicted value y = f(x)

In y = f(x): y is the output, f is the prediction function, and x is the image feature.

Steps

Training: Training Images + Training Labels → Image Features → Training → Learned model

Testing: Test Image → Image Features → Learned model → Prediction

Slide credit: D. Hoiem
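The training and testing steps can be sketched in a few lines of Python with NumPy. This is only a minimal illustration under stated assumptions, not the method from the slides: the feature (a normalized intensity histogram), the model (one mean feature vector per class), the synthetic "dark" and "bright" images, and the names `extract_features`, `train`, and `predict` are all hypothetical choices made for brevity.

```python
import numpy as np

def extract_features(image, bins=8):
    """Assumed feature: normalized intensity histogram of a grayscale image in [0, 1]."""
    hist, _ = np.histogram(image, bins=bins, range=(0.0, 1.0))
    return hist / hist.sum()

def train(images, labels):
    """Training: images + labels -> features -> learned model (one mean vector per class)."""
    feats = np.array([extract_features(im) for im in images])
    labels = np.array(labels)
    return {c: feats[labels == c].mean(axis=0) for c in np.unique(labels)}

def predict(model, image):
    """Testing: test image -> features -> learned model -> prediction."""
    x = extract_features(image)
    return min(model, key=lambda c: np.linalg.norm(x - model[c]))

# Synthetic data (an assumption for the demo): dark vs. bright random images.
rng = np.random.default_rng(0)
dark = [rng.uniform(0.0, 0.4, size=(16, 16)) for _ in range(5)]
bright = [rng.uniform(0.6, 1.0, size=(16, 16)) for _ in range(5)]

model = train(dark + bright, ["dark"] * 5 + ["bright"] * 5)
test_image = rng.uniform(0.6, 1.0, size=(16, 16))   # never seen during training
prediction = predict(model, test_image)
print(prediction)
```

The same feature extractor is applied at training and at test time, which is the point of the two parallel rows in the diagram.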

Features

• Raw pixels

• Histograms

• GIST descriptors

• …
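As a rough illustration of the first two feature types, the sketch below computes a raw-pixel vector and an intensity histogram with NumPy. The function names, the 4-bin setting, and the tiny 2×2 image are assumptions for the example; GIST descriptors are not covered here because they need a filter bank that is beyond a short sketch.

```python
import numpy as np

def raw_pixel_feature(image):
    """Flatten the image into one long vector of pixel values."""
    return np.asarray(image, dtype=np.float64).ravel()

def histogram_feature(image, bins=4):
    """Count 8-bit pixel intensities per bin, then normalize so the bins sum to 1."""
    values = np.asarray(image, dtype=np.float64)
    hist, _ = np.histogram(values, bins=bins, range=(0.0, 256.0))
    return hist / hist.sum()

# Hypothetical 2x2 grayscale image with one pixel in each intensity quarter.
image = np.array([[0, 64], [128, 255]], dtype=np.uint8)

pixels = raw_pixel_feature(image)   # keeps spatial layout, length = number of pixels
hist = histogram_feature(image)     # discards layout, length = number of bins
print(pixels, hist)
```

Raw pixels preserve where each intensity occurs, while the histogram only records how often each intensity range occurs.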

Classifiers: Nearest neighbor

f(x) = label of the training example nearest to x

• All we need is a distance function for our inputs

• No training required!

[Figure: a test example plotted among training examples from class 1 and training examples from class 2]
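A nearest neighbor classifier can be sketched as follows. Euclidean distance is assumed as the distance function (the slides leave that choice open), and the 2D points and the name `nearest_neighbor_predict` are made up for the example.

```python
import numpy as np

def nearest_neighbor_predict(train_x, train_y, x):
    """Return the label of the training example closest to x (Euclidean distance)."""
    distances = np.linalg.norm(train_x - x, axis=1)
    return train_y[np.argmin(distances)]

# Hypothetical training examples: two from class 1, two from class 2.
train_x = np.array([[0.0, 0.0], [0.0, 1.0], [5.0, 5.0], [6.0, 5.0]])
train_y = np.array([1, 1, 2, 2])

label_a = nearest_neighbor_predict(train_x, train_y, np.array([0.5, 0.2]))
label_b = nearest_neighbor_predict(train_x, train_y, np.array([5.2, 4.5]))
print(label_a, label_b)
```

There is no training step: the "model" is the stored training set, and all the work happens at test time.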

Classifiers: Linear

• Find a linear function to separate the classes:

f(x) = sgn(w · x + b)
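The prediction rule f(x) = sgn(w · x + b) is a one-liner in NumPy. The slides do not say how w and b are found, so the sketch below assumes the classic perceptron update rule purely for illustration; the data and the names `linear_predict` and `train_perceptron` are hypothetical.

```python
import numpy as np

def linear_predict(w, b, x):
    """f(x) = sgn(w . x + b), returning +1 or -1 (a score of 0 is mapped to +1)."""
    return np.where(x @ w + b >= 0, 1, -1)

def train_perceptron(xs, ys, epochs=20):
    """Assumed training rule: perceptron updates on misclassified examples."""
    w = np.zeros(xs.shape[1])
    b = 0.0
    for _ in range(epochs):
        for x, y in zip(xs, ys):
            if y * (x @ w + b) <= 0:   # wrong side of (or on) the boundary
                w += y * x
                b += y
    return w, b

# Hypothetical linearly separable data with labels in {+1, -1}.
xs = np.array([[2.0, 1.0], [3.0, 2.0], [-1.0, -2.0], [-2.0, -1.0]])
ys = np.array([1, 1, -1, -1])

w, b = train_perceptron(xs, ys)
predictions = linear_predict(w, b, xs)
print(w, b, predictions)
```

Unlike nearest neighbor, the training set can be discarded after training; only w and b are needed at test time.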

Recognition task and supervision

• Images in the training set must be annotated with the “correct answer” that the model is expected to produce

[Figure: example training image annotated “Contains a motorbike”]

Levels of supervision: unsupervised, “weakly” supervised, fully supervised (definition depends on task)

Generalization

• How well does a learned model generalize from the data it was trained on to a new test set?

[Figure: training set (labels known) and test set (labels unknown)]

Generalization

• Components of generalization error
  – Bias: how much does the average model over all training sets differ from the true model?
    • Error due to inaccurate assumptions/simplifications made by the model
  – Variance: how much do models estimated from different training sets differ from each other?
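Both quantities can be estimated numerically by fitting the same kind of model to many training sets. The sketch below is a toy setup under assumptions not in the slides: the "true model" is sin(2πx), the training sets share fixed inputs and differ only in their label noise, and the models are polynomials of degree 1 and degree 5 fitted with `np.polyfit`.

```python
import numpy as np

rng = np.random.default_rng(0)
x_train = np.linspace(0.0, 1.0, 15)      # fixed inputs; only the label noise varies
x_grid = np.linspace(0.0, 1.0, 50)       # points where the models are compared
true_y = np.sin(2 * np.pi * x_grid)      # assumed "true model"

def fit_many(degree, n_sets=200, noise=0.2):
    """Fit one polynomial per noisy training set; return predictions on x_grid."""
    preds = np.empty((n_sets, x_grid.size))
    for i in range(n_sets):
        y = np.sin(2 * np.pi * x_train) + rng.normal(0.0, noise, size=x_train.size)
        preds[i] = np.polyval(np.polyfit(x_train, y, degree), x_grid)
    return preds

def bias_squared_and_variance(preds):
    average_model = preds.mean(axis=0)                        # average over training sets
    bias_sq = float(np.mean((average_model - true_y) ** 2))   # distance to the true model
    variance = float(np.mean(preds.var(axis=0)))              # spread between fitted models
    return bias_sq, variance

bias_simple, var_simple = bias_squared_and_variance(fit_many(degree=1))
bias_flexible, var_flexible = bias_squared_and_variance(fit_many(degree=5))
print(f"degree 1: bias^2 {bias_simple:.4f}, variance {var_simple:.4f}")
print(f"degree 5: bias^2 {bias_flexible:.4f}, variance {var_flexible:.4f}")
```

In this setup the straight line cannot follow the sine curve no matter which training set it sees (high bias), while the more flexible polynomial tracks it on average but moves more from one training set to the next (higher variance).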

• Underfitting: model is too “simple” to represent all the relevant class characteristics
  – High bias and low variance
  – High training error and high test error

• Overfitting: model is too “complex” and fits irrelevant characteristics (noise) in the data
  – Low bias and high variance
  – Low training error and high test error
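A small experiment makes the two failure modes concrete. The sketch below assumes a synthetic regression problem (noisy samples of sin(2πx)) and uses polynomial degree as a stand-in for model complexity; neither choice comes from the slides.

```python
import numpy as np

rng = np.random.default_rng(0)

def true_function(x):
    return np.sin(2 * np.pi * x)

# Small noisy training set and a larger held-out test set (both synthetic).
x_train = np.linspace(0.0, 1.0, 12)
y_train = true_function(x_train) + rng.normal(0.0, 0.1, size=x_train.shape)
x_test = np.linspace(0.0, 1.0, 200)
y_test = true_function(x_test) + rng.normal(0.0, 0.1, size=x_test.shape)

def mse(coeffs, x, y):
    """Mean squared error of the polynomial with the given coefficients."""
    return float(np.mean((np.polyval(coeffs, x) - y) ** 2))

train_err, test_err = {}, {}
for degree in (1, 3, 9):   # too simple, moderate, very flexible
    coeffs = np.polyfit(x_train, y_train, degree)
    train_err[degree] = mse(coeffs, x_train, y_train)
    test_err[degree] = mse(coeffs, x_test, y_test)

for degree in train_err:
    print(f"degree {degree}: train MSE {train_err[degree]:.4f}, "
          f"test MSE {test_err[degree]:.4f}")
```

Training error can only go down as the degree grows, because each higher-degree model contains the lower-degree ones. The degree-1 fit shows underfitting (high error on both sets). Whether the degree-9 fit visibly overfits depends on the noise draw, so compare its test error with its training error rather than assuming a particular outcome.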

Bias-variance tradeoff

[Figure: training error and test error plotted against model complexity. Low complexity: high bias, low variance (underfitting). High complexity: low bias, high variance (overfitting).]

Bias-variance tradeoff

[Figure: one curve for many training examples and one for few training examples, plotted against model complexity. Low complexity: high bias, low variance. High complexity: low bias, high variance.]

Effect of Training Size

[Figure: training error, testing error, and generalization error plotted against the number of training examples, for a fixed prediction model]
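The effect of training size can be reproduced with a small simulation. The sketch below assumes a fixed model (a cubic polynomial), synthetic data drawn from sin(2πx) plus noise, and an average over 30 random training sets per size; all of these settings are illustrative choices, not taken from the slides.

```python
import numpy as np

rng = np.random.default_rng(0)
DEGREE = 3     # fixed prediction model: cubic polynomial (assumed)
NOISE = 0.1

def sample(n):
    """Draw n synthetic examples from the assumed data distribution."""
    x = rng.uniform(0.0, 1.0, size=n)
    y = np.sin(2 * np.pi * x) + rng.normal(0.0, NOISE, size=n)
    return x, y

def mse(coeffs, x, y):
    return float(np.mean((np.polyval(coeffs, x) - y) ** 2))

x_test, y_test = sample(2000)
sizes = [8, 16, 32, 64, 128, 256]
train_curve, test_curve = [], []
for n in sizes:
    train_runs, test_runs = [], []
    for _ in range(30):                  # average over random training sets
        x_train, y_train = sample(n)
        coeffs = np.polyfit(x_train, y_train, DEGREE)
        train_runs.append(mse(coeffs, x_train, y_train))
        test_runs.append(mse(coeffs, x_test, y_test))
    train_curve.append(float(np.mean(train_runs)))
    test_curve.append(float(np.mean(test_runs)))

# Gap between testing and training error at each training-set size.
gap = [te - tr for tr, te in zip(train_curve, test_curve)]
for n, tr, te in zip(sizes, train_curve, test_curve):
    print(f"N={n:4d}  train {tr:.4f}  test {te:.4f}")
```

With few examples the model fits its training set closely but does poorly on new data; as the number of training examples grows, the two errors move toward each other and the gap shrinks.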

Datasets

• Circa 2001: 5 categories, 100s of images per category

• Circa 2004: 101 categories

• Today: up to thousands of categories, millions of images

Caltech 101 & 256

Caltech-101: Fei-Fei, Fergus, Perona, 2004
http://www.vision.caltech.edu/Image_Datasets/Caltech101/

Caltech-256: Griffin, Holub, Perona, 2007
http://www.vision.caltech.edu/Image_Datasets/Caltech256/

Caltech-101: Intraclass variability

The PASCAL Visual Object Classes Challenge (2005-present)

Challenge classes:
• Person: person
• Animal: bird, cat, cow, dog, horse, sheep
• Vehicle: aeroplane, bicycle, boat, bus, car, motorbike, train
• Indoor: bottle, chair, dining table, potted plant, sofa, tv/monitor

http://pascallin.ecs.soton.ac.uk/challenges/VOC/

• Main competitions
  – Classification: For each of the twenty classes, predicting presence/absence of an example of that class in the test image
  – Detection: Predicting the bounding box and label of each object from the twenty target classes in the test image

• “Taster” challenges
  – Segmentation: Generating pixel-wise segmentations giving the class of the object visible at each pixel, or “background” otherwise
  – Person layout: Predicting the bounding box and label of each part of a person (head, hands, feet)
  – Action classification

LabelMe
http://labelme.csail.mit.edu/
Russell, Torralba, Murphy, Freeman, 2008

80 Million Tiny Images
http://people.csail.mit.edu/torralba/tinyimages/

ImageNet
http://www.image-net.org/
