Statistics of Natural Image Categories - EECS at UC...

Statistics of Natural ImageCategories

Authors: Antonio Torralba and Aude Oliva

Presented by:Sebastian Scherer

Experiment

Please estimate the average depth from the camera viewpoint to all locations(pixels) in the next picture.

Next you will see a circle for 3s, then a picture for 1s.Concentrate on the circle.After that I will ask you about the average depth of the picture.

What would you estimate the mean depth of this picture was?

What is the mean depth of this picture?

Problem

We want to determine global properties about an image.

However avoid• Explicit segmentation• No object recognition

Properties that are important for later stages of image processing are

• Scale• Type

Outline

Global features: power spectra

Examples

Localized spectra

Applications of global features• Naturalness/Openness classification• Scene categorization• Object recognition• Depth estimation

Power spectra

Decompose the image using a discrete Fourier transform:

The power spectrum is then given by the amplitude and phase

Power spectrum plot

Horizontal

VerticalContour plotrepresentinga percentageof total energyof the spectrum

Computing and visualizing a spectrum in Matlab•Computing the Spectrum (Matlab):

• Ifft = abs(fftshift(fft2(I,w,h)));

•Visualization:• imshow(log(Ifft)/max(max(log(Ifft))));• colormap(cool);

•(From Jonathan Huang's slides)

Interesting fact: 1/f Spectra

Natural Image Spectra follow a power law!

As(θ) is called the Amplitude Scaling Factor

2-η(θ) is the Frequency Exponent. η clusters around 0 for natural images.

Any guesses on why this law holds?

Use the spectra of images in order to categorize a scene

Why is this a good idea?• Texture varies with scene scale:

– Atmosphere is a low pass filter?– Sky– Mountains

– vs– Leaves– Objects

• Phase varies with environment:– Man-made– Nature

Spectral signatures for different scenes

HorizonAll orientations

Buildings

Scene scales

“The point of view that any given observer adopts on a specific scene is constrained by the volume of the scene.”

What can one spectrum of an image not capture?

Generally we have a upright viewpoint.The horizon is towards the top.

Non-stationary power spectra

Decompose the image using a DFT in local regions:

The localized power spectrum is then given by the amplitude and phase for a specific region

In their case: 8x8 spatial locations

Non-stationary power spectra at different depth scales

Man-made

Natural

What can we do with the power spectra?

Replicate perception of humans along different scales• Naturalness• Openness

Semantic categorization• Determine context of scene• Apply specialized methods after context is determined

Object recognition• Determine if an object exists in the scene• Only presence no location• Likely regions of objects

Depth estimation• Estimate the mean depth of the scene• Provides cue for object recognition

Naturalness vs. Openness

Projection of images on thesecond and third principalcomponent.

Openness

Naturalness is represented bythe third principal component

The power spectrumNormalization:

Perform PCA on the normalized power spectrum to get the spectral principal components (SPC).

The first principal components of a set of images

Openness

Naturalness

Semantic categorization

Semantic categorization calculation

Object recognition

Object recognition – Algorithm

During training phase learn from a set of annotated pictures.O: object class, v_c: image statisticsBayes rule (without marginalization):

Equal prior is assumed:

Estimate with a mixture of Gaussians from

training set.

Object recognition – Results 1

Object recognition – Results 2

True positive rate True negative rate

Adding spatial information

Split the picture in four equal regions.Learn a mixture of Gaussians in order to determine the region where one is most likely to find an object. Given the spectral feature at four location what is the most likely position of a face.

Attention will be on the most likely region to find a face.

Regions of interest

90% of faces were within a region of 35% of the size of the image of the largest P(x|v_c)

Use the global features as a prior on the location of objects in a object detection and localization algorithm. Since x is dependent on many factors only learn y and s.

where π(q) are the mixing weights, W is the regression matrix, µ are mean vectors, and Σ are covariance matrices for cluster q.

Gist – Results

Depth Estimation – Feature vector

Have a feature vector v.v' consists of the downsampled energy vector

k: wavelet index, x: location, M spatial resolutionFeature vector size: M^2 K

Apply PCA to reduce the dimensionality of v' to get v.v is a L-dimensional vector obtained by projecting v' on the first L principal components.

=> v is size L.

Depth Estimation - Learning

Want to optimize this expression

D:depth, v: features, Nc: number of clusters, p(ci): cluster weight, g(v|ci): multivariate gaussian, g(D|v,ci):

Result is a mixture of linear regressions:

Depth Estimation – Global features

Depth Estimation – Localized features

Scene category from depth

Depth Estimation – Face Detection

Determine the size of an object as

Now have approximately the right scale for object detection:

Discussion

Why do power spectra work so well?

Why is there such a large distinction between man-made and human? Are there possibly more distinct classes? Rural streets?

How do humans calculate mean depth when estimating the depth for training?

Statistics of Natural Image Categories - EECS at UC...

Documents

Transcript of Statistics of Natural Image Categories - EECS at UC...

Learning to Detect Natural Image Boundaries Using Local ...efros/courses/LBMV07/presentations/0201Proba… · Boundaries Using Local Brightness, Color and Texture Cues David Martin,

CHILD ABUSE AND NEGLECT STATISTICS Report Library/S12059_Child...July 2009 — June 2010. TABLE OF CONTENTS 2 TABLE OF CONTENTS Statistics-Introduction 3 ... Child Death Categories

IBM SPSS Categories 19 - Illinoiscda.psych.uiuc.edu/statistical_learning_course/IBM SPSS Categories 19.pdfIBM® SPSS® Statistics is a comprehensive system for analyzing data. The

The crime statistics cover five broad categories of crime: Contact crimes Contact-related crimes Property-related crimes Crimes heavily dependent.

Visual Categorization with Bags of Keypointsefros/courses/LBMV07/Papers/... · 2006-01-16 · Visual Categorization with Bags of Keypoints Gabriella Csurka, Christopher R. Dance,

Object Detection Using the Statistics of Partsefros/courses/LBMV07/Papers/schneiderman-ijcv-04.pdf · Object Detection Using the Statistics of Parts 155 Figure 4.Pair-wise mutual

Classification Manual...5.5 Types of Expenditure Statistics – Regular, Exhibit, and Derived 5.6 Functional Categories for Regular Finance Expenditure Statistics 5.7 Tables Chapter

Statistics Review Levels of Measurement. Nominal scale Nominal measurement consists of assigning items to groups or categories. No quantitative information.

Statistics, vision, and the analysis of artistic stylepeople.hws.edu/graham/wires_paper_FINAL.pdfartworks with respect to general stylistic categories (e.g., broad stylistic groupings

Corporation Income 2013 Tax Returns Complete ReportThe statistics are classified primarily by major industries, sectors, return types, and specific categories. The statistics . in

Framework of Statistical Information. This is a typology of the categories or classes of statistical information. Remember the relationship between statistics.

Court statistics for€¦ · Act 2005, and probate applications. Within these categories, there are further breakdowns depending on what it is relevant to report. Court statistics

Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.

2015 Statistics - National Conference of Bar - · PDF file2015 StatiSticS This section includes data, by jurisdiction, on the following categories for 2015: • the number of persons

Undergraduate Admissions Statistics · Undergraduate admissions statistics are provided in the following categories: 1 School/college type 4 2 Region 6 3 UCAS tariff 9 4 Subject 11

Doing statistics, enacting the nation: The performative ... · ORIGINAL ARTICLE Doing statistics, enacting the nation: The performative powers of categories ... rather than a tool

Categories Background Statistics Time in Africa Return to India Ashram First movement

Ethiopiaodin.opendatawatch.com/.../ETH_profile_ODIN2015.pdfWebsite: RANK out of 125 countries SUBSCORES BY MAJOR DATA CATEGORIES COVERAGE subscore Economic Statistics 370/0 33% COVERAGE

Shape Guided Object Segmentationefros/courses/LBMV07/Papers/Borenstein06.pdfShape Guided Object Segmentation Eran Borenstein Division of Applied Mathematics Brown University eranb@dam.brown.edu

Categories Administrative Security Categories 57 Hidden Categories 2