Visual Search

27
Visual Search A behind-the-scenes look at image retrieval Amit Prabhudesai SAIT-India

description

A gentle introduction to some fundamentals of image retrieval

Transcript of Visual Search

Page 1: Visual Search

Visual Search

A behind-the-scenes look at image retrieval

Amit Prabhudesai SAIT-India

Page 2: Visual Search

Outline What is Visual search? Use-cases & applications Basics of a Image Retrieval system

Descriptors Similarity measures Indexing schemes

How do you measure performance?

Page 3: Visual Search

Why do we need visual search?

How do I find what I’m looking for?!

Page 4: Visual Search

What is visual search?

Text query, textual results

Text query, visual results

Visual query, visual results …a.k.a. Visual Search

Page 5: Visual Search

Visual search – use-cases Search by browsing

User begins by submitting a keyword for the object-of-interest

System returns visual results (images/videos)

User browses through them and marks interesting results and asks system to return similar content Search by browsing …

a.k.a. “show me similar content”

Page 6: Visual Search

Uses-cases of visual search Search by example

User has a specific query – e.g. she may be looking for red-cars

Uploads an example of the object-of-interest

System returns similar visual content

Search by example … a.k.a. “Show me all red-cars!”

Page 7: Visual Search

Uses-cases (contd.) … Search by drawing

“A picture speaks better than a thousand words”!!

User draws out an object/concept that she has in mind

System returns visual content similar to the object drawn

Search by drawing …

Page 8: Visual Search

Use-cases (contd.) … Search by category

User wants to retrieve all visual content in a particular category

Difficult problem! Semantic gap: gap

between the user’s understanding and the features computed by a machine

Search by category …

Page 9: Visual Search

Applications of visual search Art galleries & museum management Searching product catalogs Architectural & engineering design Geographical information systems Picture archiving Law-enforcement & criminal investigations

Page 10: Visual Search

Visual search a.k.a. Content Based Image Retrieval (CBIR)

Comparison

Query Index Query Index

Result

Image database

Block-diagram of a typical content-based image retrieval system

Page 11: Visual Search

Components of a CBIR system Image content descriptor

Compute machine-understandable attributes Similarity/distance metrics

Measure the similarity (or lack thereof) between query and database sample

Indexing schemes How do you efficiently search the image/visual content

database Relevance feedback

Use the users’ choices to improve retrieval Performance measurement

Metrics to measure effectiveness

Page 12: Visual Search

Image content descriptors Different attributes are used

Color Shape Texture Spatial layout

Page 13: Visual Search

Image content descriptors – Color Used extensively for

image retrieval Motivation: human

visual system Simple & intuitive! Not very discriminative Used as a first pass to

filter out unlikely examples

Page 14: Visual Search

Image descriptors – Color Apples are red …

… But tomatoes are too!!!

Page 15: Visual Search

Image descriptors – Color Color descriptors

Color histograms – local/global Color moments Color coherence vector Color correlogram

Page 16: Visual Search

Image descriptors - shape Segment foreground

‘objects’ Shape can be used to

describe these objects Desirable attributes

Should be invariant to translation, rotation and scaling

Page 17: Visual Search

Image descriptors – shape Classical shape representation uses

moment variants Boundary based methods

Turning function or Turning angle Geometrical attributes

Aspect ratios, (relative) dimensions

Page 18: Visual Search

Image descriptors – Texture Different scenes may

have same color! Taking a cue from the

human visual system (HVS)

Page 19: Visual Search

Image descriptors – Texture

Texture differentiates between a Lawn and a Forest

Page 20: Visual Search

Image descriptors – Texture Wavelet transform features

Multi-resolution approach to texture analysis Texture described at various scales

Gabor filter features Orientation and scale-tunable line (bar)

detector Tamura features & Wold features

Based on characteristics like coarseness, contrast, directionality, regularity (or lack thereof)

Page 21: Visual Search

Image descriptors – spatial information Sky is blue … but so is

water! What differentiates

them is spatial layout! Some common

descriptors 2D strings Spatial quad-tree Symbolic image

Page 22: Visual Search

The whole is greater than the sum of the parts!! Any one simple

descriptor cannot give results required in ‘usable’ systems!

State-of-the-art systems use combination of descriptors

Page 23: Visual Search

Similarity/distance measures Exact match cannot be found! Similarity/distance measured by

Quadratic-form distance Mahalanobis distance Minkowski-form distance Histogram intersection Kullback-Leibler divergence

Target application decides which distance measure is used

Page 24: Visual Search

Indexing scheme Features typically have high

dimensionality Efficient indexing becomes a critical

performance issue Dimensionality reduction (e.g., PCA) R-tree, linear quad-trees, K-d-B-Tree, grid files

Page 25: Visual Search

Performance Evaluation Precision

Precision is the fraction of retrieved images that are indeed relevant to the query

Recall Recall is the fraction of relevant images

returned by the query Trade-off between precision and recall

Recall tends to increase as # retrieved items increases; precision decreases

Page 26: Visual Search

What is out there? Some Web Resources

http://labs.ideeinc.com/multicolr/ http://www.gazopa.com/ http://www.bing.com/images

Page 27: Visual Search

Thank You