Top-Down Inhibition of Search Distractors in Parallel Visual Search
Visual Search
-
Upload
amit-prabhudesai -
Category
Technology
-
view
981 -
download
0
description
Transcript of Visual Search
Visual Search
A behind-the-scenes look at image retrieval
Amit Prabhudesai SAIT-India
Outline What is Visual search? Use-cases & applications Basics of a Image Retrieval system
Descriptors Similarity measures Indexing schemes
How do you measure performance?
Why do we need visual search?
How do I find what I’m looking for?!
What is visual search?
Text query, textual results
Text query, visual results
Visual query, visual results …a.k.a. Visual Search
Visual search – use-cases Search by browsing
User begins by submitting a keyword for the object-of-interest
System returns visual results (images/videos)
User browses through them and marks interesting results and asks system to return similar content Search by browsing …
a.k.a. “show me similar content”
Uses-cases of visual search Search by example
User has a specific query – e.g. she may be looking for red-cars
Uploads an example of the object-of-interest
System returns similar visual content
Search by example … a.k.a. “Show me all red-cars!”
Uses-cases (contd.) … Search by drawing
“A picture speaks better than a thousand words”!!
User draws out an object/concept that she has in mind
System returns visual content similar to the object drawn
Search by drawing …
Use-cases (contd.) … Search by category
User wants to retrieve all visual content in a particular category
Difficult problem! Semantic gap: gap
between the user’s understanding and the features computed by a machine
Search by category …
Applications of visual search Art galleries & museum management Searching product catalogs Architectural & engineering design Geographical information systems Picture archiving Law-enforcement & criminal investigations
Visual search a.k.a. Content Based Image Retrieval (CBIR)
Comparison
Query Index Query Index
Result
Image database
Block-diagram of a typical content-based image retrieval system
Components of a CBIR system Image content descriptor
Compute machine-understandable attributes Similarity/distance metrics
Measure the similarity (or lack thereof) between query and database sample
Indexing schemes How do you efficiently search the image/visual content
database Relevance feedback
Use the users’ choices to improve retrieval Performance measurement
Metrics to measure effectiveness
Image content descriptors Different attributes are used
Color Shape Texture Spatial layout
Image content descriptors – Color Used extensively for
image retrieval Motivation: human
visual system Simple & intuitive! Not very discriminative Used as a first pass to
filter out unlikely examples
Image descriptors – Color Apples are red …
… But tomatoes are too!!!
Image descriptors – Color Color descriptors
Color histograms – local/global Color moments Color coherence vector Color correlogram
Image descriptors - shape Segment foreground
‘objects’ Shape can be used to
describe these objects Desirable attributes
Should be invariant to translation, rotation and scaling
Image descriptors – shape Classical shape representation uses
moment variants Boundary based methods
Turning function or Turning angle Geometrical attributes
Aspect ratios, (relative) dimensions
Image descriptors – Texture Different scenes may
have same color! Taking a cue from the
human visual system (HVS)
Image descriptors – Texture
Texture differentiates between a Lawn and a Forest
Image descriptors – Texture Wavelet transform features
Multi-resolution approach to texture analysis Texture described at various scales
Gabor filter features Orientation and scale-tunable line (bar)
detector Tamura features & Wold features
Based on characteristics like coarseness, contrast, directionality, regularity (or lack thereof)
Image descriptors – spatial information Sky is blue … but so is
water! What differentiates
them is spatial layout! Some common
descriptors 2D strings Spatial quad-tree Symbolic image
The whole is greater than the sum of the parts!! Any one simple
descriptor cannot give results required in ‘usable’ systems!
State-of-the-art systems use combination of descriptors
Similarity/distance measures Exact match cannot be found! Similarity/distance measured by
Quadratic-form distance Mahalanobis distance Minkowski-form distance Histogram intersection Kullback-Leibler divergence
Target application decides which distance measure is used
Indexing scheme Features typically have high
dimensionality Efficient indexing becomes a critical
performance issue Dimensionality reduction (e.g., PCA) R-tree, linear quad-trees, K-d-B-Tree, grid files
Performance Evaluation Precision
Precision is the fraction of retrieved images that are indeed relevant to the query
Recall Recall is the fraction of relevant images
returned by the query Trade-off between precision and recall
Recall tends to increase as # retrieved items increases; precision decreases
What is out there? Some Web Resources
http://labs.ideeinc.com/multicolr/ http://www.gazopa.com/ http://www.bing.com/images
Thank You