Open Relevance Project
description
Transcript of Open Relevance Project
Lucid Imagination, Inc.
Open Relevance Project
In the beginning…
Lucene Connector Framework
SF Lucene Meetup
Lucid Imagination, Inc.
Agenda
Announcements and Introductions
Solr 1.4 and Data Discovery – Hoss
Tips and Tricks with Analyzers – Tom Hill
Optimizing Solr for performance and User Experience – Brian Doll
Spatial Search with Lucene and Solr - Grant
Lucid Imagination, Inc.
Introductions and Announcements
Intro
Haiti Fundraiser/Rafflehttp://forms.lucidimagination.com/go/lucidimagination/donate2haiti
Lucene Connector Framework in Incubatorhttp://incubator.apache.org/connectors
Spatial Search
Lucid Imagination, Inc.
Introduction
What is it?
Use Cases
Background Knowledge
Implementing
Status
Resources
Lucid Imagination, Inc.
What is It?
Spatial search is the merging of unstructured content with spatial information to produce search results specific to one or more locations
In other words“Find me all the Starbucks selling double espressos within 5 miles of my current location”
Lucene/Solr solve specific spatial problems, but doesn’t provide a general purpose GIS solution
Lucid Imagination, Inc.
Use Cases
Asset Management“Dude, where’s my stuff?”
Social NetworkingFind all friends near me
Targeted, local search results and ads“restaurants in Austin Texas”
“Starbucks, 55313”
Business IntelligenceRestrict doc set for analysis by location
Lucid Imagination, Inc.
Spatial Search Concepts
Spatial Data TypesPoints (latitude/longitude)
Lines
Shapes
Maps and overlaysStreets, POI
Integration with unstructured textMetadata, descriptions, user reviews, etc.
http://www.openstreetmap.org/?lat=44.9744&lon=-93.2484&zoom=14&layers=B000FTFT
Lucid Imagination, Inc.
Implementation Needs
Indexing support for data types
Query Parsing
Efficient distance calculationsEuclidean, Great Circle (Haversine), Vincenty’s
FilteringBounding Box
Sort by Distance
Relevance Enhancement
Faceting
Advanced: shape intersections, routes
Lucid Imagination, Inc.
Things you can do now in Lucene/Solr
Distance FunctionsSolr has: haversine (great circle), Euclidean, Manahattan (others)
FilterFrange, Cartesian Tier/Tile coming soon, Lat/Long Range
Boost Scorehttp://localhost:8983/solr/select/?q=_val_:"recip(dist(2, store, point(34.0232,-81.0664)),1,1,0)"&fl=*,score
SortSOLR-1297 added support for sorting by Function (any function)
Contrib/Spatial has similar support
Lucid Imagination, Inc.
What’s next/needed
Faceting
“Generic” Filtering support
Function Query results as fields in output
Query Parsing SupportGeocoding, etc.
Shapes, lines, intersections?
Lucid Imagination, Inc.
Resources
https://issues.apache.org/jira/browse/SOLR-773Several issues in Lucene for improvements
http://wiki.apache.org/solr/SpatialSearch
http://www.nsshutdown.com/projects/lucene/whitepaper/locallucene_v2.html
http://www.ibm.com/developerworks/opensource/library/j-spatial/index.html
http://www.lucidimagination.com/search/?q=spatial