Internet Resources Discovery (IRD) Meta-Search Engines (MSEs)
-
date post
22-Dec-2015 -
Category
Documents
-
view
224 -
download
1
Transcript of Internet Resources Discovery (IRD) Meta-Search Engines (MSEs)
T.Sharon-A.Frank2
Contents• Meta Search Engine (MSE)• Why use several SEs?• Highlighted MSEs• Hebrew MSEs• MSE comparison• When to use MSE – pros and cons• How to choose MSE?
T.Sharon-A.Frank3
Search Engines Generations
• 1st Generation - Basic SEs:
• 2nd Generation - Meta SEs:
• 3rd Generation - Popularity SEs:
T.Sharon-A.Frank4
2nd Generation SEs - MetaSEs
• Using several SEs in parallel.• The results are filtered, ranked and
presented to the user as a uniformed list.• The ranking is a combination of the
number of sources each page appeared in, and the ranking in each source.
T.Sharon-A.Frank5
Meta SE is a Meta-Service
• It doesn’t use an Index/database of its own.
• It uses other external search services that provide the information necessary to fulfill user queries.
T.Sharon-A.Frank6
Meta Search Engine
MetaCrawler
Yahoo Web Crawler Open Text Lycos InfoSeek Inktomi Galaxy Excite
Google · Yahoo · Jeeves Ask About · LookSmart · OvertureFindWhat
T.Sharon-A.Frank7
Premises of a Meta SE
• No single search is sufficient.
• Problem in expressing the query.
• Low quality references can be detected.
T.Sharon-A.Frank9
Overlap between Google and Yahoo
Source: Jux2 analysis of 500 top search terms, April 2004
http://www.jux2.com/stats.php
T.Sharon-A.Frank12
MSE - Motivation
1. The number and variety of SEs.
2. Each SE provides an incomplete snapshot of Web.
3. Users are forced to try and retry their queries across different SEs.
4. Each SE has its own interface.
5. Irrelevant, outdated or unavailable responses.
6. Each query is independent.
7. No individual customization.8. The result is not homogenized.
T.Sharon-A.Frank13
Problems of MSEs
• No advanced search options.
• Using the lowest common denominator.
• Sponsored results from the SEs are not highlighted.
T.Sharon-A.Frank23
Vivisimo• Vivísimo supports the most advanced features
of the major search engines.• Need to just use Vivísimo syntax, which
follows the most standard conventions. • Vivísimo translates your query into the
corresponding syntax of each underlying search engine.
• Also, Vivísimo only queries the search engines that support your chosen syntax.
T.Sharon-A.Frank32
When to use a MSE?• When single Basic-SE fails to provide good
results.
• One-stop shopping - prefer to search multiple SEs/sites at once to get blended ranked results (so as to save effort/time).
• Searching for multi-faceted topics.
• Want to get clustered results to focus search on the relevant keywords.
• Looking for current events/news.
T.Sharon-A.Frank33
MSE pros
• Useful when you want to retrieve a relatively small number of relevant results.
• An excellent choice for obscure topics.
• A good option when you are not having luck finding what you want when you search.
• Appropriate when you want to get an overall picture of what is available on the Web on your topic.
T.Sharon-A.Frank34
MSE cons
• Use is limited primarily to simple queries.
• Little or no field searching is available.
• Most services return a limited number of results that do not represent the total results from any source engine.
• Sponsored results are not highlighted (even though probably not first).
T.Sharon-A.Frank35
How to Choose your MetaSE
• Search engines used
• Operators supported
• Special features
• Speed
• Presentation
T.Sharon-A.Frank37
Practical RecommendationsUse Ixquick for fast results and maximal
syntax flexibility.Use Vivisimo/Clusty (as a start) for
Clustering and/or Hebrew.Use Dogpile to include Google, date
range, or spelling corrections.Use none for non-MSE tasks
(see MSE cons)…
T.Sharon-A.Frank38
Bibliography• http://www.cs.washington.edu/homes/etzioni/papers/m
etacrawler.pdf• http://www.cs.washington.edu/homes/etzioni/papers/ie
ee-metacrawler.pdf• http://searchenginewatch.com/links/article.php/2156241
• http://vivisimo.com/advanced?form=Advanced• http://vivisimo.com/help.html• http://searchenginewatch.com/searchday/article.php/
2226841