Google

Post on 28-Jan-2015

44 views 0 download

Tags:

description

 

Transcript of Google

- Pratik Jain

Google…

What Google Search Results look like…

• URL, size, date last crawled

• Cached link

• Pages like this one

Database Google Used

Approximate # of hits

Ads selected by Google based on you search

terms

Search terms are in bold

How Google

Works

Search is Mostly Invisible

user interface

Like an iceberg,2/3 below water

search functionalitycontent

What happens when you Google?

COMPONENTS

Googlebot, Google’s

Web Crawler

Google’s Query

Processor

Google’s Indexer

Googlebot…

• An automated program that accesses a web site, goes through the site following the links,sends data back.

• It consists of many computers.

• It can request thousands of different pages simultaneously

• A web crawling robot.

• A little spider scurrying across the strands of cyberspace.

Google’s Indexer…

• It processes crawled web pages into a database and then analyses them efficiently and quickly.

• It will look at the data on each page using various indexing techniques to

• Manipulate,

• Analyse,

• Understand the meaning,

• Mark the page up in a number of ways for storage in the database.

Simple Index Diagram

Google’s Query Processor…

A Query - a request for information from a search engine.

Google uses more than 200 signals, including the PageRank™* algorithm,

To examine the entire link structure of the web and determine which pages are most important. Google puts the most relevant

and reliable results first

By combining overall importance and query specific relevance

Runs on a distributed network of 1000’s of low-cost computers Thus carrys out fast parallel processing.

 Do you know where google actually runs ? 

Parallel processing-

• Simultaneous use of more than one processor core to execute a program or multiple computational threads.

• Speeds up data processing.

Thank you