Wikkey_Criteria_Based_Search_Engine

28
CRITERIA BASED SEARCH ENGINE Final Review Project Guide Mrs. Mini Prince Batch Members Dinesh Kumar. K (112711205013) Sugumar. S (112711205048) Vignesh. J (112711205053) Vignesh. S (112711205054)

Transcript of Wikkey_Criteria_Based_Search_Engine

CRITERIA BASED

SEARCH ENGINE

Final Review

Project Guide

Mrs. Mini Prince

Batch Members

Dinesh Kumar. K (112711205013)

Sugumar. S (112711205048)

Vignesh. J (112711205053)

Vignesh. S (112711205054)

ABSTRACT

Search Engine is an entry point to the world of Internet.

Aim of this project is to provide web pages based on some

criteria.

Can save time of searching and will be more efficient.

Will filter the links based on the user information.

OBJECTIVES

To increase the efficiency in retrieving the relevant pages.

To stop giving extra preference for specific sites in the

search results.

To provide more types of links in each and every results

page.

To increase the filtering of web pages, criteria is added as

an index.

EXISTING SYSTEM

Retrieves web pages based on the search query of theuser and there is no criteria based retrieval of pages.

There is no filtering for the types of data.

Provides pages based on the Page Rank Algorithm.

Extra preferences are given for specific sites in most of thesearch engines.

WORKING OF SEARCH ENGINE

PROPOSED SYSTEM

Each and every 10 search results may consist of all types ofdata such as web pages, pdf, videos, ppt, wikipedia and images.

Results get easily filtered by the user by clicking oncheckboxes for specific types of data.

The process of giving priority for specific sites is avoidedduring the search engine design.

Can change the criteria available as dropdown list in resultspage to get more accurate results.

SOFTWARE REQUIREMENT

Operating system : Windows Technology Used : HTML, PHP IDE : Dreamweaver Tools used : Xampp

HARDWARE REQUIREMENT

Processor : Intel Dual core Motherboard : Intel RAM : Min 2 GB Hard Disk : Min 500 GB

Literature Survey

Dr(col.) Allam appa rao, Dr Amit Kumar, Monika Kumari, Pooja and Vinit Kumar Gunjan,

“Search engine optimization with Google” ,International Journal of Computer Science IssuesJanuary 2012, Vol. 9, Issue 1, No 3, P.Nos:206-214

Search engine optimization is a strategic technique to take a web

document in top search results of a search engine.

This work describes the tweaks of taking the page on top position

in Google by increasing the Page rank which may result in the

improved visibility and profitable deal for an organization.

Awodele Oludele, Kanu Richmond U, Kuyoro Shade O, and Okolie Samuel O,

“Trends in Web-Based Search Engine” ,Journal of Emerging Trends in Computing

and Information SciencesJune 2012 ,VOL. 3, No. 6, P.Nos:942-948

This paper gives an insight into the trend of web-based search

engine, diverse ways in which it works, and its future.

Disadvantage:

It has challenges in the fields of information retrieval and artificial

intelligence.

Managing huge amount of data is difficult.

Dr. S.K. Gupta, Rahul Mahajan and Mr. Rajeev Bedi,“Challenges and Design Issues in Search Engine

and Web Crawler” ,International Journal of Computational Engineering

Research (IJCER)June – 2014,Vol, 04 Issue, 6, P.Nos:42-44

Web Search engines are used to find specific information on the

World Wide Web.

Web crawling is the process used by search engines to collect

pages from the Web.

Disadvantage:

It cannot maintain the freshness of the database and the quality

of pages etc.

Karthick k and Sathiya V,“Detecting Nepotistic Links Based On Qualified

Link Analysis and Language Models” ,International Journal of Computer Trends and Technology

May/June 2013 , P.Nos:106-109

Spam is a problem in the search engines so to detect the spam

sites it uses two techniques.

In this paper, they present an efficient client spam detection

system based on a classifier that combines new link-based features

with Language Model (LM) based ones.

Andri Mirzal ,“ Design and Implementation of a Simple Web Search Engine” ,International Journal of Multimedia and Ubiquitous Engineering

January-2014,Vol. 7, No. 1, P.Nos:54-60

It is a simple web search engine for indexing and searching html

documents using python programming language.

Because python is well known for its simple syntax and strong

support for main operating systems.

Disadvantage:

Crawler has design issues.

ARCHITECTURE DIAGRAM

MODULES

Module 1: Search engine design

Module 2: Information collection and crawling

Module 3: Indexing and filtering

MODULE 1: SEARCH ENGINE DESIGN

Index.html file is the homepage of the search engine.

Home page has a search box and a search button.

Search results page has a special combo box to change the

criteria and the results may vary with respect to the criteria

selected.

Also added checkboxes for the type of data such as pdf, ppt,

image and wiki.

USER INTERFACE

BEFORE CRITERIA SELECTION

AFTER CRITERIA SELECTION

Collected URL’s for websites, images, videos and wikipedia.

Written php code to crawl the pages from the website.

Can crawl any kind of websites from the internet.

The crawled sites are categorized by the criteria.

Each page will be indexed by the indexing pattern.

MODULE 2: INFORMATION COLLECTION & CRAWLING

CRAWLING PROGRAM

MODULE 3: INDEXING & FILTERING

Two attributes are added to the database to select type ofdata and criteria.

Criteria is used as an indexing pattern.

Data retrieved as search results from the database when acriteria is selected.

When a checkbox is checked then only the type of dataselected is displayed.

SEARCH RESULTS PAGE CODE

DATABASE pg:1

DATABASE pg:2

DATABASE pg:3

REFERENCESDr(col.) Allam appa rao, Dr Amit Kumar, Monika Kumari, Pooja and Vinit Kumar Gunjan, “Search engine optimization with Google” ,International Journal of Computer Science Issues January 2012, Vol. 9, Issue 1, No 3. Andri Mirzal , “ Design and Implementation of a Simple Web Search Engine” ,International Journal of Multimedia and Ubiquitous Engineering January-2014,Vol. 7, No. 1.Awodele Oludele, Kanu Richmond U, Kuyoro Shade O and Okolie Samuel O,“Trends in Web-Based Search Engine” ,Journal of Emerging Trends in Computing and Information Sciences June 2012 ,VOL. 3, NO. 6.Dr. S.K. Gupta, Rahul Mahajan and Mr. Rajeev Bedi, “Challenges and Design Issues in Search Engine and Web Crawler” ,International Journal of Computational Engineering Research (IJCER) June – 2014,Vol, 04 Issue, 6. Karthick k and Sathiya V, “Detecting Nepotistic Links Based On Qualified Link Analysis and Language Models” , International Journal of Computer Trends and Technology May/June 2013.

www.makeuseof.com

www.sourceforge.net

www.wikipedia.org

www.wikihow.com

Thank You