Wikkey_Criteria_Based_Search_Engine
-
Upload
vignesh-santhanam -
Category
Internet
-
view
29 -
download
0
Transcript of Wikkey_Criteria_Based_Search_Engine
CRITERIA BASED
SEARCH ENGINE
Final Review
Project Guide
Mrs. Mini Prince
Batch Members
Dinesh Kumar. K (112711205013)
Sugumar. S (112711205048)
Vignesh. J (112711205053)
Vignesh. S (112711205054)
ABSTRACT
Search Engine is an entry point to the world of Internet.
Aim of this project is to provide web pages based on some
criteria.
Can save time of searching and will be more efficient.
Will filter the links based on the user information.
OBJECTIVES
To increase the efficiency in retrieving the relevant pages.
To stop giving extra preference for specific sites in the
search results.
To provide more types of links in each and every results
page.
To increase the filtering of web pages, criteria is added as
an index.
EXISTING SYSTEM
Retrieves web pages based on the search query of theuser and there is no criteria based retrieval of pages.
There is no filtering for the types of data.
Provides pages based on the Page Rank Algorithm.
Extra preferences are given for specific sites in most of thesearch engines.
PROPOSED SYSTEM
Each and every 10 search results may consist of all types ofdata such as web pages, pdf, videos, ppt, wikipedia and images.
Results get easily filtered by the user by clicking oncheckboxes for specific types of data.
The process of giving priority for specific sites is avoidedduring the search engine design.
Can change the criteria available as dropdown list in resultspage to get more accurate results.
SOFTWARE REQUIREMENT
Operating system : Windows Technology Used : HTML, PHP IDE : Dreamweaver Tools used : Xampp
HARDWARE REQUIREMENT
Processor : Intel Dual core Motherboard : Intel RAM : Min 2 GB Hard Disk : Min 500 GB
Dr(col.) Allam appa rao, Dr Amit Kumar, Monika Kumari, Pooja and Vinit Kumar Gunjan,
“Search engine optimization with Google” ,International Journal of Computer Science IssuesJanuary 2012, Vol. 9, Issue 1, No 3, P.Nos:206-214
Search engine optimization is a strategic technique to take a web
document in top search results of a search engine.
This work describes the tweaks of taking the page on top position
in Google by increasing the Page rank which may result in the
improved visibility and profitable deal for an organization.
Awodele Oludele, Kanu Richmond U, Kuyoro Shade O, and Okolie Samuel O,
“Trends in Web-Based Search Engine” ,Journal of Emerging Trends in Computing
and Information SciencesJune 2012 ,VOL. 3, No. 6, P.Nos:942-948
This paper gives an insight into the trend of web-based search
engine, diverse ways in which it works, and its future.
Disadvantage:
It has challenges in the fields of information retrieval and artificial
intelligence.
Managing huge amount of data is difficult.
Dr. S.K. Gupta, Rahul Mahajan and Mr. Rajeev Bedi,“Challenges and Design Issues in Search Engine
and Web Crawler” ,International Journal of Computational Engineering
Research (IJCER)June – 2014,Vol, 04 Issue, 6, P.Nos:42-44
Web Search engines are used to find specific information on the
World Wide Web.
Web crawling is the process used by search engines to collect
pages from the Web.
Disadvantage:
It cannot maintain the freshness of the database and the quality
of pages etc.
Karthick k and Sathiya V,“Detecting Nepotistic Links Based On Qualified
Link Analysis and Language Models” ,International Journal of Computer Trends and Technology
May/June 2013 , P.Nos:106-109
Spam is a problem in the search engines so to detect the spam
sites it uses two techniques.
In this paper, they present an efficient client spam detection
system based on a classifier that combines new link-based features
with Language Model (LM) based ones.
Andri Mirzal ,“ Design and Implementation of a Simple Web Search Engine” ,International Journal of Multimedia and Ubiquitous Engineering
January-2014,Vol. 7, No. 1, P.Nos:54-60
It is a simple web search engine for indexing and searching html
documents using python programming language.
Because python is well known for its simple syntax and strong
support for main operating systems.
Disadvantage:
Crawler has design issues.
MODULES
Module 1: Search engine design
Module 2: Information collection and crawling
Module 3: Indexing and filtering
MODULE 1: SEARCH ENGINE DESIGN
Index.html file is the homepage of the search engine.
Home page has a search box and a search button.
Search results page has a special combo box to change the
criteria and the results may vary with respect to the criteria
selected.
Also added checkboxes for the type of data such as pdf, ppt,
image and wiki.
Collected URL’s for websites, images, videos and wikipedia.
Written php code to crawl the pages from the website.
Can crawl any kind of websites from the internet.
The crawled sites are categorized by the criteria.
Each page will be indexed by the indexing pattern.
MODULE 2: INFORMATION COLLECTION & CRAWLING
MODULE 3: INDEXING & FILTERING
Two attributes are added to the database to select type ofdata and criteria.
Criteria is used as an indexing pattern.
Data retrieved as search results from the database when acriteria is selected.
When a checkbox is checked then only the type of dataselected is displayed.
REFERENCESDr(col.) Allam appa rao, Dr Amit Kumar, Monika Kumari, Pooja and Vinit Kumar Gunjan, “Search engine optimization with Google” ,International Journal of Computer Science Issues January 2012, Vol. 9, Issue 1, No 3. Andri Mirzal , “ Design and Implementation of a Simple Web Search Engine” ,International Journal of Multimedia and Ubiquitous Engineering January-2014,Vol. 7, No. 1.Awodele Oludele, Kanu Richmond U, Kuyoro Shade O and Okolie Samuel O,“Trends in Web-Based Search Engine” ,Journal of Emerging Trends in Computing and Information Sciences June 2012 ,VOL. 3, NO. 6.Dr. S.K. Gupta, Rahul Mahajan and Mr. Rajeev Bedi, “Challenges and Design Issues in Search Engine and Web Crawler” ,International Journal of Computational Engineering Research (IJCER) June – 2014,Vol, 04 Issue, 6. Karthick k and Sathiya V, “Detecting Nepotistic Links Based On Qualified Link Analysis and Language Models” , International Journal of Computer Trends and Technology May/June 2013.
www.makeuseof.com
www.sourceforge.net
www.wikipedia.org
www.wikihow.com