Search Engine with Intelligence - gaset-gbset.com · taking in consideration the advantages and...

50
11/17/2008 1 G.A.S.E.T G.A.S.E.T ™ ( G.E.T ( G.E.T - - Ca / Jo Ca / Jo - - Advance Search Enhancement Tool ) Advance Search Enhancement Tool ) Search Engine ... with Intelligence . Search Engine ... with Intelligence . Search Engine ... with Intelligence . Search Engine ... with Intelligence .

Transcript of Search Engine with Intelligence - gaset-gbset.com · taking in consideration the advantages and...

11/17/2008 1

G.A.S.E.TG.A.S.E.T ™ ( G.E.T ( G.E.T -- Ca / Jo Ca / Jo -- Advance Search Enhancement Tool ) Advance Search Enhancement Tool )

Search Engine ... with Intelligence .Search Engine ... with Intelligence .Search Engine ... with Intelligence .Search Engine ... with Intelligence .

11/17/2008 2

G.A.S.E.T G.A.S.E.T –– G G ™™

( G.E.T ( G.E.T -- Ca / Jo Ca / Jo -- Advance Search Enhancement Tool for Advance Search Enhancement Tool for Google™ ) ) **

* Temporary name : we are using Google™ search engine services for demonstrative and comparison purposes only .

Important Note :

We will use the name G.A.S.E.T G.A.S.E.T -- GG ™™ when we explain tech Issues , web services , functions and features of G.A.S.E.T G.A.S.E.T ™™ project which Is temporarily using GoogleGoogle™™

services – platform as a web pages source.

11/17/2008 3

COPYRIGHT NOTICE

Gharbeyah Establishment for Technology Feb 2006

Copyright of this project belongs to Gharbeyah Establishment for Technology Canada / Jordan .

Unauthorized copying , distribution or use of this report in Part or its entirety is prohibited .

11/17/2008 4

Important Legal NoticeImportant Legal Notice

� G.A.S.E.T – G ™ ( G.E.T - Ca / Jo Advance Search Enhancement Tool for Google™ ) ) is a temporary name suggested in principle by Ms Rose Hagan / the Google Inc legal – Trademark Department.

� We are using Google™ search engine platform for demonstrative , comparison and testing purposes only , we have no intention of selling G.A.S.E.T – G ™ commercially

� All Google ™ modified diagrams shown in this project will be used only as a G.A.S.E.T- G ™ capability and features demonstration tool , bending final approval by the Google ™ Inc legal department .

� GET-Jordan to the best of its knowledge is following carefully all laws and regulations stated by Google ™ Inc in regard the use of its searching services , and logos.

� Previous statement cover the all GET-Jordan commercial developments projects such as and not limited to GET-Canada / Jordan version of Google ™ GDS .

11/17/2008 5

Contact us !Contact us !

Website : Website : www.gasetwww.gaset--gbset.comgbset.com

Canada office:Canada office:

Mr. Gharbeyah Wael Mr. Gharbeyah Wael CEOCEO // waelwael@[email protected]

Mr. Gharbeyah Eyad Mr. Gharbeyah Eyad Tech VPTech VP

Ms .Farah Sawsan Ms .Farah Sawsan PR VPPR VP

Tel:Tel: 647 262 2893647 262 2893

Address:Address: 160 Cactus Ave. 160 Cactus Ave. # 26 Toronto, Ontario M2R 2V3 # 26 Toronto, Ontario M2R 2V3 Email: Email: [email protected]@yahoo.com / / gbset2003@[email protected]

Jordan office:Jordan office:

Mr. Gharbeyah Weam Mr. Gharbeyah Weam COO COO / / weam@[email protected]

Mr. Saeed Amer Mr. Saeed Amer Marketing ManagerMarketing Manager

Mr.Awad A Mr.Awad A PR ManagerPR Manager

Tel :Tel : 00962 79 673 564200962 79 673 5642 / / 00962 777 460 78200962 777 460 782

Address : Address : VELA # 28 suliman toqan st Amman JordanVELA # 28 suliman toqan st Amman Jordan

11/17/2008 6

TABLE OF CONTENTS / PagesTABLE OF CONTENTS / Pages

�� Introduction : 1 Introduction : 1 –– 88

�� The Search Engine 9 The Search Engine 9 –– 1616

�� Main Services 17 Main Services 17 –– 4040

�� The Comparison 41 The Comparison 41 -- 5151

11/17/2008 7

Part OnePart One

The Product !The Product !

11/17/2008 8

G.A.S.E.T G.A.S.E.T ™™ CapabilitiesCapabilities

Description of Main System ServicesDescription of Main System Services

We used photo slides from our web based Java version We used photo slides from our web based Java version

of the of the G.A.S.E.T- G ™ project on this presentation .project on this presentation .

11/17/2008 9

Very Important Notes .. !Very Important Notes .. !Very Important Notes .. !Very Important Notes .. !Very Important Notes .. !Very Important Notes .. !Very Important Notes .. !Very Important Notes .. !

�� We used Google services to complete the Beta code stage of our pWe used Google services to complete the Beta code stage of our product and to demonstrate our roduct and to demonstrate our technology advanced capabilities, G.A.S.E.T implementation as a technology advanced capabilities, G.A.S.E.T implementation as a desktop application ( in C # ) and desktop application ( in C # ) and web based ( in Java ) was used only to web based ( in Java ) was used only to illustrate its major functions and operations.illustrate its major functions and operations.

�� We don't think of our project as a replacement to standard keywoWe don't think of our project as a replacement to standard keyword search engines, rather an rd search engines, rather an opportunity for web surfers to try more progressive ( from a funopportunity for web surfers to try more progressive ( from a functionality prospect ) solution ctionality prospect ) solution which will satisfy their sophisticated web searching tasks .which will satisfy their sophisticated web searching tasks .

�� Project speed , search results quality and some of its main funcProject speed , search results quality and some of its main functions where downscaled due to tions where downscaled due to present present insufficient hardware capabilitiesinsufficient hardware capabilities and the currently imposed Googleand the currently imposed Google™™ ( our current ( our current source of web pages ) ranking , this eventually limited our systsource of web pages ) ranking , this eventually limited our system full system expansion.em full system expansion.**

�� 100+ PPT slides where produced as a general guidelines of our G.100+ PPT slides where produced as a general guidelines of our G.A.S.E.T main functions and A.S.E.T main functions and technologies, yet we think that it only technologies, yet we think that it only covered part of its potentialscovered part of its potentials. .

** We are certain that the uploading of our projects on more sWe are certain that the uploading of our projects on more suitable web servers ( after receiving uitable web servers ( after receiving the needed financial funds ) should eventually take care of thisthe needed financial funds ) should eventually take care of this problem. problem.

11/17/2008 10

Brief description of some of the features applied at the Brief description of some of the features applied at the

G.A.S.E.TG.A.S.E.T-- GG ™™ platform ( platform ( keyword vs. conceptual web surfingkeyword vs. conceptual web surfing ))

FromFrom

ToTo™™

11/17/2008 11

GoogleGoogle™™ Advanced Search Main FeaturesAdvanced Search Main Features

To :

11/17/2008 12

Push Advance Search Butto

n

G.A.S.E.TG.A.S.E.T-- GG ™™ Main Screen !Main Screen !

Main Functions

11/17/2008 13

G.A.S.E.T G.A.S.E.T -- GG ™™ Search Interface Main Features Search Interface Main Features

Scroll down

11/17/2008 14

G.A.S.E.T G.A.S.E.T -- GG ™™ Search Interface Main FeaturesSearch Interface Main Features-- Continue Continue --

11/17/2008 15

G.A.S.E.T G.A.S.E.T -- GG ™™ VS GoogleVS Google™™ Advance Search Advance Search ( User ( User Optional Optional Query Enhancements ) Query Enhancements ) -- 11

Google™ and G.B.S.E.T-G™ comparison :Capable of reforming Google™ Original query by applying Multiple Query Fields enhancements toGoogle™ interface for comparison purposes .

Temporary limitation of choices because ofInsufficient hardware resources ( it will be solved upon installing the needed servers ).

11/17/2008 16

G.A.S.E.T G.A.S.E.T -- GG ™™ / Google/ Google™™ Regular SearchRegular Search( User ( User OptionalOptional Query Enhancements ) Query Enhancements ) -- 22

We added linguistically/ web generated suggestions to provides the user

with the ability to choose between the following scroll down boxes :

1- Thesaurus which will replace the user query terms with other linguistically and

contextually corresponding possibilities .

2- Related Words which will add conceptually , linguistically and Syntactically related

terms to the user query .

3-Web Suggestion Terms which will add web () keywords Suggestions linguistically

and Conceptually extracted from the G.A.S.E.T – G / Google™ snippets , title , …etc

Temporary Timer

( inadequate server )

11/17/2008 17

G.A.S.E.T G.A.S.E.T -- GG ™™ / Google/ Google™™ Result Page Result Page -- 11( Google( Google™™ Regular Result Page )Regular Result Page )

The Google query changing suggestionswhich G.AS.E.T-G offer at the start page.

G.AS.E.T-G search link

11/17/2008 18

G.A.S.E.T G.A.S.E.T -- GG ™™ / Google/ Google™™ Result Page Result Page -- 11( ( G.A.S.E.T G.A.S.E.T -- G G ™™ Dynamic Result Page Dynamic Result Page –– A )A )

Query enrichment“ Web Formats “

Search results formats

Sample ofSample of G.A.S.E.T G.A.S.E.T -- GG ™™ results which was selected from limited pool of web pages results which was selected from limited pool of web pages

( due to the ( due to the lacking of suitable serverslacking of suitable servers ) , it show that ) , it show that main search enginesmain search engines internal internal

functionsfunctions are operative ( are operative ( crawling , parsing , indexing , crawling , parsing , indexing , limitedlimited ranking ranking …… EtcEtc ) )

11/17/2008 19

G.A.S.E.T G.A.S.E.T -- GG ™™ / Google/ Google™™ Result Page Result Page -- 11( ( G.A.S.E.T G.A.S.E.T -- G G ™™ Dynamic Result Page Dynamic Result Page –– B )B )

Continue with the rest of Continue with the rest of

G.A.S.E.T - G ™ ( not Google ( not Google ™™ ))

diverse search results ! diverse search results !

11/17/2008 20

Project Processes ( Main differences ) Project Processes ( Main differences ) Very important

G.A.S.E.TG.A.S.E.T--G G ™™ ( ( withwith Google Google ™™ dependency dependency ) :) :

�� Interacting with the web surfer.Interacting with the web surfer.

�� Query lingo and web Suggesting / Modification ,Query lingo and web Suggesting / Modification ,

extracted from results pageextracted from results page Google Google ™™ snippets.snippets.

�� Interacting with Interacting with Google Google ™™ ( http request / API ( http request / API

which is a slow and possibly illegal process )which is a slow and possibly illegal process )

�� Crawling extracted Crawling extracted Google Google ™™ results page URLsresults page URLs

( ( with conflicting with conflicting Google Google ™™ rank method rank method ))

�� Analyze , reAnalyze , re--rank and weight Received rank and weight Received Google Google ™™

web results web results --TVDTVD’’s ( s ( slow processslow process ))

…… etcetc

G.A.S.E.T G.A.S.E.T ™™ ( ( with its own serverswith its own servers ) :) :

�� Interacting with the web surfer.Interacting with the web surfer.

�� Query lingo and web Suggesting / Modification ,Query lingo and web Suggesting / Modification ,

generated from G.A.S.E.Tgenerated from G.A.S.E.T™™ web repository.web repository.

�� Interacting with G.A.S.E.TInteracting with G.A.S.E.T™™ server , a speedy server , a speedy

process with compatible and accurate results.process with compatible and accurate results.

�� Retrieving preRetrieving pre--analyzed and weighed TVDanalyzed and weighed TVD’’s.s.

… etc Faster , more accurate results and main functions ( Power ,QA .. etc ) compatible .

11/17/2008 21

G.A.S.E.T andG.A.S.E.T and other Searchother SearchBeasts ( Ex:Beasts ( Ex: GoogleGoogle™™ ))

Different philosophy ... Different solutions !Different philosophy ... Different solutions !Different philosophy ... Different solutions !Different philosophy ... Different solutions !

11/17/2008 22

Comparison Between Comparison Between GoogleGoogle™™ & & G.A.S.E.T G.A.S.E.T ™™ Main Main Features Features

The comparison The comparison in Featuresin Features not capacitynot capacity covers covers somesome examples only .examples only .

�� We claim that in our journey to seek the most comprehensive soluWe claim that in our journey to seek the most comprehensive solution for web surfing we analyzed tion for web surfing we analyzed hundreds of specialized web search tools , ordinary search enginhundreds of specialized web search tools , ordinary search engines tools and technically related es tools and technically related documents , and we are sure that we developed system capable to documents , and we are sure that we developed system capable to compete ( compete ( in the fields of in the fields of functionality , technical superiority and diversityfunctionality , technical superiority and diversity ) with currently available commercial products ) with currently available commercial products and academically supervised projects and academically supervised projects …… please proof us wrong by contacting us with your please proof us wrong by contacting us with your comments and suggestions .comments and suggestions .

�� We are comparing the practicality of our project functions to thWe are comparing the practicality of our project functions to the main Google e main Google ™™ operations , operations , taking in consideration the advantages and disadvantages of metataking in consideration the advantages and disadvantages of metasearching Google searching Google ™™ web web services , we think that the services , we think that the overall pictureoverall picture will be taken in consideration when we arrange for our will be taken in consideration when we arrange for our project installation on a practical hardware baseproject installation on a practical hardware base with reasonable connectivity speed .with reasonable connectivity speed .

�� We are using materials from the We are using materials from the ““GoogleGoogle™™ GuideGuide”” document published by Nancy Blachman which document published by Nancy Blachman which explains some of the benefits and pitfalls of Google explains some of the benefits and pitfalls of Google ™™ search engine, we quoted it as a mean of search engine, we quoted it as a mean of comparison ( in technology and functionality ) between our G.A.Scomparison ( in technology and functionality ) between our G.A.S.E.T .E.T ™™ system and the original system and the original GoogleGoogle™™ search engine and other plugsearch engine and other plug--ins which utilize the Googleins which utilize the Google™™ search capabilities. We search capabilities. We consider our project consider our project as a complementas a complement to current keywords based standard search engines.to current keywords based standard search engines.

11/17/2008 23

G.A.S.E.T G.A.S.E.T -- GG ™™ / Google/ Google™™ Features Features ““ Not Capacity Not Capacity ““ ComparisonComparison

( Examples # 1 )( Examples # 1 )

• Our system has the capability to search not only in limited sites but the entire web.• Price rang function used by our system has superb technology which uses the regular expressions techniques for accurate results.• The system use similar functions found in our G.B.S.E.T B2B search project which is able to conclude financial transactions and track shipments in a secure environment.

Limited capability .

Not functional

11/17/2008 24

G.A.S.E.T G.A.S.E.T -- GG ™™ / Google/ Google™™ Features Features ““ Not Capacity Not Capacity ““ ComparisonComparison

( Examples # 2 )( Examples # 2 )

Our previous comments regarding the Frooglelimitations apply to theGoogle™ Catalogs function.

We are currently using our own technology which willbe able to build customized directories of consumer retailers catalogs With customizable and enhanced Features

11/17/2008 25

G.A.S.E.T G.A.S.E.T -- GG ™™ / Google/ Google™™ Features Features ““ Not Capacity Not Capacity ““ ComparisonComparison

( Examples # 3 )( Examples # 3 )

By the end of 2010 our firm willexpand the Google™ Directory by 10 X it current size , usingour sophisticated contextually and semantically motivated crawling , parsing technology .

Our conceptual Power Searchfunction is capable of retrieving very accurate results which willcomplement the traditional directory based web searching.

11/17/2008 26

G.A.S.E.T G.A.S.E.T -- GG ™™ / Google/ Google™™ Features Features ““ Not Capacity Not Capacity ““ ComparisonComparison

( Examples # 4 )( Examples # 4 )

The complete process was fully integrated and automated by oursystem which will enable the user to concentrate more on the main Issues.

Our system will take any form of query even ina question form ( using our QA operation ) andanalyze its content – concept to get matched results , based on the conditions required bythe user query .

G.A.S.E.T (QA) ? No problem .

NLP TECH

11/17/2008 27

G.A.S.E.T G.A.S.E.T -- GG ™™ / Google/ Google™™ Features Features ““ Not Capacity Not Capacity ““ ComparisonComparison

( Examples # 5 )( Examples # 5 )

Our system will recognize the need for the query extra words ( documentation , listing ) and it will analyze its position , sentence / context & otherrelations , results will be effected because of Its presence.

Web related word suggestions and the Power /Topic Search functions will be able to handle the general / none precious query.

G.A.S.E.TG.A.S.E.T--GG™™ (NLP)? No problem .

The Solution.

11/17/2008 28

G.A.S.E.T G.A.S.E.T -- GG ™™ / Google/ Google™™ Features Features ““ Not Capacity Not Capacity ““ ComparisonComparison

( Examples # 6 )( Examples # 6 )

Vocabulary correction will be handled at the original query interface ( VASE ) which will save time and help retrieve more exact results.

Using the Topic / Power Search will overcome suchproblem ( that’s one of the main ideas behind ourtechnology ) , it will use a conceptually harmonizedthesaurus / related words combinations whenever its appropriate to find the coherent topic / result.

NLPTool

11/17/2008 29

G.A.S.E.T G.A.S.E.T -- GG ™™ / Google/ Google™™ Features Features ““ Not Capacity Not Capacity ““ ComparisonComparison

( Examples # 7 )( Examples # 7 )

Please check our Multimedia Search functionwith its better-quality features , all forms ofknowledge on the web are unique , yet it willalways be in need to be recognized not only by it is html format , but also buy its initiative.

Current technologies used by Google™ to search for photos are incapable of looking for photos , videos , music… etc that might be associated with the concept of the search.

11/17/2008 30

Examples of Third Party GoogleExamples of Third Party Google™™ PlugPlug--Ins Software's Ins Software's -- 11

Limited capabilities

11/17/2008 31

Examples of Third Party GoogleExamples of Third Party Google™™ PlugPlug--Ins Software's Ins Software's -- 22

Simple enhancements to the Google™ interface

Its hard to believe that the previous examples, which fairly represent the available third party Google™plug-Ins software, have such low technical capabilities and immature features.

The search engines market is in great need for innovative web surfing technologies which use some ofthe theoretically tested formulas and implement newly developed techniques … like G.A.S.E.T G.A.S.E.T -- G G ™™..

11/17/2008 32

G.A.S.E.T G.A.S.E.T ™™

Searching the web with intelligence .Searching the web with intelligence .

The Problem The Problem …… and the Solution !and the Solution !

11/17/2008 33

G.A.S.E.T G.A.S.E.T -- GG ™™ TopicTopic , , PowerPower , , QA QA andand MultimediaMultimedia Search Search -- 11

�� Topic SearchTopic Search ::

Used to locate web pages with topics and concepts whUsed to locate web pages with topics and concepts which match the web surfer query. In our ich match the web surfer query. In our technology we use a pioneering methods to technology we use a pioneering methods to decipher web page framework and conceptdecipher web page framework and concept either either in parts or as a whole. It will also check multiple facets of thin parts or as a whole. It will also check multiple facets of the page in regard to it is relevance e page in regard to it is relevance to already stored, analyzed and confirmed to already stored, analyzed and confirmed concept containersconcept containers which our system use for which our system use for evaluation, ranking and dynamically evaluation, ranking and dynamically rere--modifyingmodifying of our web repositoriesof our web repositories ..

This means that a topic search such as This means that a topic search such as ““car manufacturingcar manufacturing”” will find pages which will find pages which conceptually conceptually matchmatch the query with the query with sentences and wordssentences and words such as, machinery, raw material, tools, sales, such as, machinery, raw material, tools, sales, testing ... Etc in testing ... Etc in a harmonized linguistic forma harmonized linguistic form. The results and pages containing them will be . The results and pages containing them will be more related to the topic more related to the topic than calculating the number of timesthan calculating the number of times the query words the query words ““carcar”” and and ““ManufacturingManufacturing”” are repeated on the web pages with disregard to its semantic reare repeated on the web pages with disregard to its semantic relation , which lation , which is a practice followed by all of the main current search engineis a practice followed by all of the main current search engines.s.

We used complicated logarithmic We used complicated logarithmic -- technology to enable our system to technology to enable our system to constantly self modified constantly self modified and configured its own rulesand configured its own rules in an attempt to keep the unique linguistic forms, dialogue typin an attempt to keep the unique linguistic forms, dialogue types es and cultural lingo differences in prospect , and cultural lingo differences in prospect , with autonomous topic structure updating.with autonomous topic structure updating.

11/17/2008 34

G.A.S.E.T G.A.S.E.T -- GG ™™ TopicTopic , , PowerPower , , QA QA andand MultimediaMultimedia Search Search -- 22

�� Power Search :Power Search :

The main idea behind Power Search is to assist the wThe main idea behind Power Search is to assist the web surfer in anticipating eb surfer in anticipating multi level web multi level web searching taskssearching tasks. This will be done by using the web as a base for our self modi. This will be done by using the web as a base for our self modified and fied and customizable recustomizable re--learner agent. With its learner agent. With its auto query adjustments and multistage searchingauto query adjustments and multistage searching, and , and its its redefined / readjustedredefined / readjusted web repository, will lead to the adaptation of conceptual base web repository, will lead to the adaptation of conceptual base construction by reconstruction by re--implementing its newly acquired conceptual implementing its newly acquired conceptual –– contextual knowledge contextual knowledge foundation. Such technology is in place to assist in identifyingfoundation. Such technology is in place to assist in identifying the the web pages identitiesweb pages identities, , semantic and perception relations with other web pages/ sitessemantic and perception relations with other web pages/ sites in away that take into account in away that take into account interchangeable compatibility in regard to serving web surfer neinterchangeable compatibility in regard to serving web surfer need to have complex searching ed to have complex searching process, compatible with the density levels of his query. process, compatible with the density levels of his query.

Our innovation will enable our system to understandOur innovation will enable our system to understand a query, such as a query, such as ““car manufacturingcar manufacturing””, , as as a task which need to be achieveda task which need to be achieved. It will then find resources, solutions, technologies . It will then find resources, solutions, technologies …… etc etc related to the specific query, organize it in one block of info,related to the specific query, organize it in one block of info, along with other related links along with other related links disregard of its initial location on the web.disregard of its initial location on the web.

11/17/2008 35

G.A.S.E.T G.A.S.E.T -- GG ™™ TopicTopic , , PowerPower , , QA QA andand MultimediaMultimedia Search Search -- 33

�� QA ( Question Answering ):QA ( Question Answering ):

Considered by our staff to be the project Considered by our staff to be the project ““GemGem””, which is capable of analyzing, based on a , which is capable of analyzing, based on a strong dynamic and specialized linguistic knowledge base, the usstrong dynamic and specialized linguistic knowledge base, the userer’’s complex and multis complex and multi--facets facets question then question then locating, generating and customizinglocating, generating and customizing the proper answers.the proper answers.

Such technology employs hand crafted and web extractSuch technology employs hand crafted and web extracted linguistic rules ed linguistic rules –– patterns which are patterns which are capable of virtually realizing the capable of virtually realizing the question conceptquestion concept and its contextual relations to other parts of and its contextual relations to other parts of its linguistic block. It will then its linguistic block. It will then find or customizefind or customize a sentence or a paragraph which meets the a sentence or a paragraph which meets the answer criteria in a answer criteria in a logical manorlogical manor (optional corresponding thesaurus, expected forms of (optional corresponding thesaurus, expected forms of answersanswers…… etc). It will also consider the question as a form of a requestetc). It will also consider the question as a form of a request which might be in need which might be in need for complementing addfor complementing add--ons. Such task will be done by anticipating the searcher ons. Such task will be done by anticipating the searcher question question categorycategory based on foreseeing related questions based on foreseeing related questions –– answers results compiled from our answers results compiled from our ““QAQA””dynamic and dynamic and specialized knowledge database and web repositories. specialized knowledge database and web repositories.

This is done by engaging the user in some sort of a This is done by engaging the user in some sort of a dialogue, which will be helpful in narrowing dialogue, which will be helpful in narrowing the results ( such technology will be employed upon the the results ( such technology will be employed upon the full integration of the needed web full integration of the needed web resourcesresources into our system into our system –– a huge hardware dependency task which we don't possess now).a huge hardware dependency task which we don't possess now).

11/17/2008 36

G.A.S.E.T G.A.S.E.T -- GG ™™ TopicTopic , , PowerPower , , QA QA andand MultimediaMultimedia Search Search -- 44

�� Multimedia Search :Multimedia Search :

The future of web surfing will be determined by the The future of web surfing will be determined by the need for strong need for strong analytical application analytical application capable of tagging and groupingcapable of tagging and grouping massive amounts of multimedia resources which use the massive amounts of multimedia resources which use the internet as a storage facility, our internet as a storage facility, our rankingranking technique will be the other decisive factor, it will be technique will be the other decisive factor, it will be based on based on material conceptmaterial concept from its associated information (not only titles and other htmlfrom its associated information (not only titles and other html tags). tags). Users will also require accessing information in the form of criUsers will also require accessing information in the form of critique and opinionstique and opinions……etc, etc, available on other locations, that are conceptually available on other locations, that are conceptually related to the subjectrelated to the subject they are searching. they are searching. The market demands such service and we have created an applicatThe market demands such service and we have created an application to satisfy such needion to satisfy such need

Our Our multimulti--faceted faceted NLPNLP -- contextual parser contextual parser will set the stage for the needed item classifications will set the stage for the needed item classifications which will be used to search specialized multimedia web resourcewhich will be used to search specialized multimedia web resources such as photos, books, s such as photos, books, papers, videos, audios, software papers, videos, audios, software …… etc with an advance customized etc with an advance customized matching technologymatching technology. .

We regard the We regard the web resources as oursweb resources as ours to use with no need to categorize and restore it in to use with no need to categorize and restore it in specialized web repositories (current method like: Google videospecialized web repositories (current method like: Google video ... etc). We have succeeded in ... etc). We have succeeded in creating special techniques used to creating special techniques used to tag such multimediatag such multimedia sources, located in it is original sources, located in it is original location, to be retrieved and location, to be retrieved and filtered for any irregularities.filtered for any irregularities.

11/17/2008 37

G.A.S.E.T G.A.S.E.T -- GG ™™ TopicTopic , , PowerPower , , QA QA andand MultimediaMultimedia Search Search -- 55

Samples of currently implemented technologies : Samples of currently implemented technologies :

�� Automating the whole searching process by applyingAutomating the whole searching process by applying the main conceptthe main concept of the search over the of the search over the web through the thorough implementation of our various complicatweb through the thorough implementation of our various complicated algorithms which will ed algorithms which will enable our user to use combined G.A.S.E.T functions at the same enable our user to use combined G.A.S.E.T functions at the same time like time like …… QA and Multimedia .QA and Multimedia .

�� Enhance our version of the indexed web repository with the needEnhance our version of the indexed web repository with the needed ed linguistically harmonized linguistically harmonized termsterms in a verities of combinations using our highly integrated experin a verities of combinations using our highly integrated expert system , then t system , then ReRe--rankrankthe results according to its web conceptual matching and linguisthe results according to its web conceptual matching and linguistic credibility using our tic credibility using our developed NLP techniques and parameters. developed NLP techniques and parameters.

�� Filtering and indexing the received web pages to Filtering and indexing the received web pages to determine its conceptdetermine its concept by analyzing the by analyzing the adjourned set of terms , page concept , unified information blocadjourned set of terms , page concept , unified information blocks ( UBI ) and implementing the ks ( UBI ) and implementing the ReRe--learning process and techniques to compare generated concepts .learning process and techniques to compare generated concepts .

�� The web surfer The web surfer behaviour behaviour –– choiceschoices will be an essential part of our ranking , results analyzing will be an essential part of our ranking , results analyzing and and logarithm autologarithm auto--modifyingmodifying , it is the soul of the web ., it is the soul of the web .

11/17/2008 38

Separate Boxes forTitle , URL …etc

Linguistically Suggested Terms

Web SuggestedTerms

G.A.S.E.T G.A.S.E.T -- GG ™™ TopicTopic , , PowerPower , , QA QA andand MultimediaMultimedia Search Search -- 66Main Interface Diagram ( A )Main Interface Diagram ( A )

Full / Fast Search

11/17/2008 39

G.A.S.E.T G.A.S.E.T -- GG ™™ TopicTopic , , PowerPower , , QA QA andand MultimediaMultimedia Search Search -- 7 7 Main Interface Diagram ( B )Main Interface Diagram ( B )

Web and URL links Power Searching

Search Upgrade and Customizations

G.A.S.E.T-G™ LegalDisclaimers

www.abc.com

Suggestions from Web Directories

11/17/2008 40

G.A.S.E.T G.A.S.E.T -- GG ™™ TopicTopic , , PowerPower , , QA QA andand MultimediaMultimedia Search Search -- 8 8 GDM Concept SuggestionGDM Concept Suggestion

We Added the We Added the ““ Related Directory Maps Related Directory Maps ”” option ( similar to option ( similar to GoogleGoogle™™ Related categoriesRelated categories ), which will ), which will suggest choices derived from the suggest choices derived from the G.A.S.E.TG.A.S.E.T-- G G ™™ directory mapsdirectory maps ( ( GDMGDM ) / dynamic web bases through ) / dynamic web bases through XML/SOAP services , the tool user could either use some of the wXML/SOAP services , the tool user could either use some of the words shown as an enhancement for the ords shown as an enhancement for the original query or he/she could link to it directly , obviously original query or he/she could link to it directly , obviously we did apply our concept / context relation we did apply our concept / context relation acquiring technology to give better matching results.acquiring technology to give better matching results.

G.A.S.E.T G.A.S.E.T -- G G ™™Web Repository

Ontological / Lingo

Parsing Technique Context / ConceptParsing Technique

11/17/2008 41

G.A.S.E.T G.A.S.E.T -- GG ™™ TopicTopic , , PowerPower , , QA QA andand MultimediaMultimedia Search Search -- 9 9

Replacement Replacement -- Enhancement of GoogleEnhancement of Google™™ ““ Similar Page Similar Page ““

www . abs . com

�� Radical improvements to the Radical improvements to the ““Similar PageSimilar Page”” Function which uses the Function which uses the ““Multiple ConceptsMultiple Concepts””

tool to search for pages which harmonize with the targeted one ttool to search for pages which harmonize with the targeted one taking in consideration aking in consideration the page the page

structure and linguistic scopestructure and linguistic scope . Advance filtering system will guarantee the best conceptual . Advance filtering system will guarantee the best conceptual

matching of resulted pages ( same function is available on the matching of resulted pages ( same function is available on the ““ find results find results ““ page ) .page ) .

11/17/2008 42

G.A.S.E.T G.A.S.E.T -- GG ™™ TopicTopic , , PowerPower , , QA QA andand MultimediaMultimedia Search Search –– 1010

Replacement Replacement -- EnhancementEnhancement of Googleof Google™™ ““ SearchSearch withinwithin Results Results ””

We have succeeded in improving the GoogleWe have succeeded in improving the Google™™ ““SearchSearch withinwithin resultsresults”” task which was initially task which was initially

done done by adding extra query terms chosenby adding extra query terms chosen by the user by the user to the original search query ?? !!!,to the original search query ?? !!!, ( with ( with

GoogleGoogle™™ limitation of maximum 10 query wordslimitation of maximum 10 query words ) , our approach is by using our ) , our approach is by using our prepre--analyzed analyzed

web pagesweb pages and its and its concept barrelsconcept barrels as a comparison base for the new search ( similar to the as a comparison base for the new search ( similar to the

Power Search & Similar pages functions ) , we recognize that Power Search & Similar pages functions ) , we recognize that ““ Search within resultsSearch within results”” imply imply

that the user think that this page is an excellent candidate that the user think that this page is an excellent candidate ( based on the results page info )( based on the results page info )

for new for new extensive searchextensive search (Something Google(Something Google™™ was not successful in achieving ). was not successful in achieving ).

Jordan river

11/17/2008 43

G.A.S.E.T G.A.S.E.T -- GG ™™ TopicTopic , , PowerPower , , QA QA andand MultimediaMultimedia Search Search –– 1111

G.A.S.E.T G.A.S.E.T ™™ Replacement Replacement -- EnhancementsEnhancements ofof GoogleGoogle™™ ““ Search Results Page Search Results Page ”” -- AA

�� Improvements to regular search engines results page were introdImprovements to regular search engines results page were introduced in order to give the uced in order to give the user an option to chose between receiving our version the of theuser an option to chose between receiving our version the of the results page results page snippetssnippets or :or :

1.1. Related page Related page KeywordsKeywords list ( web page most important related words ). list ( web page most important related words ).

2.2. Complete related Complete related ParagraphsParagraphs (web page paragraph/s matching the query concept ).(web page paragraph/s matching the query concept ).

3.3. DownloadsDownloads ( list the web page downloadable parts / used with multimedia sp( list the web page downloadable parts / used with multimedia specialized search ).ecialized search ).

�� We used our friendly We used our friendly ““scroll down windowscroll down window”” which will keep the search results page which will keep the search results page proportionate while giving searchers the ability to viewproportionate while giving searchers the ability to view page conceptpage concept without the need to open without the need to open the actual web pages.the actual web pages.

�� In the near future our UBO technique In the near future our UBO technique ““ User Behaviour User Behaviour -- results analysing results analysing --Observation Observation ““ will will be fully implemented , this will save more time by classifying wbe fully implemented , this will save more time by classifying web surfer results page analyzing eb surfer results page analyzing patterns and his/her subjects of interests . patterns and his/her subjects of interests .

11/17/2008 44

G.A.S.E.T - G ™ TopicTopic , , PowerPower , , QA QA andand MultimediaMultimedia Search Search –– 1212

G.A.S.E.T G.A.S.E.T ™™ Replacement Replacement -- EnhancementsEnhancements ofof GoogleGoogle™™ ““ Search Results Page Search Results Page ”” -- BB

G.A.S.E.T – G ™™ Search Results Toolbar

Google™ Search

Four Types of Page Results Summary :Snippet , Paragraph , Keywords and Downloads

( with scroll down option to showMore info without leaving the page )

Similar pages access

Important : This is the results page ( with its own ranking ) from our Java web based version of G.A.S.E.T - G ™ project

11/17/2008 45

G.A.S.E.T G.A.S.E.T -- GG ™™ TopicTopic , , PowerPower , , QA QA andand MultimediaMultimedia Search Search –– 1313

G.A.S.E.T G.A.S.E.T ™™ Replacement Replacement -- EnhancementsEnhancements ofof GoogleGoogle™™ ““ Search Results Page Search Results Page ”” -- CC

Google™

Results Page

G.A.S.E.T-G™Search Toolbar

11/17/2008 46

G.A.S.E.T G.A.S.E.T -- GG ™™ TopicTopic , , PowerPower , , QA QA andand MultimediaMultimedia Search Search –– 1414

G.A.S.E.T G.A.S.E.T ™™ Replacement Replacement -- Enhancement of GoogleEnhancement of Google™™ ““ local search local search ””

�� Our system has resolved the location search ( searching in a desOur system has resolved the location search ( searching in a designated geographical areas ) ignated geographical areas )

by extracting the needed site location information from the web by extracting the needed site location information from the web site, using our own site, using our own customized customized

Expert System / Regular Expression enabled analyzing system Expert System / Regular Expression enabled analyzing system , it is a known fact that Google, it is a known fact that Google™™

approach to search in countries domains are short of getting to approach to search in countries domains are short of getting to target simply because most target simply because most

web site owners use the common domains like web site owners use the common domains like .com.com , disregard of their actual , disregard of their actual geographical geographical

locationlocation ..

Page Contact Info

Geographical / location Information Analyzed by Specialized Technology !

Page IP

11/17/2008 47

G.A.S.E.T G.A.S.E.T -- GG ™™ TopicTopic , , PowerPower , , QA QA andand MultimediaMultimedia Search Search –– 1515

G.A.S.E.T G.A.S.E.T -- G G ™™ Specialized Multimedia Search Tools Specialized Multimedia Search Tools

ConceptWWW Exact

• Regular web searching : general results with limited classifications / parsing

capabilities .

• Conceptual parsing : examine incoming documents stream to determine the

incremental relevance of the page to the query concept.

• Adaptive – refined filters ( classifiers ) : rigorous checking for the availability of

the uniqueness characteristics / format of the service ( video , audio ..etc ) .

11/17/2008 48

G.A.S.E.T G.A.S.E.T -- GG ™™ TopicTopic , , PowerPower , , QA QA andand MultimediaMultimedia Search Search –– 1616

G.A.S.E.T G.A.S.E.T -- G G ™™ QAQA Tool Tool

ConceptWWW

• Parse the web linguistically : analyze its contents to extract possible formats of questions ( with enhanced framework ) and their potential matching answers .

• QA Conceptual parsing : examine incoming documents stream to determine theincremental relevance of the page lingo blocks to possible question concept.

• QA Adaptive – refined filters ( classifiers ) : rigorous checking for the uniqueness characteristics / format of the question type answer context and complexity .

Raw Data

Tagged Data

Exact and PerceptionalMatching Answers .

11/17/2008 49

G.A.S.E.T G.A.S.E.T -- GG ™™ TopicTopic , , PowerPower , , QA QA andand MultimediaMultimedia Search Search –– 1717

G.A.S.E.T G.A.S.E.T -- G G ™™ Specialized Search Tools Specialized Search Tools ( ( Partial ListPartial List ))

• We added an improvement to Google’s “Book Search” which search The web for downloadable web booksThe web for downloadable web books( confirmed 200,000 + ) , we also added the “ Dissertations / Thesis or Papers” function as an upgrade to the “ Google™ Scholar ” which helps the user identify the matching paper structure ( Title , IntroductionContents , Chapter … etc ) and the searcher query terms concept .

• We also included special Video Format search As an example of the choices the user will have on the results page , he/she will have the option to use other interactive options ( QA , Power .. Etc ) instantly .

Asia tourism

Asia culture

11/17/2008 50

Thank you for your time !Thank you for your time !Thank you for your time !Thank you for your time !

We appreciate any queries or comments !

For more details please contact :

Weam Gharbeyah , [email protected] / [email protected]