The Future of Internet Search

79
The Future Of Internet Search Chris Sherman (@cjsherman) Founding Editor

Transcript of The Future of Internet Search

Web Search Update 2007

The Future Of Internet SearchChris Sherman (@cjsherman)Founding Editor

1

What Your Mom Thinks You Do

2

What You Really Do

3

To StartLets look back at search history, and see where the strikes came from4

Search In 19985

Googles Original Name

6

PerspectiveWeve entered the age of big dataThis is having an impact both on how we search, and also, perhaps more importantly, what we end up seeing as resultsNet, a good thing, but ironically may now subject us to information underload7

Its More Than Rocket Science NowIt takes about the same amount of computing to answer one Google search query as all the computing done - in flight and on the ground for the entire Apollo space programNot just the actual flights, but all the computing done throughout the planning and execution of the 11-year, 17 mission Apollo program- Udi Manber and Peter Norvig8

Google Data Centers: Then

Google Data Centers: Now

10

The Big TrendsGoogle continues to improveBing continues to gain (slight) share, but is still far from a threat to Big GSocial and big data are now paramount in searchThus, beware the filter bubbleAnd, maybe, a promising new beginning for Yahoo

11

No QuestionGoogle is getting smarter But has it lost its mind?Google+ integrated into everythingGoogle- : more than 75 product eliminations over the past 2 yearsNet: CEO Larry Page is reshaping Google from something we know and love -into something we cant know and may not love

And Search Is ChangingSearch isnt just about finding documents any moreSearch has evolved into an interactive process that helps:Answer a questionSolve a problemAccomplish a task13

The Major ChangesSmall, but helpfulChanges to search result pagesAuthor profilesHuge, and maybe (or maybe not) helpfulSocial now woven deeply into all things GoogleA knowledge graph that seeks to make Google truly intelligent

14

How Search WorksCrawling the webCreating an indexApplying algorithms to organize indexed contentUsing signals to surface the most probable results for your query15

Ranking SignalsLinks from other, authoritative sourcesSearch Engine Optimization (SEO)Your web history & locationMatching keywords in the title, URL & body of a web pageEmphasis: Lists, italics or boldface, headers, etc.400+ more16

Search Engines Are Pruning ResultsThe big data problemBased on game theory (think chess), pruning seeks to eliminate rather than expand choicesWhile this can lead to really good results (depending on the query), it also necessarily excludes millions (or even billions) of potential results17

For Perspective

18

Some ConsiderationsIf the web is made up of more than 60 trillion pages, that means that 99.999999% is effectively invisible, even if included in Googles indexGoogle cannot crawl and index some things that are accessible online:Some Facebook/Twitter dataMost database powered web sites19

Social: The ImplicationsGoogle is putting emphasis on trust and identityThis is a new type of ranking signal, and one that will grow in strength over timeIts already having a major impact on both search results and ads - Up to 20% of results for some queries may be social results20

PersonalizationGoogle now personalizes all search results, whether youre logged in or notIn addition, at any given moment there are 50-200 different versions of the core algorithm out in the wildConclusion: There are no standard Google search results any more21

Conflicting Signals?Google+ and personalization are creating filter bubbles for all of usThis is changing/confounding the whole notion of authorityAt the same time, Google is aggressively developing a knowledge graph which seeks to have definitive information for everything 22

The Filter BubbleCoined by Eli Pariser, and a worthy read:

23

What Is The Filter Bubble?An algorithmselectively guesses what information a user would like to see, effectively separating them from information that disagrees with his or her viewpointsExamples?Google's personalized search resultsFacebook's personalized news feed24

How To Pop Googles BubbleLog out of all instances of Google accounts (including Gmail, Google+, etc.)Deliberately use different terms than those suggested by Google as you typeUse a different search engine25

What Is The Knowledge Graph?A seriously cool approach to taking search to the next levelEssentially, its a map that does two fundamental things:It attempts to disambiguate multiple meanings of wordsIt also connects the relationships between entities 26

Disambiguation27

Connecting Entities28

The Knowledge Graph Is True AIGoogle has taught its systems about things, in multiple waysWeb search taught Google about real world relationshipsBook search taught Google about language and how its usedThe knowledge graph is correlating these and other learnings29

Knowledge Graph Results

30

Surfacing Related Links

31

Carousel Results

32

The Knowledge Graph Vs. AdsAmazingly, knowledge graph results in some cases displace ads on the right side of the screenNonetheless, Googles ad revenue has actually increased since its introductionDarwinian conclusion: More relevant web results make surviving ads more compelling and/or useful33

Important: Author ProfilesAs part of its identity push, Google now identifies authors in resultsDone primarily via HTML markup, but also associations with Google Profile pages and other trustworthy sites on the web (Major Media sites, Wikipedia, LinkedIn, etc)Benefit: Increased visibility of authoritative authors in search results

34

Another Key ShiftRelevance used to be determined by web authority essentially authoritative web pages from trusted publishersToday, social proof is equally if not more important35

What Is Social Proof?The original version: Click popularity (observing what pages most searchers click on and stay on)Ver. 2.0: PageRank - relying on links to other pages, e.g. publisher votes (in the offline world known as citation analysis)Today: Likes, +1s, tweet mentions, other social recommendations

36

The Social Proof ConundrumWhos the most trusted authority?Britannica or Wikipedia?Your parents or your friends?Your friends or 500 5-star ratings on Yelp?500 5-star ratings on Yelp or one Michelin review?Your teachers or op-ed writers of major newspapers?

37

The Problem With Social ProofWho delivered the Gettysburg Address?

Abraham LincolnPeter NorvigEdward Everett

Google: Gettysburg Address

39

Who Was Edward Everett?The specially invited speaker who delivered THE Gettysburg orationFormer Governor of Massachusetts, U.S. Secretary of State, Harvard President, Minister to Great BritainAbraham Lincoln was a last-minute addition to the ceremony, who spoke for just two minutes after Everetts two hour official oration40

The New Query RealityWith all of the changes to Google, traditional keyword search isnt the most effective approach any moreLonger queries are betterHow long? Full sentences or even paragraphs or full pages of textTip: Voice search is really good with longer queries, & its easy to try many variations41

Boolean Is DeadIts NOT whether you use Boolean operators OR alternate keywords AND hope that Google overrides ~500 other signalsIts how you phrase your question, what youve searched for in the past, and what other people have found useful.42

Advanced Search Isnt DeadAdvanced search is still really useful; accessible three ways:For each type of search result, contextual search tools availableFrom settings button on google.com, select advanced searchUse search operators in your queryWait isnt Advanced Search just Boolean? Sorta/not really

43

About Those Queries20% of queries are unique and new to Google every day 450 billion uniques since 2003!Every query has to travel on average 1,500 miles to get results back to a user44

The Next Big Trend: Predictive SearchIt started with query autofillThen it was Google instant for search resultsNext: Always on search agents that observe your behavior and location, know your history and have access to key assets like your calendar, contacts, social connections and more

Key Question, For YouThe Professional Searcher:Do you want search to be descriptive or predictive?Traditional search focused on precision vs. recallBy definition, and in the way search engines evolved, search results were descriptive

Predictive SearchNew (predictive) search uses machine learning, constant scanning of news, social media and the web, combined with our search history & behavior to intuit whats new and perhaps more interesting than traditional searchGoogle NowBing Satori47

Google NowFor now, very mobile centricAssumes you have your connected mobile device (notice, I didnt say smartphone) with you at all timesHelps with work tasks but skewed toward tasks where you might use your phone to engage with a mobile ad

Bing Predictive SearchRemember this guy?

Hes Baaaaacckkk

Bing Code Name: SatoriUsing Bing as a knowledge base, monitors your activities on apps, and intervenes when it has suggestionsWriting a paper on GalileoChatting with a friend about which movie to seeLike Siri or Google Now, but less intrusive and smarter thanks to content partnerships

Satori PotentialSatori has an intriguing inherent capability: it can search a real time graph of more than 1BN nodes instantlyTranslation: Possible real-time Facebook/Twitter search

Hope For Yahoo Search?

Search is far from over. Its physics in the 1600s or biology in the 1800s. Theres miles to go before you get to quantum physics or even a microscope.

- Marissa Mayer, August 1, 2013

Shifting GearsHeres some stuff to help you search more effectively & creatively on a daily basis53

A People Search Strategy, Part 1Write a mini-biography - schools attended, professional history, hobbies, etc.In search settings select never show instant results and select 100 results per pageNow do a simple search for the persons name54

A People Search Strategy, Part 2Cut & paste results into wordcounter.comMake a list of all non-relevant frequent termsIn Advanced Search:Put must-have terms in exact phrase and at least one of the words boxesPut must-not-have terms in without the words box55

Bill Who?

56

Setting Up The Search

57

Two Simple People Search TacticsUse fill in the blank querieswork for company nameemployed bydegree in/fromexpert inUse the wildcard operator"a * saved is a * earned"

58

People Search & PatentsPatents can provide some really useful search avenues to exploreTry Googles patent search, and use some of the advanced optionsExample:ininventor:"Sebastian Thrun

Patent Search Results

60

Some Additional Useful SitesGoogle is like a crack dealer, one frustrated startup founder told me recently. They give you something that gets you hooked, but you end up strung out. Youre so dependent on somebody that you cant do anything about it.GigaOM, March 16, 201261

Awful Name, Great Results: BlekkoA new search engine, built from scratch over the past six yearsFrom the gang who built Newhoo (the Open Directory Project)A bunch of new, original features you wont find elsewhereEmphasis on openness, community, privacy and customization

62

Blekko SlashtagsSlashtags are filters that customize standard web resultsBlekko has hundreds of pre-built slashtagsYou can also create your own, or use the slashtags created by other users63

Blekko Slashtags

64

Another Fowl NameDuck-Duck-Go. Yes, thats its real name.Lots of useful tools for direct resultsZero click results similar to Google one box but sometimes more usefulDirect search/integration with unique content sources65

Microsoft Academic SearchAn index of millions of academic publicationsAlso displays the key relationships between and among subjects, content, and authors, highlighting the critical links that help define scientific research66

Microsoft Academic Search

67

Wolfram AlphaA computational knowledge engineGreat for authoritative facts that can be computedA little known side benefit: Your personal Facebook report68

Facebook Report

69

QuoraA continually improving collection of questions and answers created, edited, and organized by everyone who uses itSo far, pretty high-quality stuffComing soon: Ask to answer suggestions e.g. recommendations of topical expert Quorans70

YouTube: An Increasingly Rich Resource100+ hours of video are uploaded to YouTube every minuteYouTube is localized in 61 countries and across 61 languagesNot just silly cat videos companies, universities and other important organizations (e.g. governments) now recognize the value of video71

SlideshareBecoming the defacto YouTube for Powerpoint presentationsA great resource for investigating a persons communication skillsA great serendipitous way to find people with similar skills/interests72

A Word About PrivacyGoogle has come under fire regarding its privacy policies, especially in some European countriesAlso from advocacy groups (coincidentally partially funded by Microsoft) here in the US73

Dont Be Evil?

74

Mr. Schmidt Goes To Washington

75

Should You Trust Google?Google was forced to provide some information to the NSAIt gave far less than othersIt also has taken steps to make it harder for others to snoop.However Google also collects (and uses) a ton of personal data every time you use it76

The Bottom Line?Google works very hard to protect privacyIt provides free services in exchange for capturing personal informationIf you feel the usefulness is worth the price, theres little harm in trusting them with your personal details77

Confucius SaysThe hardest thing to find is a black cat in a dark room especially if there is no cat.78

Wait For It!

79