Scaling Credible Content

Scaling CREDIBLE Content A Case Study by Joe Griffin Co-CEO, iAcquire

Learn how iAcquire scaled identification of credible content producers - with credibility being based on authorship proliferation.

Transcript of Scaling Credible Content

Page 1: Scaling Credible Content

Scaling CREDIBLE Content

A Case Study

by Joe Griffin

Co-CEO, iAcquire

Page 2: Scaling Credible Content

We do Scalable Content Marketing

Page 3: Scaling Credible Content

Problem: Finding CREDIBLE Content Producers at Scale Is Tough

Page 4: Scaling Credible Content

Within search results, information tied to verified online profiles will be ranked higher than content without such verification, which will result in most users naturally clicking on the top (verified) results. The true cost of remaining anonymous, then, might be irrelevance.

– The New Digital Age by Eric Schmidt, Chairman of Google

Page 5: Scaling Credible Content

Okay, How Do We Find These Authors?

Page 6: Scaling Credible Content

Operation Find and Rank Authors

- Round 1 – Low Scale

Page 7: Scaling Credible Content

We Stalked Big Publishers That Use Google Authorship

Page 8: Scaling Credible Content

Nerdy Data Crawls Millions of Websites & You Can Search HTML

Page 9: Scaling Credible Content

Example – 55,000 Websites Found Using the Google Maps API

Page 10: Scaling Credible Content

Blekko Published a WebGrep on Rel=Author – Top 500 Free

Page 11: Scaling Credible Content

We Created Big Lists of Credible Authors (like 100,000)

Page 12: Scaling Credible Content

But, We Needed to Judge Their Relative Proliferation…

Because Google Authorship Is Becoming “People PageRank”

Page 13: Scaling Credible Content

Necessity is the Mother of Invention

Page 14: Scaling Credible Content

Operation Find and Rank Authors

- Round 2 – Big Scale

Page 15: Scaling Credible Content

We Tapped into Big Data & Built an Application

Big Data + Software Engineering = Scale

Page 16: Scaling Credible Content

The Common Crawl Became a Seed Source

41 Million Domains, 4 Billion Pages and 210 Terabytes

Page 17: Scaling Credible Content

GNIP Served as a Real-Time Engine

Firehoses from Twitter, Tumblr, WordPress, and more…

Page 18: Scaling Credible Content

Example – You Can Pull Up to 2 Years of Historical Tweets

Firehoses from Twitter, Tumblr, WordPress, and more…

Page 19: Scaling Credible Content

We Used Amazon CloudSearch to Create an Index of the Domains

Specifically, URLs that have authorship markup.
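
To make "authorship markup" concrete, here is a minimal sketch of how a crawled page could be checked for it before being indexed. It is an illustration only, not the application described in this deck: the class name AuthorshipLinkParser and the function find_author_links are hypothetical, and it assumes the markup takes the form of an a or link tag with rel="author", or a link whose URL carries a rel=author query parameter.

```python
from html.parser import HTMLParser
from urllib.parse import parse_qs, urlparse


class AuthorshipLinkParser(HTMLParser):
    """Collects hrefs of links that carry authorship markup (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.author_links = []

    def handle_starttag(self, tag, attrs):
        if tag not in ("a", "link"):
            return
        attr = dict(attrs)
        href = attr.get("href")
        if not href:
            return
        # Case 1: rel attribute contains the "author" token.
        rel_tokens = (attr.get("rel") or "").lower().split()
        # Case 2: the href itself carries ?rel=author as a query parameter.
        query_rel = parse_qs(urlparse(href).query).get("rel", [])
        if "author" in rel_tokens or "author" in query_rel:
            self.author_links.append(href)


def find_author_links(html):
    """Return every authorship link found in one HTML document, in page order."""
    parser = AuthorshipLinkParser()
    parser.feed(html)
    return parser.author_links
```

A page for which find_author_links returns a non-empty list would be a candidate for the index; which search service stores that index is independent of this check.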

Page 20: Scaling Credible Content

We Used Giraph to Rank the Authors

Facebook Uses Giraph Too
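
The deck does not say which ranking algorithm was run on Giraph. Since it describes authorship as "People PageRank", the sketch below assumes a PageRank-style score over a graph of authors, written in plain Python rather than as a Giraph job. The function rank_authors, the edge format (source, target), and the damping and iteration defaults are all assumptions made for illustration.

```python
def rank_authors(edges, damping=0.85, iterations=50):
    """PageRank-style scores for an author graph (illustrative assumption).

    edges: iterable of (source, target) pairs, e.g. one author or site
    linking to or citing another author.
    """
    nodes = set()
    outgoing = {}
    for src, dst in edges:
        nodes.update((src, dst))
        outgoing.setdefault(src, []).append(dst)

    n = len(nodes)
    if n == 0:
        return {}

    # Start from a uniform distribution over all authors.
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Rank held by authors with no outgoing edges is spread evenly.
        dangling = sum(rank[v] for v in nodes if v not in outgoing)
        base = (1.0 - damping) / n + damping * dangling / n
        next_rank = {v: base for v in nodes}
        for src, targets in outgoing.items():
            share = damping * rank[src] / len(targets)
            for dst in targets:
                next_rank[dst] += share
        rank = next_rank
    return rank
```

Sorting the returned dictionary by value, highest first, gives a relative ordering of authors. A system like Giraph distributes the same kind of per-vertex update across a cluster so it can run on graphs far larger than one machine can hold.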

Page 21: Scaling Credible Content

The Result

Page 22: Scaling Credible Content

VoiceGraph™ and VoiceRank™

Page 23: Scaling Credible Content

Follow iAcquire™ and ClearVoice™ for Tools and Resources

Page 24: Scaling Credible Content

You Stay Classy, Manhattan