Five Things You Didn't Know DataSift Can Do

Transcript of Five Things You Didn't Know DataSift Can Do

Page 1: Five Things You Didn't Know DataSift Can Do

Brad Hubbard, Product Manager, Developer Relations, DataSift

Five Things You Didn’t Know DataSift Can Do

#DSwebinar

Page 2: Five Things You Didn't Know DataSift Can Do

HUMAN DATA INTELLIGENCE

FILTER • TAG • ENRICH

STORE

Stream products will be covered today. To see PYLON (our aggregated, anonymized Facebook topic data), join our next live demo:

http://lp.datasift.com/20150701-Live-SE-Demo-Registration

DataSift is of Two Minds: Indexed Data & Streaming

VEDO

Page 3: Five Things You Didn't Know DataSift Can Do

Launched: 2011

Customers: 1K, across 40 countries

Global offices: 4
• San Francisco
• New York
• London
• Reading, UK

Items processed per day: 2B

(These don’t count toward the 5 things)

Page 4: Five Things You Didn't Know DataSift Can Do

Brave New Data World

of all digital data created by consumers

emails a day

of US adults’ location is known

increase in global data by 2020

(Word cloud: Thoughts, Emotions, Likes, Dislikes, Intentions, Ideas, Current Events, GEO, Occupation, Age, Topics, Gender)

Page 5: Five Things You Didn't Know DataSift Can Do

Sources of Human-Generated Data

SOCIAL NETWORKS

BLOGS & NEWS

INSIDE YOUR BUSINESS

Page 6: Five Things You Didn't Know DataSift Can Do

The Complexity of Human Data

VOLUME • VARIETY • VELOCITY

Billions of users

Noisy

Generated in real time

per second

Post vs blog vs like

Terabytes per day

Ambiguous

Big spikes

Unstructured

Page 7: Five Things You Didn't Know DataSift Can Do

Turn Human Data into Meaning

Page 8: Five Things You Didn't Know DataSift Can Do

Unify Human Data

Page 9: Five Things You Didn't Know DataSift Can Do

We apply structure to the chaotic world of human data

Page 10: Five Things You Didn't Know DataSift Can Do

Facebook

Tencent Weibo

Sina Weibo

Google+

YouTube

Instagram

LexisNexis

Wikipedia

WordPress

Tumblr

Intense Debate

Disqus

NewsCred

Reddit

Topix

Jive

Twitter

EDGAR

News

Video

IMDB

Yammer

Unifying data from across the web

Page 11: Five Things You Didn't Know DataSift Can Do

Filtering Human Data with CSDL

Page 12: Five Things You Didn't Know DataSift Can Do

Filter: CSDL Data Processing Language

Page 13: Five Things You Didn't Know DataSift Can Do

WRITE ONCE • USE MANY

Filter against generic objects or get source-specific

Page 14: Five Things You Didn't Know DataSift Can Do

INFINITE COMPLEXITY

Rules can contain millions of tag and filter criteria; there is no need to limit yourself

Page 15: Five Things You Didn't Know DataSift Can Do

Enrich Human Data

Page 16: Five Things You Didn't Know DataSift Can Do

LINKS AUGMENTATION

Identifies links in social posts and fetches header data, allowing you to filter against link content

Page 17: Five Things You Didn't Know DataSift Can Do

LANGUAGE DETECTION

Write filters on a per-language basis, or limit yourself to only certain languages

Page 18: Five Things You Didn't Know DataSift Can Do

Location either disclosed by user or listed in profile

GENDER DETECTION USING PROFILES AND NAME + LANGUAGE

Page 19: Five Things You Didn't Know DataSift Can Do

SENTIMENT AND TOPICS

Likely positive • Neutral • Likely negative

Topic detection (looking for nouns and disambiguating them)

Page 20: Five Things You Didn't Know DataSift Can Do

Categorization, Scoring and Tagging

Page 21: Five Things You Didn't Know DataSift Can Do

Apply Data Science

VEDO enables automatic classification of Human Data based on its meaning

Page 22: Five Things You Didn't Know DataSift Can Do

OFF-THE-SHELF CLASSIFIERS

Enable automatic scoring and classification

Page 23: Five Things You Didn't Know DataSift Can Do

CUSTOM TAXONOMIES

Hierarchical rules to match your business

Page 24: Five Things You Didn't Know DataSift Can Do

CUSTOM SCORING SYSTEM

To expose meaning hidden deep within unstructured, text-rich data
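
To make the idea of a custom scoring system concrete, here is a minimal Python sketch of weighted keyword scoring. It is an illustration of the general concept only and is not VEDO or CSDL syntax: the `ScoringRule` class, the `score_text` function, the keywords, and the weights are all hypothetical names and values chosen for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ScoringRule:
    """Hypothetical rule: add `weight` when `keyword` appears in the text."""
    keyword: str   # lowercase phrase to look for
    weight: float  # amount added to the score on a match


def score_text(text, rules):
    """Sum the weights of every rule whose keyword occurs in the text."""
    lowered = text.lower()
    return sum(rule.weight for rule in rules if rule.keyword in lowered)


# Illustrative rules only; a real scoring system would be tuned to the business.
RULES = [
    ScoringRule("refund", 2.0),
    ScoringRule("broken", 1.5),
    ScoringRule("love", -1.0),
]

urgency = score_text("My phone arrived BROKEN and I want a refund", RULES)
```

Matching here is plain case-insensitive substring search, which keeps the sketch short; a production classifier would need tokenization and far richer rules.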

Page 25: Five Things You Didn't Know DataSift Can Do

Delivery

Use Everywhere

Page 26: Five Things You Didn't Know DataSift Can Do

CONSUME A JSON STREAM DIRECTLY
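
As a rough sketch of what consuming a JSON stream can look like on the client side, the Python below reads one JSON object per line from any file-like object and keeps only English items. The slides do not describe the wire format, so the newline-delimited framing, the `iter_interactions` helper, and the `interaction.content` and `language.tag` fields in the sample data are assumptions made for illustration.

```python
import io
import json


def iter_interactions(stream):
    """Yield one parsed JSON object per non-empty line of a text stream."""
    for line in stream:
        line = line.strip()
        if not line:
            continue  # skip blank keep-alive or padding lines
        yield json.loads(line)


# Stand-in for a network stream; the field names are assumed for this example.
sample = io.StringIO(
    '{"interaction": {"content": "Hello world"}, "language": {"tag": "en"}}\n'
    '\n'
    '{"interaction": {"content": "Bonjour"}, "language": {"tag": "fr"}}\n'
)

english = [
    item["interaction"]["content"]
    for item in iter_interactions(sample)
    if item.get("language", {}).get("tag") == "en"
]
```

Because `iter_interactions` is a generator, items are handled one at a time as they arrive instead of being buffered, which suits a feed that never ends.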

Page 27: Five Things You Didn't Know DataSift Can Do

Send your data to any of these pre-built connectors

Page 28: Five Things You Didn't Know DataSift Can Do

We handle the infrastructure and send you the data you need

Page 29: Five Things You Didn't Know DataSift Can Do

THANK YOU
