Data Mashups -Data Science Summit

14
Data Mashups Turning Data Exhaust into Insights May 12, 2011 Data Scientist Summit Pete Skomoroch LinkedIn @peteskomoroch

description

As large datasets come together exciting and unexpected things can happen. Human behavior is high dimensional, so combining many diverse datasets is critical to revealing actionable insights.

Transcript of Data Mashups -Data Science Summit

Page 1: Data Mashups -Data Science Summit

Data Mashups

Turning Data Exhaust into Insights

May 12, 2011Data Scientist SummitPete SkomorochLinkedIn@peteskomoroch

Page 2: Data Mashups -Data Science Summit

We have an explosion of data

•DataWrangling

• InfoChimps

•Data.gov

• Factual

• SimpleGeo

Page 3: Data Mashups -Data Science Summit

And the tools to make sense of it

•Hadoop

•NoSQL

•R

•Python

•Mechanical Turk

Page 4: Data Mashups -Data Science Summit

Diverse datasets = better signal

Page 5: Data Mashups -Data Science Summit
Page 6: Data Mashups -Data Science Summit
Page 7: Data Mashups -Data Science Summit

Find a meaningful problem

http://www.flickr.com/photos/aloshbennett/

• Identify pain points

•Work on stuff that matters

• Focus on underutilized data

Page 8: Data Mashups -Data Science Summit

Trendingtopics.org @hourlytrends

Page 9: Data Mashups -Data Science Summit

LinkedIn Skills

Page 10: Data Mashups -Data Science Summit

The best mashups are actionable

•Reveal patterns

•Enable predictions

•Recommendations

Page 11: Data Mashups -Data Science Summit

Mashup: Skills & Cities

Page 12: Data Mashups -Data Science Summit

Yuba City, California: 21.3% Unemployment

Page 13: Data Mashups -Data Science Summit

Ames, Iowa: 4.7% Unemployment

Page 14: Data Mashups -Data Science Summit

Make data mashups work for you

•Open Data = powerful mashups

•Mashup > sum of its parts

• Focus on meaningful problems

•Actionable mashups are better