Big Data = O’Reilly Strata Conference
February 29 2012
Bigger Metadata
Pivot/Skate, etc…
Refounded 2006
Neighborhood boundaries
Mass transit data
Refocused 2009
SaaS for mapping + on-demand data
Founded 2003
Poor man’s GIS
Panamap
Achtung!
NoSQL is no panacea
Big Data isn’t about data
Big Data isn’t new
Big Data doesn’t present a Boolean quandary
With power comes responsibility
AWS bills
Lady Gaga tweets
Innumeracy (correlation v causation)
Big v Important
Big
Heterogeneous
Raw
Distributed
Streaming/real time
Search for meaning
Time-sensitive
Philosophical
Important
Well-defined schema
High value (not free)
Test-driven
Relational
Historical
Enterprise-focused
Data Exhaust
Analytics Probes
Gov 2.0Social Media
Platforms
Commoditization of compute and storage
A Brief History of Metadata
Callimachus Library of Alexandria, Egypt
A Brief History of Metadata
“Pinakes” (lists)
Title
Category
Author
Author birthplace
Father
Word count
Callimachus
A Brief History of Metadata
A Brief History of Metadata
A Brief History of Metadata
Card catalog room,
Library of Congress c. 1920
A Brief History of Metadata
Dewey Decimal System goes electronic in 1967
Out with the Old, in with the New
Archiving card catalogs
after digitization
Why Can’t We Be Together?
Metadata Data
Exponential Growth in Data
1876
TaxonomyPinakes
300 BC
Database
1970
Catalog
1595 AD
Data
Unprecedented rate of data creation, 1995-today
Oh, How I’ve Missed You
The reunification of metadata
and the artifact
Together At Last
GIS Data is Unevolved
+ =
Enter the Data Curator
Part social scientist, part librarian,
part statistician, part RDBMS wiz
DIKW Model
Data
Fact, Signal, Symbol
Information
Structural v Functional
Symbolic v Subjective
Knowledge
Processed
Procedural
Propositional
Popularity (Google Trends)
Words to Live By
dxdt/
Top Related