Data All the Way Down

20
Data All the Way Down Jeni Tennison @JeniT http://www.jenitennison.com/blog/

description

Presentation at OKCon 2011 on how to build web applications that provide complex data using a layered architecture.

Transcript of Data All the Way Down

Page 1: Data All the Way Down

Data All the Way DownJeni Tennison@JeniThttp://www.jenitennison.com/blog/

Page 2: Data All the Way Down

Data All the Way Down

• challenges of complex open data

• layered approach to data publishing

• essential steps

• benefits

Page 3: Data All the Way Down

Complex Datasets

• too much for a single spreadsheet

• need to navigate• browse through data

• look at slices of larger dataset

• view summary statistics

• need to explain• definitions of terms, provisos & disclaimers

Page 4: Data All the Way Down

User Challenge

• complex data sets have range of users• different hardware / platforms

• different tasks / goals

• different ability / understanding

• no one interface satisfies everyone

• data owners cannot satisfy everyone

• create ecosystem around open data

Page 5: Data All the Way Down

visualisation / data gap end user vs reuser

Page 6: Data All the Way Down

Visualisations

• approachable for real people

• necessary for stakeholder buy-in

• beauty is in what's left out• advertisement or taster of rich datasets

• often not possible in official data

• leaves questions unanswered• what if we looked at the data in a different way?

Page 7: Data All the Way Down

Raw Data

• importable into own data store• often only interested in particular slice

• data set may be massive / changing

• run whatever analysis you want• requires at least some programming skills

• analysis might not be appropriate for the data

• documentation probably lacking

Page 8: Data All the Way Down

bridging the gap layered data access

Photo by Nikita Kravchuk http://www.flickr.com/photos/mi55er/3845619153/

Page 9: Data All the Way Down

Layered Architecture

• user interface• navigation and global understanding

• API• curated, targeted, programmable access

• query• free-form programmable access

• raw data

Page 10: Data All the Way Down

legislation.gov.uk lists as Atom feeds

Page 11: Data All the Way Down

legislation.gov.uk content as XML

Page 12: Data All the Way Down

legislation.gov.uk layer other views

Page 13: Data All the Way Down

organograms navigable visualisation

Page 14: Data All the Way Down

organograms JSON data

Page 15: Data All the Way Down

organograms RDF / XML / HTML

Page 16: Data All the Way Down

organograms SPARQL query

Page 17: Data All the Way Down

organograms raw data

Page 18: Data All the Way Down

Key Techniques

• resource-driven design (good URIs)

• every page built based on API calls

• explicit links to API access• for bonus points, link to your transformation code

• consistent terminology• clear mapping from UI to API

• caching & access control at each level

Page 19: Data All the Way Down

Benefits

• fork at any point• don't like the visualisation / API? create your own!

• everyone is human• reusers gain understanding from user interface

• visualisation benefits the stack• API oriented towards achieving a goal

• visual validation of data improves quality

Page 20: Data All the Way Down

Questions?