Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and...
-
Upload
percival-sherman -
Category
Documents
-
view
220 -
download
0
Transcript of Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and...
Publishing Official Datasets
4th Bloomsbury Conference on
e-Publishing and e-Publications 24th and 25th June , 2010
Toby GreenOECD Publishing
Publishing Official Data in cool ways since 1961
Climategate! “investigation reveals scientific concern about missing tree ring data”. The
Guardian, January 2010
Would it have been lost had it been properly published and curated?Should we rely on authors to self-publish data?
Data is not second class stuff. It should be just as easy to:• peer review
• publish• citeas research articles.
We simply need the existing scholarly publishing ‘toolkit’:• review mechanism
• metadata• doi identifiers• CrossRef
Here’s one OECD prepared earlier . . .
So, whereas for books we have this:
For datasets we could have this:
But data is not the same as an article or book chapter, Sub-sets can be published.
Data subset seriesHomepage
Subset 1Homepage
Subset 2Homepage
Subset 3Homepage
DOI: 1234.56/Subset#3
Sub-sets: each has unique identifier, with links to the ‘mother’ dataset
DOI: 1234.56/Subset#2
DOI: 1234.56/Subset#1
DOI: 1234.56/Series
DOI link to: Main dataset
DOI link to: Main dataset
DOI link to: Main dataset
The same data can have a different rendition or graphical interface
Dataset‘Homepage’
Rendition 1 Rendition 2 Rendition 3
Datasets with multiple renditions: same identifier
Datasets can grow.
Our current solution is to give them the same
identifier andexplain the growth in the metadata
Datasets can change.
Our current solution is to give them a NEW
identifier, explain the change in the metadata,
and provide a link back to the original dataset.
Jim Gray’s data ‘era’ (2008)OECD’s “stuff machine” (2010)
Publications
Processed dataData Presentations
Data
Publisher Responsibility
Statistician and Researcher Responsibility
Data publishing workflow at OECD
Data producer (author)
Data Editor Data ProductionEditor
Data Operations
Data Marketing & Support
Selection, Quality Assurance, Metadata,Acronym killing,Packaging
DOI allocation,Technical checks.
Hosting,Infrastructure
Promotion,Training,Support,Discovery optimisation
End User and Librarian Feedback
RegistrationCertification
Stewardship
Awareness
http://statlinks.oecdcode.org/
Great visualisations tell stories
Charles Minard's 1869 chart showing the losses in men, their movements, and the temperature of Napoleon's 1812 Russian campaign.
TOYS FOR BOYS?
OECD Factbook iPhone Apphttp://itunes.apple.com/us/app/oecd-factbook-2010/id327348502?mt=8&uo=6 OECD Regional Statistics eXplorerhttp://stats.oecd.org/OECDregionalstatistics/
OECD Factbloghttps://community.oecd.org/community/factblog/blog/2010/05/11/tax-who-pays-what
OECD graph generatorhttp://viz.oecdcode.org/ts/20755104-table1/latest
OECD Toys
Facebook privacy (not any more): http://mattmckeon.com/facebook-privacy/ Why I can’t get a cab outside the UN building in NY? http://www.nytimes.com/interactive/2010/04/02/nyregion/taxi-map.html
Why my musician brother grows his own food http://www.informationisbeautiful.net/2010/how-much-do-music-artists-earn-online/
How they spend your moneywww.wheredoesmymoneygo.org
Pimp my data
PIMP KITS and SITES FOR SHARING DATA
http://statlinks.oecdcode.org/
Thank-you and er…