Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and...

26
Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e- Publications 24 th and 25 th June , 2010 Toby Green OECD Publishing

Transcript of Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and...

Page 1: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

Publishing Official Datasets

4th Bloomsbury Conference on

e-Publishing and e-Publications 24th and 25th June , 2010

Toby GreenOECD Publishing

Page 2: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

Publishing Official Data in cool ways since 1961

Page 3: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

Climategate! “investigation reveals scientific concern about missing tree ring data”. The

Guardian, January 2010

Page 4: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

Would it have been lost had it been properly published and curated?Should we rely on authors to self-publish data?

Page 5: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

Data is not second class stuff. It should be just as easy to:• peer review

• publish• citeas research articles.

Page 6: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

We simply need the existing scholarly publishing ‘toolkit’:• review mechanism

• metadata• doi identifiers• CrossRef

Page 7: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

Here’s one OECD prepared earlier . . .

So, whereas for books we have this:

Page 8: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

For datasets we could have this:

Page 9: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

But data is not the same as an article or book chapter, Sub-sets can be published.

Page 10: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

Data subset seriesHomepage

Subset 1Homepage

Subset 2Homepage

Subset 3Homepage

DOI: 1234.56/Subset#3

Sub-sets: each has unique identifier, with links to the ‘mother’ dataset

DOI: 1234.56/Subset#2

DOI: 1234.56/Subset#1

DOI: 1234.56/Series

DOI link to: Main dataset

DOI link to: Main dataset

DOI link to: Main dataset

Page 11: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

The same data can have a different rendition or graphical interface

Page 12: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

Dataset‘Homepage’

Rendition 1 Rendition 2 Rendition 3

Datasets with multiple renditions: same identifier

Page 13: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

Datasets can grow.

Our current solution is to give them the same

identifier andexplain the growth in the metadata

Page 14: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

Datasets can change.

Our current solution is to give them a NEW

identifier, explain the change in the metadata,

and provide a link back to the original dataset.

Page 15: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

http://doi.org/abr

Read all about it!

Page 16: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

Jim Gray’s data ‘era’ (2008)OECD’s “stuff machine” (2010)

Publications

Processed dataData Presentations

Data

Page 17: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

Publisher Responsibility

Statistician and Researcher Responsibility

Data publishing workflow at OECD

Data producer (author)

Data Editor Data ProductionEditor

Data Operations

Data Marketing & Support

Selection, Quality Assurance, Metadata,Acronym killing,Packaging

DOI allocation,Technical checks.

Hosting,Infrastructure

Promotion,Training,Support,Discovery optimisation

End User and Librarian Feedback

RegistrationCertification

Stewardship

Awareness

Page 18: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

[email protected]

I can end it here, or is there time for more?

Page 19: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

http://statlinks.oecdcode.org/

Page 20: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

Great visualisations tell stories

Charles Minard's 1869 chart showing the losses in men, their movements, and the temperature of Napoleon's 1812 Russian campaign.

Page 21: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

TOYS FOR BOYS?

Page 22: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

OECD Factbook iPhone Apphttp://itunes.apple.com/us/app/oecd-factbook-2010/id327348502?mt=8&uo=6 OECD Regional Statistics eXplorerhttp://stats.oecd.org/OECDregionalstatistics/

OECD Factbloghttps://community.oecd.org/community/factblog/blog/2010/05/11/tax-who-pays-what

OECD graph generatorhttp://viz.oecdcode.org/ts/20755104-table1/latest

 

OECD Toys

Page 23: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

Facebook privacy (not any more): http://mattmckeon.com/facebook-privacy/ Why I can’t get a cab outside the UN building in NY? http://www.nytimes.com/interactive/2010/04/02/nyregion/taxi-map.html

Why my musician brother grows his own food http://www.informationisbeautiful.net/2010/how-much-do-music-artists-earn-online/

How they spend your moneywww.wheredoesmymoneygo.org 

Pimp my data

Page 24: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

PIMP KITS and SITES FOR SHARING DATA

Page 25: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

http://statlinks.oecdcode.org/

Page 26: Publishing Official Datasets 4 th Bloomsbury Conference on e-Publishing and e-Publications 24 th and 25 th June, 2010 Toby Green OECD Publishing.

[email protected]

Thank-you and er…