ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data...

23
ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation [email protected] http:// www.opendatafoundation.org

Transcript of ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data...

Page 1: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

ODaF Europe 2008Colchester, UK, April 14-15, 2008

Metadata in social science and the Open Data Foundation

Pascal Heus

Open Data Foundation

[email protected]

http://www.opendatafoundation.org

Page 2: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

http://www.opendatafoundation.org

Thanks!

ODaF Europe 2009

Page 3: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

http://www.opendatafoundation.org

Outline

• Needs and challenges in statistical data and metadata management

• Metadata and XML solutions• Selecting specifications• Need for tools• Open Data Foundation• Conclusions / Q&A

Page 4: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

http://www.opendatafoundation.org

Managing data and metadata is challenging!

We are in charge of the data. We support our users but also need to protect our respondents!

We want easy access to high quality and well documented data!

We need to collect the information from the producers, preserve it, and provide access to our users!

Producers

Librarians

Users

General Public

Policy Makers

Sponsors

Media/Press

Academic

Business

Government

We have an information

management problem

Page 5: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

http://www.opendatafoundation.org

What is Metadata?

• Common definition: Data about Data

Unlabeled stuff Labeled stuff

The bean example is taken from: A Manager’sIntroduction to Adobe eXtensible Metadata Platform, http://www.adobe.com/products/xmp/pdfs/whitepaper.pdf

Page 6: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

http://www.opendatafoundation.org

XML to the rescue!

• XML is driving today’s web service oriented architecture of the Internet and Intranets

• Using XML, we can capture, structure, transform, discover, exchange, query, edit and secure metadata and data

• XML is platform & language independent and can be used by everyone

• XML is both machine and human readable• XML is non-proprietary, public domain and

many open tools exist• Domain specific standards are available!

Page 7: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

http://www.opendatafoundation.org

XML Technical Overview

StructureDTD

XSchema

TransformXSL, XSLT

XSL-FO

DiscoverRegistriesDatabases

ExchangeWeb Services

SOAPREST

SearchXPath

XQuery

ManageSoftwareXForms

CaptureXML

Page 8: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

http://www.opendatafoundation.org

XML Solutions

Producers

Librarians

Users

General Public

Policy Makers

Media/Press

Academic

Business

Government

Sponsors

XML Specs

Use our specifications and your will be happy! It will harmonize everything.

Great, I can provide public metadata!

Well documented data, here we come!

Now we can talk to each other!

Page 9: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

http://www.opendatafoundation.org

Let’s use XML, but….

Producers

Librarians

Users

XML Specs

Which specifications should we adopt?

How do we do this? Where are the tools and guidelines?

?Open Data Foundation

Page 10: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

http://www.opendatafoundation.org

Open Data Foundation (ODaF)

• US Based non-profit organization, established 2006

• Directors, advisors and managers from statistical and ICT communities

• Project oriented• Mission

– Focus on socio-economic data– Adoption of global metadata standards – Coordinated development of open-source tools– Capacity building– Improving data and metadata accessibility and

overall quality – Operate at the global level

Page 11: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

http://www.opendatafoundation.org

Selecting XML specifications

• A single specification is not enough!– XML specifications commonly focus on a specific

area of knowledge and/or set of functionalities– Cannot answer the needs of all actors

• XML mappings between specifications are possible– Information can be converted from one domain to

another and be carried across communities

• Which ones should we use?– Fit for purpose– Widely accepted and supported– Can be mapped to a cross-domain family

Page 12: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

http://www.opendatafoundation.org

A suggested set for socio-economic data

• Statistical Data and Metadata Exchange (SDMX)– Macrodata, time series, indicators, registries– http://www.sdmx.org

• Data Documentation Initiative (DDI)– Microdata (surveys, studies)– http://www.ddialliance.org

• ISO 11179– Semantic modeling, concepts, registries– http://metadata-standards.org/11179/

• ISO 19115– Geography– http://www.isotc211.org/

• Dublin Core– Resources (documentation, images, multimedia)– http://www.dublincore.org

Page 13: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

http://www.opendatafoundation.org

The need for Tools

We produce data not tools! We don’t have the expertise.

We use data and software but we don’t build tools! We don’t have the expertise

We preserve and disseminate data not software! We don’t have the expertise

Producers

Librarians

Users

XML Specs

We set specifications and standards. Tools are not our mandate

Open Data Foundation

Page 14: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

http://www.opendatafoundation.org

Open Data Foundation

The need for Tools

Mandated to develop toolsProvide cross-domain expertise in ICT and statistics

Provide umbrella for coordinated developmentEnsure inter-operability

Outline harmonized architecture and environmentPromote open source / maximize reusability

Build global registriesResources/Fund raising

Coordination / Networking…

Page 15: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

http://www.opendatafoundation.org

ODaF Vision

• Promote and facilitate the production and use of “open data”– Public metadata, high quality, fully documented, respondent

protected, easy to find, accessible in accordance to statistical principles and legislations

• Foster a global harmonized framework– Facilitate the flow of data and metadata– Promotes dialog between all stakeholders

Unlock the Data!

Page 16: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

http://www.opendatafoundation.org

Some ODaF Projects & Ideas

• Guidelines for an harmonized architecture and development environment

• Foster tools development• XML mappings• Facility to host development of open source

projects (GForge)• Provide hosting services for agencies• Produce training and reference material• Technical support & capacity building• Bring people / agencies together• …

Page 17: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

http://www.opendatafoundation.org

ODaF partners

• Statistical agencies / producers• Data Archives• Research Data Centers / Data enclaves• Academic & Research communities• Standard settings agencies & consortiums• Governmental organizations• International organizations• Open source community• Software developers• IT Vendors

Page 18: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

http://www.opendatafoundation.org

Growing solutions in a complex environment

USEPRODUCTION

PRESERVATION

ANALYSIS

DISCOVERY

DISSEMINATION

QUALITY

SECURITY

SPSS

Excel

SDDS

GDDS

Toolkit

Blaise

CSPro

SAS METADATAStata

DQAF

Access

Privacy Disclosure

Legal

ISO 11179

TECHNOLOGY

Accessibility

DDI

ISO 19115

SDMX

DCMIRegistries

XML

WebDatabases

XML-DB

Infrastructure

XSLTXPath SOAP

Programming

GIS

Warehouse

What are we concerned with?

Page 19: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

http://www.opendatafoundation.org

Growing solutions in a complex environment

USEPRODUCTION

PRESERVATION

ANALYSIS

DISCOVERY

DISSEMINATION

QUALITY

SECURITY

SPSS

Excel

SDDS

GDDS

Toolkit

Blaise

CSPro

SAS METADATAStata

DQAF

Access

Privacy Disclosure

Legal

ISO 11179

TECHNOLOGY

Accessibility

DDI

ISO 19115

SDMX

DCMIRegistries

XML

WebDatabases

XML-DB

Infrastructure

XSLTXPath SOAP

Programming

GIS

Warehouse

CHALLENGEWe need a set of tools that work

together in an harmonized framework. This requires

coordinated efforts and expertise from the various communities

OPEN DATA FOUNDATION• Provide cross-domain & IT expertise• Coordinate and support development• Knowledge sharing• Capacity Building• Provide global vision and guidance

Page 20: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

http://www.opendatafoundation.org

ODaF Projects / Activities

• Advocacy– ODaF meetings (Europe, US)– Conferences (FedCasic, IASSIST, METIS, etc.)– Coordination / networking

• DDI– DDI Foundation Tools Program (coordination, DExT, etc.)– Standards: DDI, SDMX, mappings

• Research Data Centers / Data enclaves– National Opinion Research Center (NORC), Canada RDC,

European initiatives– Establish standards based framework + investigate

researcher metadata / collaborative spaces

• Others– QuDExT, technical assistance (often semail)– Open to suggestions

Page 21: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

http://www.opendatafoundation.org

ODaF Challenges

• The technology is available today• The right people are available today• The need and the will are there• The challenges are:

– Bring together individuals and agencies– Develop of harmonized tools– Focus resources and funding– Develop awareness / understanding of technology– Manage change– Content management– Coordination & Guidance– Institutional commitment

• Learn for the past for a better future• It’s not about data, it’s about people

Page 22: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

http://www.opendatafoundation.org

Summary

• Managing data and metadata is challenging– Solutions exist to make it easier and provide

better information to unlock the data

• Adopt a set of specifications that answer your requirements and can connect across domains– DDI, SDMX, ISO 11179, Dublin Core, ISO 19115

• Promote the use and development of open tools, do not work in isolation, get the appropriate expertise– Open Data Foundation

Page 23: ODaF Europe 2008 Colchester, UK, April 14-15, 2008 Metadata in social science and the Open Data Foundation Pascal Heus Open Data Foundation pheus@opendatafoundation.org.

http://www.opendatafoundation.org

Meet the Icons...

THANK YOU!