Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]
![Page 1: Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]](https://reader034.fdocuments.us/reader034/viewer/2022052413/559e1a9c1a28abcf5b8b45e7/html5/thumbnails/1.jpg)
Towards the Discovery of Person-Level Data
Thomas Bosch¹, Benjamin Zapilko¹, Joachim Wackerow¹, Arofan Gregory²
¹ GESIS – Leibniz Institute for the Social Sciences, Germany {first name.last name}@gesis.org
² Open Data Foundation, USA [email protected]
International Workshop on Semantic Statistics, 22 October 2013, Sydney, Australia
![Page 2: Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]](https://reader034.fdocuments.us/reader034/viewer/2022052413/559e1a9c1a28abcf5b8b45e7/html5/thumbnails/2.jpg)
Why DDI as Linked Data?
![Page 3: Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]](https://reader034.fdocuments.us/reader034/viewer/2022052413/559e1a9c1a28abcf5b8b45e7/html5/thumbnails/3.jpg)
Overview
[Figure: UML class overview. Classes shown: Study, StudyGroup, LogicalDataSet (dcat:Dataset), Variable, Question, Instrument, Questionnaire, Universe (skos:Concept), AnalysisUnit (skos:Concept); one element carries the «union» stereotype. Association labels: product, inGroup, variable, containsVariable, question, universe, analysisUnit.]
![Page 4: Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]](https://reader034.fdocuments.us/reader034/viewer/2022052413/559e1a9c1a28abcf5b8b45e7/html5/thumbnails/4.jpg)
Use Cases
![Page 5: Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]](https://reader034.fdocuments.us/reader034/viewer/2022052413/559e1a9c1a28abcf5b8b45e7/html5/thumbnails/5.jpg)
Where to search for specific data?
![Page 6: Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]](https://reader034.fdocuments.us/reader034/viewer/2022052413/559e1a9c1a28abcf5b8b45e7/html5/thumbnails/6.jpg)
What microdata exists for specific metadata?
![Page 7: Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]](https://reader034.fdocuments.us/reader034/viewer/2022052413/559e1a9c1a28abcf5b8b45e7/html5/thumbnails/7.jpg)
What datasets are associated with the microdata?
![Page 8: Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]](https://reader034.fdocuments.us/reader034/viewer/2022052413/559e1a9c1a28abcf5b8b45e7/html5/thumbnails/8.jpg)
What aggregated data exists for specific metadata?
![Page 9: Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]](https://reader034.fdocuments.us/reader034/viewer/2022052413/559e1a9c1a28abcf5b8b45e7/html5/thumbnails/9.jpg)
What datasets are associated with the aggregated data?
![Page 10: Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]](https://reader034.fdocuments.us/reader034/viewer/2022052413/559e1a9c1a28abcf5b8b45e7/html5/thumbnails/10.jpg)
From which microdata datasets is the aggregated dataset derived?
![Page 11: Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]](https://reader034.fdocuments.us/reader034/viewer/2022052413/559e1a9c1a28abcf5b8b45e7/html5/thumbnails/11.jpg)
What summary statistics does a variable have?
![Page 12: Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]](https://reader034.fdocuments.us/reader034/viewer/2022052413/559e1a9c1a28abcf5b8b45e7/html5/thumbnails/12.jpg)
What category statistics does a variable representation have?
![Page 13: Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]](https://reader034.fdocuments.us/reader034/viewer/2022052413/559e1a9c1a28abcf5b8b45e7/html5/thumbnails/13.jpg)
What microdata datasets are created by the research institute 'GESIS'?
![Page 14: Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]](https://reader034.fdocuments.us/reader034/viewer/2022052413/559e1a9c1a28abcf5b8b45e7/html5/thumbnails/14.jpg)
![Page 15: Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]](https://reader034.fdocuments.us/reader034/viewer/2022052413/559e1a9c1a28abcf5b8b45e7/html5/thumbnails/15.jpg)
Conclusion
![Page 16: Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]](https://reader034.fdocuments.us/reader034/viewer/2022052413/559e1a9c1a28abcf5b8b45e7/html5/thumbnails/16.jpg)
Thank you for your attention…
![Page 17: Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]](https://reader034.fdocuments.us/reader034/viewer/2022052413/559e1a9c1a28abcf5b8b45e7/html5/thumbnails/17.jpg)
Backup Slides
![Page 18: Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]](https://reader034.fdocuments.us/reader034/viewer/2022052413/559e1a9c1a28abcf5b8b45e7/html5/thumbnails/18.jpg)
Why DDI as Linked Data?
• Users discover data in the Linked Open Data Cloud using DDI metadata
• Users can search for data
• Data providers publish searchable / accessible metadata
• The Discovery specification contains the most important DDI concepts for the discovery purpose
• We integrated well-elaborated vocabularies
• We use Semantic Web (SW) technologies
![Page 19: Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]](https://reader034.fdocuments.us/reader034/viewer/2022052413/559e1a9c1a28abcf5b8b45e7/html5/thumbnails/19.jpg)
Overview
• Study
• LogicalDataSet: the dataset in which the actual data are stored
• Universe: To whom does the study apply? (e.g. all women in Germany)
• Analysis Unit (e.g. persons or households)
• Instrument: How do we want to measure? (e.g. 'What is your sex?')
• Concept: What do we want to measure?
• Variable: Where do we save what we measured? (e.g. sex)
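The classes listed above can be illustrated with a small set of subject-predicate-object triples and a discovery query over them. This is only a minimal sketch in plain Python, not the Discovery vocabulary itself: every `ex:` identifier is invented example data, the property name `concept` and the helper functions are hypothetical, and the direction assumed for `product`, `containsVariable`, `universe`, `analysisUnit` and `question` is an assumption based on the association labels of the overview diagram.

```python
from typing import Iterator, List, Optional, Tuple

Triple = Tuple[str, str, str]

# Hypothetical example data; all "ex:" identifiers are invented for illustration.
TRIPLES: List[Triple] = [
    ("ex:study1", "rdf:type", "Study"),
    ("ex:study1", "universe", "ex:womenInGermany"),
    ("ex:study1", "analysisUnit", "ex:person"),
    ("ex:study1", "product", "ex:dataset1"),
    ("ex:dataset1", "rdf:type", "LogicalDataSet"),
    ("ex:dataset1", "containsVariable", "ex:varSex"),
    ("ex:varSex", "rdf:type", "Variable"),
    ("ex:varSex", "concept", "ex:conceptSex"),    # "concept" is an assumed property name
    ("ex:varSex", "question", "ex:questionSex"),  # e.g. 'What is your sex?'
]


def match(s: Optional[str] = None,
          p: Optional[str] = None,
          o: Optional[str] = None) -> Iterator[Triple]:
    """Yield all triples that agree with the given pattern; None is a wildcard."""
    for triple in TRIPLES:
        if ((s is None or triple[0] == s)
                and (p is None or triple[1] == p)
                and (o is None or triple[2] == o)):
            yield triple


def datasets_measuring(concept: str) -> List[str]:
    """Return the datasets containing at least one variable linked to the concept."""
    variables = {subj for subj, _, _ in match(p="concept", o=concept)}
    return sorted({subj for subj, _, obj in match(p="containsVariable")
                   if obj in variables})


def studies_for_universe(universe: str) -> List[str]:
    """Return the studies whose universe is the given one."""
    return sorted({subj for subj, _, _ in match(p="universe", o=universe)})


print(datasets_measuring("ex:conceptSex"))
print(studies_for_universe("ex:womenInGermany"))
```

In a real deployment the same questions would more likely be asked as SPARQL queries against an RDF store; the tuple list here only stands in for such a store.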
![Page 20: Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]](https://reader034.fdocuments.us/reader034/viewer/2022052413/559e1a9c1a28abcf5b8b45e7/html5/thumbnails/20.jpg)
Future Work
• Physical description of rectangular data, especially CSV data
  – We will integrate this description into Discovery
• How are aggregated data derived from microdata?
  – The aggregation method is described in a form that machines can process
  – We see a need to explore this area further in order to describe the relationship between aggregate data and microdata in more detail
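To make the second point concrete, the following sketch shows one way an aggregation method could be written down so that a program can apply it to microdata. It is an illustration under stated assumptions, not a proposal from the slides: the record layout, the `AGGREGATION` dictionary with its keys `group_by` and `measure`, and the function `aggregate` are all hypothetical.

```python
from collections import Counter
from typing import Dict, List, Tuple

# Hypothetical microdata: one record per person.
MICRODATA: List[Dict[str, str]] = [
    {"id": "1", "sex": "female", "region": "north"},
    {"id": "2", "sex": "female", "region": "north"},
    {"id": "3", "sex": "male", "region": "north"},
    {"id": "4", "sex": "female", "region": "south"},
]

# Hypothetical machine-readable description of the aggregation method.
AGGREGATION = {"group_by": ["region", "sex"], "measure": "count"}


def aggregate(records: List[Dict[str, str]],
              spec: Dict[str, object]) -> Dict[Tuple[str, ...], int]:
    """Derive an aggregated table from person-level records according to spec."""
    if spec["measure"] != "count":
        raise ValueError("this sketch only supports the 'count' measure")
    keys = list(spec["group_by"])
    # Count how many records fall into each combination of grouping values.
    return dict(Counter(tuple(record[k] for k in keys) for record in records))


table = aggregate(MICRODATA, AGGREGATION)
for cell, count in sorted(table.items()):
    print(cell, count)
```

Because the description is data rather than prose, the link between an aggregated dataset and the microdata it was derived from can be recomputed and checked by software.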
![Page 21: Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]](https://reader034.fdocuments.us/reader034/viewer/2022052413/559e1a9c1a28abcf5b8b45e7/html5/thumbnails/21.jpg)
Acknowledgements
26 experts from the statistical community and the Linked Data community, coming from 12 different countries, contributed to this work. They participated in the events listed below.
• 1st workshop on 'Semantic Statistics for Social, Behavioural, and Economic Sciences: Leveraging the DDI Model for the Linked Data Web' at Schloss Dagstuhl - Leibniz Center for Informatics, Germany, in September 2011
• Working meeting in the course of the 3rd Annual European DDI Users Group Meeting (EDDI11) in Gothenburg, Sweden, in December 2011
• 2nd workshop on 'Semantic Statistics for Social, Behavioural, and Economic Sciences: Leveraging the DDI Model for the Linked Data Web' at Schloss Dagstuhl - Leibniz Center for Informatics, Germany, in October 2012
• Working meeting at GESIS - Leibniz Institute for the Social Sciences in Mannheim, Germany, in February 2013