WOW13_RPITWC_Web Observatories

16
Exploration in Web Science: Instruments for Web Observatories Presented by: Kristine Gloria Co-authors: Deborah McGuinness and Joanne Luciano The Tetherless World Constellation Rensselaer Polytechnic Institute, Troy, NY With thanks to the extended RPI Tetherless World Team

description

Presentation during the WOW 2013 workshop featuring Web Observatory works created by the RPI Tetherless World Constellation

Transcript of WOW13_RPITWC_Web Observatories

Page 1: WOW13_RPITWC_Web Observatories

Exploration in Web Science: Instruments for Web

Observatories

Presented by:Kristine Gloria

Co-authors: Deborah McGuinness and Joanne Luciano

The Tetherless World Constellation

Rensselaer Polytechnic Institute, Troy, NY

With thanks to the extended RPI Tetherless World Team

Page 2: WOW13_RPITWC_Web Observatories

Agenda

6

I. Web Observatories at RPI’s Web Science Research Center

II. Web Observatory Themes

III. Science Data

IV. Health and Life Sciences,

V. Open Government

VI. Social Spaces

Page 3: WOW13_RPITWC_Web Observatories

Web Observatories @ WSRC

At RPI WSRC, our observatories present both tools and methodologies that empower researchers to study the web and to make a difference in the world

Page 4: WOW13_RPITWC_Web Observatories

Web Observatories Themes

Science Data Observatory

Health & Life Sciences ObservatoryOpen Government Observatory

Social Spaces Observatory

Page 5: WOW13_RPITWC_Web Observatories

Web Observatory Theme

Open Government Observatory

Page 6: WOW13_RPITWC_Web Observatories

Open Government DataTWC –Intl Open Government Data Sets

Page 7: WOW13_RPITWC_Web Observatories

Web Observatories Themes

Science Data Observatory

Page 8: WOW13_RPITWC_Web Observatories

SemantAqua

• Enable/Empower citizens & scientists to explore pollution sites, facilities, regulations, and health impacts along with provenance

• Demonstrates semantic monitoring possibilities

• Extend to endangered species and resource mgr issues

• Explanations and Provenance available

1

2 3

45

1. Map view of analyzed results2. Explanation of pollution 3. Possible health effect of contaminant (from EPA)4. Filtering by facet to select type of data5. Link for reporting problems6. Extended with input from USGS, with population counts for birds & fish

Page 10: WOW13_RPITWC_Web Observatories

Semantic Methodology and Semantic Application Evolution

5

Originally developed for Virtual Observatories (in solar terrestrial) , now in water quality, Sea ice, volcanology, mycology, oceans…. …

McGuinness, Fox, West, Garcia, Cinquini, Benedict, Middleton The Virtual Solar-Terrestrial Observatory: A Deployed Semantic Web Application Case Study for Scientific Research. Proc. 19 Conf. on Innovative Applications of Artificial Intelligence (IAAI-07), http://www.vsto.org

SemantAqua -> SemantEco -> DataOne modularizing, broadening, provenance, interaction

VSTO -> SESDI -> SPCDIS - modularizing, provenance, broadening, interaction

Page 11: WOW13_RPITWC_Web Observatories

Web Observatory Theme

Health & Life Sciences Observatory

Page 12: WOW13_RPITWC_Web Observatories

Department of Health and Human Services'                  Developer Challenge

6

In June 2012, HHS issued the first of its seven challenges calling for developers “to make high value health data more accessible to entrepreneurs, researchers, and policy makers in the hopes of better health outcomes for all.”

A group from RPI TWC won first place in the competition, by using semantic technologies and in-house developed software, such as csv2rdf4lod, LODSPeaKr, Farrah and DataFAQS.

HHS wanted Metadata

"... application of existing voluntary consensus standards for metadata common to all open government data"

RPI TWC submitted:•DCAT - W3C Data Catalog◦Version controlled on github.◦Extracted from their CKAN as input to converter.•VoID - W3C Vocabulary of Interlinked Data◦Organized datasets by source, dataset, version.◦Provided links to data dumps, Linksets to LOD.•PROV - W3C Provenance Interchange Model◦Captured during CKAN extraction, retrieval, conversion, and publishing.•Dublin Core Metadata Terms◦Annotated subjects based on descriptions.

HHS wanted Classification

"...classify datasets in our growing catalog, creating entities, attributes and relations that form the foundations for better discovery, integration..."

RPI TWC presented:•Bottom-up vocabulary and entity reuse◦Vocabulary created for each dataset◦Enhanced datasets shifted to reuse vocabulary and entities from other datasets.◦Three stub vocabularies for top-level reuse.•NCBO (Nat. Center for Biomedical Ont.) Annotations◦annotator/annotator.py SADI service◦data/source/bioontology-org/annotator-description-subject/version/retrieve.sh

HHS wanted Liquidity

"new designs ... that form the foundations for ... liquidity"

RPI TWC provided: 2B triples among 1M URIs•Dataset Linked Data◦Machine and Human views (via conneg)◦Faceted search of datasets•Dataset dumps (.ttl.gz)◦For each dataset, and for the whole thing.Dataset query (http://healthdata.tw.rpi.edu/sparql)

Text https://github.com/jimmccusker/twc-healthdata/wiki

Page 13: WOW13_RPITWC_Web Observatories

Web Observatory Themes

Social Spaces Observatory

Page 14: WOW13_RPITWC_Web Observatories

Twitter Network ObservatoryMakani, B. & Zhang, Q.

• Explores the relationships of people and semantics in the graph database

• Basic functions:• Users can visualize and

analyze different types of sub-graphs

• Preforms a set of basic analyses for other COSMIC Groups

Page 15: WOW13_RPITWC_Web Observatories

How can we leverage Social Media sites…

to identify these communities, and

stakeholders within them?to gather requirements from these

communities?

First Responders, including Emergency Medical Personnel, Firefighters, and Police Officers, have active online communities on Social Media websites.

First Responders (with NIST)McGuinness, Erickson, Chastain, Fry, Yan, Zhu

 http://tw.rpi.edu/web/project/FirstResponders

Find Topics:

Find Users:How can we leverage

Social Media sites…to identify these communities, and

stakeholders within them?to gather requirements from these

communities?

Page 16: WOW13_RPITWC_Web Observatories

Questions?

6