Linking Open Government Data (TWC LOGD) Li Ding, Jim Hendler and Deborah L. McGuinness Tetherless World Constellation Rensselaer Polytechnic Institute September 30, 2010

Page 1

Linking Open Government Data

(TWC LOGD)

Li Ding, Jim Hendler and Deborah L. McGuinness

Tetherless World Constellation, Rensselaer Polytechnic Institute

September 30, 2010

Page 2

Opening government data world-wide

Timeline, 2009 to 2010:

• January 21, 2009: “Openness will strengthen our democracy and promote efficiency and effectiveness in Government.” --- President Obama
• May 21, 2009: data.gov online
• June 30, 2009: Putting Government Data online
• January 19, 2010: data.gov.uk online
• May 21, 2010: data.gov relaunch with Semantic Web featured

Many countries: US, UK, Australia, New Zealand, …

Page 3

Semantic Web featured at data.gov

http://www.data.gov/semantic/

• data.gov adopted Semantic Web technologies
• Web-based mashups
• Downloadable RDF data: 6.4 billion triples

Page 4

Data-gov Wiki: Innovations at RPI

The Data-gov Wiki explores, and educates people about, the use of Semantic Web technologies, especially linked data, in producing, processing and utilizing government data from data.gov.

The Data-gov Wiki is run by the Tetherless World Constellation at RPI, headed by Professors Jim Hendler and Deborah McGuinness and led by Li Ding. Student team members include: Dominic DiFranzo, Sarah Magidson, James Michaelis, Alvaro Graves, Adam Bell, Jin Guang Zheng, Xian Li, Tim Lebo, Gregory Todd Williams, Peter Coons, Zhenning Shangguan, Devin Gaffney, William Cooper, Brian Zaik, and Johanna Flores.

• 40+ Demos
• 400+ Datasets
• Tutorials & Videos

Page 5

A Typical Mashup: CASTNET (Clean Air Status and Trends Network)

Data sources:

• Data.gov: CASTNET Ozone (CSV)
• epa.gov: CASTNET Site (CSV)

Mashup types: Data Mashup, Web Application Mashup, Visualization Mashup

Visualization: Exhibit Visualization API

Steps:

1. Convert raw dataset into linkable RDF
2. Query multiple RDF datasets via SPARQL endpoint
3. Drill down for details
4. Surf to EPA applications

Created by Dominic DiFranzo, PhD student at RPI, http://www.data.gov/semantic/Castnet/html/exhibit
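The "convert raw dataset into linkable RDF" and "query via SPARQL endpoint" steps of this mashup can be sketched roughly as follows. This is a minimal illustration, not the TWC LOGD converter: the `http://example.org/castnet/` namespace, the column names, the sample row, and the helper names `csv_to_ntriples` and `sparql_get_url` are all assumptions made for the example. The only standard behavior relied on is N-Triples literal escaping and the SPARQL Protocol convention of passing the query in a `query` URL parameter.

```python
import csv
import io
from urllib.parse import quote, urlencode

# Hypothetical namespace; the real LOGD URIs are not given in the slides.
BASE = "http://example.org/castnet/"


def escape_literal(value):
    """Escape a string for use inside an N-Triples literal."""
    return (value.replace("\\", "\\\\").replace('"', '\\"')
                 .replace("\n", "\\n").replace("\r", "\\r"))


def csv_to_ntriples(csv_text, dataset, key_column):
    """Turn each CSV row into one subject with one triple per column."""
    triples = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        key = row.get(key_column)
        if not key:
            continue  # skip rows that lack the identifying column
        subject = f"<{BASE}{dataset}/{quote(key, safe='')}>"
        for column, value in row.items():
            if column is None or value is None:
                continue  # skip cells from ragged rows
            predicate = f"<{BASE}vocab/{quote(column, safe='')}>"
            triples.append(f'{subject} {predicate} "{escape_literal(value)}" .')
    return triples


def sparql_get_url(endpoint, query):
    """Build a SPARQL Protocol GET URL (the query goes in the `query` parameter)."""
    return endpoint + "?" + urlencode({"query": query})


# Made-up placeholder row, not real CASTNET data.
SAMPLE_SITE_CSV = "site_id,state\nSITE1,NY\n"

for triple in csv_to_ntriples(SAMPLE_SITE_CSV, "site", "site_id"):
    print(triple)
```

Because every row gets a URI derived from its site identifier, an ozone file converted the same way could share those identifiers, which is what makes a cross-dataset SPARQL query possible.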

Page 6

US and UK Foreign Aid FY2007: Integrating data from two countries

Country | Aid comparison | Major aid from US | Major aid from UK
Brazil | US > UK | Development Assistance | Gov & civil society, Economic
India | UK > US | Child Survival and Health | Health, Economic

Created by James Michaelis, RPI, http://data-gov.tw.rpi.edu/demo/linked/aidviz-1554-10030.html

Data Sources:

[Spatial Mashup] Data.gov (USAID) + Data.gov.uk (DFID)
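Integrating the two countries' data amounts to joining both sources on a shared key, here the recipient country. The sketch below uses only the aid categories shown on this slide. The dictionary layout and the `merge_by_country` helper are assumptions for illustration and do not reflect how the demo itself is implemented.

```python
# Aid categories as listed on the slide, keyed by recipient country.
us_aid = {
    "Brazil": "Development Assistance",
    "India": "Child Survival and Health",
}
uk_aid = {
    "Brazil": "Gov & civil society, Economic",
    "India": "Health, Economic",
}


def merge_by_country(us, uk):
    """Join two per-country tables; a missing side is reported as None."""
    merged = {}
    for country in sorted(set(us) | set(uk)):
        merged[country] = {"us": us.get(country), "uk": uk.get(country)}
    return merged


for country, aid in merge_by_country(us_aid, uk_aid).items():
    print(country, aid)
```

Taking the union of keys rather than the intersection keeps countries that receive aid from only one donor visible in the combined view.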

Page 7

Gov Data for Pop Science: Integrating different sources for discovery

Created by Sarah Magidson, U. Chicago. http://data-gov.tw.rpi.edu/demo/stable/tobacco-smoker/demo-state-10026-smoke-rate-statevarsapi.html

[Spatial Mashup] Data.gov (Population) + NIH (Tobacco Tax, Smoking rate)

Government data provides knowledge for population science studies

Page 9

Social Mashup - White House Visitor Search: Linking social network data using semantic wiki

“POTUS” is linked to dbpedia:Barack_Obama

Created by Dominic DiFranzo, Evan Patton, RPI, http://data-gov.tw.rpi.edu/demo/stable/white-house-visitor/top100-visitees.php

[Person Mashup (via Data-gov Wiki)] Data.gov (statistics) + Wikipedia (personal profiles)

Diagram labels: The White House, Semantic Wiki, Wikipedia, NYTimes
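The link between the visitor-record label “POTUS” and dbpedia:Barack_Obama can be sketched as a small alias lookup that emits an owl:sameAs triple. Only the “POTUS” mapping comes from the slide. The `ALIASES` table, the `http://example.org/visitee/` namespace, and both helper functions are hypothetical, and the actual demo maintains such links through the semantic wiki rather than in code.

```python
from urllib.parse import quote

DBPEDIA = "http://dbpedia.org/resource/"           # expansion of the dbpedia: prefix
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"
LOCAL = "http://example.org/visitee/"              # hypothetical local namespace

# Curated label-to-DBpedia mapping; the single entry is the one on the slide.
ALIASES = {"potus": "Barack_Obama"}


def link_visitee(label):
    """Return the DBpedia URI for a visitee label, or None if unmapped."""
    local_name = ALIASES.get(label.strip().lower())
    return DBPEDIA + local_name if local_name else None


def same_as_triple(label):
    """Emit an N-Triples owl:sameAs statement for a mapped label."""
    target = link_visitee(label)
    if target is None:
        return None
    subject = LOCAL + quote(label.strip(), safe="")
    return f"<{subject}> <{OWL_SAME_AS}> <{target}> ."


print(same_as_triple("POTUS"))
```

Once the label resolves to a DBpedia URI, profile data from Wikipedia can be attached to the visitor statistics without string matching on names.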

Page 10

Using Web Tools

Information networks can be integrated via the Semantic Wiki and visualized in a number of different ways: social networks, human-language technology, workflows, …

Page 11

Conclusion and Future Work

• Now
– 8.5 billion triples (up from 6.4 billion)
– “data + visualization + mashup”
– Low-cost solutions
– Education

• New LOGD site is coming
– More raw data, catalog, links
– More technologies, RDFa
– More tools, services
– More demos, tutorials, videos
– More domain applications

• Future Research
– Data integration, linking, search
– Social machine
– Provenance, versions, trust
– Usability and data quality
– Scalability

http://logd.tw.rpi.edu