Multilingual scraping from dutch government data
-
Upload
tobias-wunner -
Category
Documents
-
view
448 -
download
0
description
Transcript of Multilingual scraping from dutch government data
Multilingual Scraping fromOpen Dutch Government Data
Open Data Day Hackathon IrelandDERI & 091 labs Galway, 4 Dec 2010
Tobias Wunner
Dutch open government data
3 websites same databut multilingual
Dutch Spending Data
JavascriptWebsite
Pixel Graphicin PDF
Dutch Spending Data
Website
Pixel Graphicin PDF
DIFFICULT!
• 367 concept (24 Excel files)
• concept hierarchy
Scrape multilingual concepts
“Long-term interest rate”@en“Lange Rente”@nl
“International items”@en“Internationale conjunctur”@nl
super concept
• 367 concept (24 Excel files)
• concept hierarchy
Scrape multilingual concepts
“Long-term interest rate”@en“Lange Rente”@nl
“International items”@en“Internationale conjunctur”@nl
super concept
[1] Open Data Day Galway with results http://www.opendataday.org/wiki/City_Events#Galway
[2] Multilingual scraper fo Dutch Government Data http://scraperwiki.com/scrapers/cpbnl-multilingual-terminology/
References