Controlled Vocabularies and Text Mining - Use Cases at the Goettingen State and University Library
Ralf Stockmann
TextGrid Workshops – July 13th 2011
Textmining
Enhanced Context-Search
Multilingual Access
DBPedia, ...
Visualisation
Metadata
OCR/Fulltext
Named Entity Recognition
Catalog Data
Crowdsourcing
Annotation Tools
Relationship Graphs
Linked Open Data
Ontologies
Scholars
Libraries
Repositories
Use case #1:
eAqua
Project: eAqua
• Partners:
  – Institute of Computer Science, Computational Linguistics, Leipzig (Büchler, Eckart, Heyer, Baumgardt)
  – SUB Göttingen (Stockmann, Kothe, Mahnke)
• Comparing semantic graphs between
  – headings of journal articles and
  – fulltext of the same articles
Search Term „socialism“ on title elements
„Mephisto“ on fulltext
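The slides do not say how the semantic graphs were built. As a rough sketch only, the comparison of title and fulltext graphs could be approximated with a simple co-occurrence graph: two words are linked when they appear in the same text unit, and the neighbourhoods of a search term in both graphs are compared. The function names, the per-text co-occurrence window, and the Jaccard overlap measure are assumptions for illustration, not the eAqua method.

```python
import re
from collections import Counter
from itertools import combinations


def cooccurrence_graph(texts):
    """Count how often two words occur together in the same text unit."""
    edges = Counter()
    for text in texts:
        # One node per distinct word; no stop-word filtering in this sketch.
        words = sorted(set(re.findall(r"\w+", text.lower())))
        for a, b in combinations(words, 2):
            edges[(a, b)] += 1
    return edges


def neighbours(graph, term, top_n=10):
    """Return the words most often linked to `term` in the graph."""
    scores = Counter()
    for (a, b), count in graph.items():
        if a == term:
            scores[b] += count
        elif b == term:
            scores[a] += count
    return [word for word, _ in scores.most_common(top_n)]


def jaccard(a, b):
    """Overlap of two neighbour lists, between 0.0 and 1.0."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b) if a | b else 0.0
```

Building one graph from article titles and one from the fulltext of the same articles, then calling `jaccard(neighbours(title_graph, term), neighbours(fulltext_graph, term))`, gives a crude number for how differently a term is embedded in headings and in running text.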
Use case #2:
Europeana 4D visualisation
Partner:
Concept
MAP TIMELINE
• Multiple data layers
• Interaction
• Animation
• Aggregation of data
• Connections
• Drilldown
• Historical/custom maps
• Result table
• Splitting datasets
• ...
Refinement
Technological Framework
• OpenLayers
• Simile Timeline/Timeplot
• GeoNames (Geoparser...)
• Explorer Canvas (Google)
• GeoServer (OpenStreetMap, Google Maps)
• Google Web Toolkit (GWT)
• KML (XML)
Data Model
            WHAT?              WHERE?        WHEN?
MANDATORY   NAME               COORDINATES   TIMESTAMP
optional    description, url   address       range
Exchange Format: KML (XML)
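As an illustration of how the WHAT/WHERE/WHEN model might map onto KML, the following sketch builds a Placemark with Python's standard library. It uses only standard KML 2.2 elements (`name`, `address`, `description`, `TimeStamp`, `TimeSpan`, `Point`); the e4D specification linked in the slides may require a different layout, and the helper names and the choice to append the url to the description are assumptions.

```python
import xml.etree.ElementTree as ET

KML_NS = "http://www.opengis.net/kml/2.2"
ET.register_namespace("", KML_NS)  # serialise KML as the default namespace


def _sub(parent, tag, text=None):
    """Append a KML-namespaced child element, optionally with text."""
    element = ET.SubElement(parent, f"{{{KML_NS}}}{tag}")
    if text is not None:
        element.text = text
    return element


def make_placemark(name, lon, lat, when, description=None, url=None,
                   address=None, end=None):
    """Mandatory: name, coordinates, timestamp. Everything else is optional."""
    placemark = ET.Element(f"{{{KML_NS}}}Placemark")
    _sub(placemark, "name", name)                      # WHAT? (mandatory)
    if address:
        _sub(placemark, "address", address)            # WHERE? (optional)
    # Assumption: the optional url is simply appended to the description.
    text = " ".join(part for part in (description, url) if part)
    if text:
        _sub(placemark, "description", text)           # WHAT? (optional)
    if end:                                            # WHEN? as a range
        span = _sub(placemark, "TimeSpan")
        _sub(span, "begin", when)
        _sub(span, "end", end)
    else:                                              # WHEN? as a timestamp
        _sub(_sub(placemark, "TimeStamp"), "when", when)
    point = _sub(placemark, "Point")                   # WHERE? (mandatory)
    _sub(point, "coordinates", f"{lon},{lat}")         # KML order: lon,lat
    return placemark


def make_kml(placemarks):
    """Wrap placemarks in a KML document and return it as a string."""
    kml = ET.Element(f"{{{KML_NS}}}kml")
    document = _sub(kml, "Document")
    document.extend(placemarks)
    return ET.tostring(kml, encoding="unicode")
```

For example, `make_kml([make_placemark("Göttingen", 9.9355, 51.5413, "2011-07-13")])` returns a small KML document with one dated, georeferenced item (the coordinates here are illustrative values).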
Questionnaire
Datasets
• Library catalog
• Flickr
• IMDB
• DBpedia
• WikiLeaks
Flickr: „tsunami“
Use your own data in 5 easy steps!
1. Take a look at the .kml specification: http://tinyurl.com/e4d-kml
2. Build your own KML dataset
3. Upload it to a webserver
4. Put the URL into the prototype at http://tinyurl.com/e4d-demo2
5. Share your set via the magnetic link!
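Before uploading a dataset, it can help to check that every entry carries the fields the data model marks as mandatory. The sketch below is one possible pre-upload check, assuming standard KML 2.2 Placemarks with `name`, `coordinates`, and either `TimeStamp/when` or `TimeSpan/begin`; the function name is hypothetical, and the actual e4D specification may impose other rules.

```python
import xml.etree.ElementTree as ET

KML = "{http://www.opengis.net/kml/2.2}"


def check_placemarks(kml_text):
    """Return a list of human-readable problems; empty means all fields found."""
    root = ET.fromstring(kml_text)
    problems = []
    for index, placemark in enumerate(root.iter(f"{KML}Placemark"), start=1):
        name = placemark.findtext(f"{KML}name", default="").strip()
        label = name or f"placemark #{index}"
        if not name:
            problems.append(f"{label}: missing name")
        coords = placemark.findtext(f".//{KML}coordinates", default="").strip()
        if not coords:
            problems.append(f"{label}: missing coordinates")
        has_time = (
            placemark.find(f"{KML}TimeStamp/{KML}when") is not None
            or placemark.find(f"{KML}TimeSpan/{KML}begin") is not None
        )
        if not has_time:
            problems.append(f"{label}: missing timestamp")
    return problems
```

An empty result only means the three mandatory fields are present; it does not validate coordinate ranges or date formats.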
Resources
• e4D info website: http://tinyurl.com/e4d-project
• Europeana thoughtLab: http://www.europeana.eu/portal/thoughtlab.html