Scientific social tagging - background knowledge come to surface (AquaRing Project)
-
Upload
stefano-bianchi -
Category
Technology
-
view
105 -
download
1
description
Transcript of Scientific social tagging - background knowledge come to surface (AquaRing Project)
www.aquaringweb.euECP 2005 CULT 038261
Dublin Core Social Tagging Workshop 2009Dublin Core Social Tagging Workshop 2009
Scientific social tagging:Scientific social tagging:background knowledge comes to surface background knowledge comes to surface
Stefano BianchiStefano Bianchi
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 22
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
BackgroundBackground
Scientific institutions create, manage and store Scientific institutions create, manage and store thousands of thousands of digital contentsdigital contents (images, video, documents etc.) for their (images, video, documents etc.) for their institutional mission (research, edutainment etc.)institutional mission (research, edutainment etc.)
ClassifyingClassifying and and aggregatingaggregating such contents is a such contents is a benefitbenefito for the scientific institutions, to ease management and reusefor the scientific institutions, to ease management and reuseo for the “community”, once contents are published onlinefor the “community”, once contents are published online
ONTOLOGYONTOLOGYknowledge formalizationknowledge formalization
++METADATAMETADATA
enhanced content annotationenhanced content annotation
HOW?HOW?
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 33
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
AquaRing projectAquaRing project
eContenteContentPlusPlus EC funded programme – CALL 2005 EC funded programme – CALL 2005o FOCUS: Cultural and scientific/scholarly contentFOCUS: Cultural and scientific/scholarly content
Project Project www.www.aquaringwebaquaringweb..eueu (Sept.06-Mar.09) (Sept.06-Mar.09)
o a European cross-border a European cross-border virtual global knowledge and content virtual global knowledge and content spacespace on on aquatic environment and resourcesaquatic environment and resources
CoordinationCoordination
IT providersIT providers
EvaluationEvaluation
DisseminationDissemination
Knowledge &Knowledge &content content providersproviders
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 44
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
??
??
Distributedcontent space
Users
Centralised access
Additional content
providers
OverviewOverview
DETAILS >
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 55
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
ChallengeChallenge
integrated, semantic-based, cross-border digital collectionintegrated, semantic-based, cross-border digital collection of cultural and scientific contents of cultural and scientific contents on on aquatic and marine sciencesaquatic and marine sciences
== complex knowledge domaincomplex knowledge domain
DYNAMICDYNAMIC
HUGEHUGE
HETEROGENEOUSHETEROGENEOUS
E.g. FishBase (www.fishbase.org)
- 31,200 Species - 276,500 Common names
…only fish!
E.g. FishBase (www.fishbase.org)
- 31,200 Species - 276,500 Common names
…only fish!
Species, Land, Habitats, Environment, Fishing areas, Vessels, Leisure etc. Different languages, different audiences, different formats etc.
Species, Land, Habitats, Environment, Fishing areas, Vessels, Leisure etc. Different languages, different audiences, different formats etc.
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 66
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
DifficultiesDifficulties
Huge, heterogeneous and dynamic knowledge domainHuge, heterogeneous and dynamic knowledge domaino Marine Biology, Aquatic Sciences, Aquatic Environment, Aquatic and Marine Biology, Aquatic Sciences, Aquatic Environment, Aquatic and
marine activities and technology, Marine Culture and Leisure, marine activities and technology, Marine Culture and Leisure, Education and Awareness etc. etc.Education and Awareness etc. etc.
Several different large ontologies/thesauri exist Several different large ontologies/thesauri exist butbut no one covers the whole domainno one covers the whole domaino Integration/merging to achieve adequate coverageIntegration/merging to achieve adequate coverageo Introduction of new knowledge as it appearsIntroduction of new knowledge as it appears
Mandatory interoperability with ongoing scientific initiativesMandatory interoperability with ongoing scientific initiativeso Future extensions/collaborations/data exchangeFuture extensions/collaborations/data exchangeo Considering inclusion in Europeana (Considering inclusion in Europeana (www.europeana.euwww.europeana.eu))
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 77
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
Remedial actionsRemedial actions
To adopt a standard metadata model (DC) To adopt a standard metadata model (DC) o To ensure future interoperability with other organisations / initiativesTo ensure future interoperability with other organisations / initiatives
To use To use state-of-the-artstate-of-the-art existing ontologies existing ontologies o provided by reliable organisations in the fieldprovided by reliable organisations in the field
To transform To transform state-of-the-artstate-of-the-art existing thesauri/DB views into ontologies existing thesauri/DB views into ontologieso E.g. conversion of ASFA thesaurus into an ontologyE.g. conversion of ASFA thesaurus into an ontologyo E.g. Habitats db view from European Environment AgencyE.g. Habitats db view from European Environment Agency
To create a new ontology for uncovered sub-domainsTo create a new ontology for uncovered sub-domainso using reliable scientific data as sources (E.g. EDUcational ontology)using reliable scientific data as sources (E.g. EDUcational ontology)
To support a mixed annotation approach To support a mixed annotation approach ((ontologies + hierarchical free tagsontologies + hierarchical free tags))
o to use knowledge from ontologies and incorporate new knowledgeto use knowledge from ontologies and incorporate new knowledge
To implement an ontology learning approach To implement an ontology learning approach o to learn from content annotation creating a unified ontologyto learn from content annotation creating a unified ontology
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 88
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
ApproachApproach
How can How can semanticssemantics support a support a cross-border cross-border virtual global knowledge virtual global knowledge andand content space content space on on aquatic environment aquatic environment andand resources resources??
ContentsContentso managementmanagemento annotationannotation
• semantics (meaning) semantics (meaning) **– MetadataMetadata
» data about data (e.g. “identity card” of a resource)data about data (e.g. “identity card” of a resource)– OntologyOntology
» ““an ontology is an explicit specification [i.e. formalization] of a conceptualization” an ontology is an explicit specification [i.e. formalization] of a conceptualization” [Gruber,1993]… [Gruber,1993]… concepts + relationsconcepts + relations
KnowledgeKnowledgeo generationgeneration
• from generic to specific from generic to specific can annotation create new knowledge? can annotation create new knowledge?o exploitationexploitation
• content annotation content annotation **: semantics for domain-focused tagging: semantics for domain-focused tagging• content retrieval: semantics to refine search and guide navigationcontent retrieval: semantics to refine search and guide navigation
CONTEXTUALIZATION
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 99
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
Content managementContent management
How contents are managed?How contents are managed?o contents are locally collected and arrangedcontents are locally collected and arranged
into collections (folders) and then simplyinto collections (folders) and then simplymoved to a dedicated server (http/ftp)moved to a dedicated server (http/ftp)
o basic metadata for basic metadata for allall digital contents digital contents uploaded are automatically created and uploaded are automatically created and ready for semantic annotationready for semantic annotation
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 1010
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
Content annotationContent annotation
How contents are annotated?How contents are annotated?o once uploaded, each content/collection has a once uploaded, each content/collection has a metadatametadata record record
associated, ready to be enriched with information specifying “what associated, ready to be enriched with information specifying “what the resource is about” (the resource is about” (subjectsubject))
o tags for tags for subjectsubject can be then selected can be then selected from from a)a) seven multidisciplinary ontologies seven multidisciplinary ontologies covering different aspects of the aquaticcovering different aspects of the aquaticworld or world or b)b) can be entered freely to fill can be entered freely to fill possible gaps in coverage of each ontologypossible gaps in coverage of each ontology
• distinction is tracked for quality check!distinction is tracked for quality check!
o only once all mandatory fields are filled only once all mandatory fields are filled the content is published onlinethe content is published online
• NOTE: annotation is inherited for collections!NOTE: annotation is inherited for collections!
SubjectSubject::Hyppocamous sppHyppocamous sppNE ATLANTICNE ATLANTIC
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 1111
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
Knowledge generationKnowledge generation
How knowledge is generated?How knowledge is generated?o Semi-automatic ontology learning processSemi-automatic ontology learning process
• Relations between concepts used for annotation are Relations between concepts used for annotation are semi-automaticallysemi-automatically created on the basis of the specific contextual content annotationscreated on the basis of the specific contextual content annotations
SPECIESSPECIES HABITATSHABITATSlives inlives in
Hippocampus sppHippocampus spp NE ATLANTICNE ATLANTIClives inlives in
SubjectSubject::Hippocampus sppHippocampus sppNE ATLANTICNE ATLANTIC
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 1212
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
Knowledge-based content retrievalKnowledge-based content retrieval
How contents are retrieved?How contents are retrieved?o Services exploits annotations and generated knowledgeServices exploits annotations and generated knowledge
• To focus better search and navigation resultsTo focus better search and navigation results• To suggest refinementsTo suggest refinements• To ease content navigation and useTo ease content navigation and use
Hippocampus sppHippocampus spp
is affected byis affected by
Hippocampus sppHippocampus spp SEARCHSEARCH
is studied byis studied by
lives inlives in AquariologyAquariology
Illegal fishingIllegal fishing
NE AtlanticNE Atlantic
Hippocampus spp!Hippocampus spp!
Aquariology!Aquariology!
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 1313
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
SEMANTICSSEMANTICS
ServicesServices
Back-officeBack-officeo content management, knowledge formalizationcontent management, knowledge formalization
• Metadata editor (http://metadata.aquaringweb.eu)Metadata editor (http://metadata.aquaringweb.eu)• Ontology editor (http://ontology.aquaringweb.eu)Ontology editor (http://ontology.aquaringweb.eu)
Front-officeFront-office (http://www.aquaringweb.eu) (http://www.aquaringweb.eu)
o content provision, knowledge exploitationcontent provision, knowledge exploitation• multilanguage dynamic site frameworkmultilanguage dynamic site framework• (semantic) search engine (5 customised + 1 general)(semantic) search engine (5 customised + 1 general)• (semantic) tag cloud(semantic) tag cloud• virtual exhibitionsvirtual exhibitions• GoogleMaps-based interactive mapGoogleMaps-based interactive map
RESTRICTED RESTRICTED ACCESS!ACCESS!
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 1414
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
SemanticsSemantics
WHY?WHY?o To improve semantic-based content annotationTo improve semantic-based content annotationo To support semantic-based retrieval and navigation of contentsTo support semantic-based retrieval and navigation of contents
HOW?HOW?o Identification / adaptation of a suitable metadata formalismIdentification / adaptation of a suitable metadata formalism
• Based on Based on assessedassessed & & interoperableinteroperable metadata standards metadata standards
o Research / definition of a suitable domain ontologyResearch / definition of a suitable domain ontology• Scientific / technical evaluation of existing ontologiesScientific / technical evaluation of existing ontologies• Reliability / suitability / application domain / multilingual supportReliability / suitability / application domain / multilingual support• Merge / combine of existing ontologiesMerge / combine of existing ontologies
o Mechanisms to allow multilingual user interactionMechanisms to allow multilingual user interaction• Support for consortium languages + EnglishSupport for consortium languages + English• Multilingual annotationsMultilingual annotations• Multilingual ontologiesMultilingual ontologies
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 1515
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
MetadataMetadata
DublinCore QualifiedDublinCore Qualifiedo 24 elements (DCMES’ 15 plus some refinements)24 elements (DCMES’ 15 plus some refinements)
o 10 mandatory elements10 mandatory elements– Title, Audience, Abstract, Publisher, Type, Language, Subject, Format, Identifier, DateTitle, Audience, Abstract, Publisher, Type, Language, Subject, Format, Identifier, Date
o 4 purpose-based element groupings4 purpose-based element groupings• Core ElementsCore Elements
– Title, Audience, Abstract, Publisher, Type, Language, Rights, CreatorTitle, Audience, Abstract, Publisher, Type, Language, Rights, Creator
• Semantic AnnotationSemantic Annotation– Subject (terms selected from 7 ontologies + free tags)Subject (terms selected from 7 ontologies + free tags)
• Physical ResourcePhysical Resource– Format, Identifier, Date of creation, Date of Availability, Date of Issuing, Date of Validity, Format, Identifier, Date of creation, Date of Availability, Date of Issuing, Date of Validity,
Format extentFormat extent
• Additional ElementsAdditional Elements– Contributor, Bibliographic Citations, Source, HasVersion, Replaces, Requires, HasPart, Contributor, Bibliographic Citations, Source, HasVersion, Replaces, Requires, HasPart,
References, HasFormat, SpatialCoverage, Temporal CoverageReferences, HasFormat, SpatialCoverage, Temporal Coverage
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 1616
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
InstancesInstances
Instances used to annotate a translation of the original resourceInstances used to annotate a translation of the original resourceo Mandatory Mandatory
• to create at least one instance for each metadata record to create at least one instance for each metadata record • to create at least one annotation of the resource in Englishto create at least one annotation of the resource in English
– the metadata record or an instancethe metadata record or an instance
• to create an instance in the annotator’s native language (configurable)to create an instance in the annotator’s native language (configurable)
TitleTitle Title in the instance languageTitle in the instance language
AbstractAbstract Abstract in the instance languageAbstract in the instance language
DateDate Date of creation, modification, issuing, availability and /or validityDate of creation, modification, issuing, availability and /or validity
FormatFormat MIME typeMIME type
Format ExtentFormat Extent Size and / or durationSize and / or duration
IdentifierIdentifier Physical location of resourcePhysical location of resource
LanguageLanguage Language of the translationLanguage of the translation
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 1717
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
OntologyOntology
… … 77 ontologiesontologies selected for content annotation selected for content annotation1.1. Biological SpeciesBiological Species2.2. Fishing AreasFishing Areas3.3. Land AreasLand Areas4.4. VesselsVessels5.5. HabitatsHabitats
• programmatically developed from EUNIS Habitat types classificationprogrammatically developed from EUNIS Habitat types classification
6.6. ASFA (AQUATIC SCIENCES AND FISHERIES ABSTRACTS)ASFA (AQUATIC SCIENCES AND FISHERIES ABSTRACTS)• programmatically developed from ASFA thesaurus, provided by FAOprogrammatically developed from ASFA thesaurus, provided by FAO
7.7. EDUcationEDUcation• derived from DCMI, LOM, DC-Ed AP, LRE, IMATI ITDderived from DCMI, LOM, DC-Ed AP, LRE, IMATI ITD
Hierarchical free tags annotation mechanismHierarchical free tags annotation mechanism
AquaRing ontology generated by annotation (ontology learning)AquaRing ontology generated by annotation (ontology learning)o including relationships among the seven ontologies and merging free tagsincluding relationships among the seven ontologies and merging free tagso ontology editor for non-expertsontology editor for non-experts** developed for manual refinements developed for manual refinements
By FAOBy FAOdeveloped by EC project Neon developed by EC project Neon ((www.neon-project.org))
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 1818
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
Domain coverageDomain coverage
Species
Fishing Areas
Land
Vessels
Habitats
ASFA (general)
EDUcation
What if a species I need is missing?
But only checked names & correct classification,
please… we are scientists!
What if a species I need is missing?
But only checked names & correct classification,
please… we are scientists!
FREEFREETAGSTAGS
FREEFREETAGSTAGS
FREEFREETAGSTAGS
FREEFREETAGSTAGS
FREEFREETAGSTAGS
FREEFREETAGSTAGS
FREEFREETAGSTAGS
FREEFREETAGSTAGS
FREEFREETAGSTAGS
FREEFREETAGSTAGS
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 1919
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
FreeFree tags tags
Free tags allowed for each sub-domain (ontology) Free tags allowed for each sub-domain (ontology) o to fill in domain gaps not covered by the selected ontologiesto fill in domain gaps not covered by the selected ontologieso functionality available at annotation timefunctionality available at annotation timeo results reusable by all annotatorsresults reusable by all annotators
Annotators advised to consult Annotators advised to consult preferredpreferred thesauri thesaurio GEMETGEMET, Environment field, Environment fieldo AGROVOCAGROVOC, Agricultural field (but covers partially Marine Biology), Agricultural field (but covers partially Marine Biology)o AQUATEXTAQUATEXT, Online Aquaculture Dictionary , Online Aquaculture Dictionary o MarineSpeciesMarineSpecies, Marine organisms names, Marine organisms nameso FishBaseFishBase, contains most of fish species known to science , contains most of fish species known to science o EUROVOCEUROVOC, EC activity multilingual thesaurus, EC activity multilingual thesaurus
““Guided” free tags Guided” free tags (+ ontology editing!)(+ ontology editing!)
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 2020
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
Ontology learning approachOntology learning approach
ONTOLOGY AONTOLOGY A
ONTOLOGY BONTOLOGY B
ONTOLOGY CONTOLOGY C
“concepts = keywords” used for annotation “concepts = keywords” used for annotation
semi-automatically generated relations semi-automatically generated relations
HIERARCHICALFREE TAGS
HIERARCHICALFREE TAGS
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 2121
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
Ontology editorOntology editor
AquaRing ontology enrichment and improvement (AquaRing ontology enrichment and improvement (for non-expertfor non-expert**))o Terms and free tags translations (multilingual system!)Terms and free tags translations (multilingual system!)o Relationships managementRelationships managemento Free tags managementFree tags management
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 2222
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
ResultsResults
Relations Relations ##
AffectsAffects 18761876
ConsidersConsiders 332332
DescribesDescribes 22
ExploitsExploits 4747
Lies inLies in 37573757
Occurs_inOccurs_in 49024902
OwnsOwns 4343
StudiesStudies 62546254
UsesUses 11511151
Includes familyIncludes family 16961696
Includes orderIncludes order 110110
Includes speciesIncludes species 459459
Related toRelated to 122122
OntologyOntology # of concepts# of concepts
Biological Species (Free Tags)Biological Species (Free Tags) 68116811
Biological SpeciesBiological Species 2234322343
Educational (Free Tags)Educational (Free Tags) 9090
EducationalEducational 854854
Fishing Areas (Free Tags)Fishing Areas (Free Tags) 12971297
Fishing AreasFishing Areas 35833583
Habitats (Free Tags)Habitats (Free Tags) 14701470
HabitatsHabitats 57645764
Land Areas (Free Tags)Land Areas (Free Tags) 10591059
Land AreasLand Areas 50495049
Marine Biology (Free Tags)Marine Biology (Free Tags) 50665066
Marine BiologyMarine Biology 2418824188
Vessels (Free Tags)Vessels (Free Tags) 2828
VesselsVessels 9393
AquaRing ontologyAquaRing ontologyover 75,000!over 75,000!
over 20,000!*over 20,000!*
* Over 160,000 with translations!
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 2323
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
BenefitsBenefits
Access to resources classified in AquaRing knowledge sub-Access to resources classified in AquaRing knowledge sub-domainsdomains
Search performed according to resource meaning and user Search performed according to resource meaning and user preferencespreferenceso User’s preferences: audience type, language, format, etc. User’s preferences: audience type, language, format, etc.
• e.g. “videos for children in French”e.g. “videos for children in French”
Ontology terms suggested to refine/complete the searchOntology terms suggested to refine/complete the searcho Knowledge exploration – Knowledge exploration – domain-based guided serendipitydomain-based guided serendipity!!
Access to resources grouped by a representation of usage Access to resources grouped by a representation of usage degree of ontology terms in annotations (+ relations with other degree of ontology terms in annotations (+ relations with other terms)terms)
• Tag cloud-based navigation + relationsTag cloud-based navigation + relations
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 2424
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
Semantic technology (and Semantic web) still considerably Semantic technology (and Semantic web) still considerably unknownunknown in domains in domains where CMS and IS might benefit from such formalised approacheswhere CMS and IS might benefit from such formalised approacheso simple yet effectivesimple yet effective knowledge-based solutions are usually positively evaluated by knowledge-based solutions are usually positively evaluated by
content providerscontent providers
Semantic technology’s Semantic technology’s take-up hamperedtake-up hampered by difficulties related to the proper by difficulties related to the proper formalizationformalization of complex scientific knowledge ( of complex scientific knowledge (ontology engineeringontology engineering) and ) and classificationclassification of contents ( of contents (semantic annotationsemantic annotation))
Semantic content annotation is a valuable source of information to generate new Semantic content annotation is a valuable source of information to generate new domain knowledgedomain knowledgeo From content contextualization to knowledge formalizationFrom content contextualization to knowledge formalization
Semi-automatic generation is effective Semi-automatic generation is effective provided thatprovided that according to the domain according to the domain ((e.g. science, healthe.g. science, health) manual check is allowed / enabled () manual check is allowed / enabled (QoS problemQoS problem))o ““Freedom” is related to the application domain & objectivesFreedom” is related to the application domain & objectiveso ““Hierarchical” free tags more meaningful than “flat” free tagsHierarchical” free tags more meaningful than “flat” free tagso ““Guided” free tag approach minimizes contextual noiseGuided” free tag approach minimizes contextual noise
Lessons learntLessons learnt
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 2525
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
www.aquaringweb.eu
Contacts
Stefano BianchiResearch Team [email protected]. +39 010 6026 368Fax. +39 010 6026 350
www.softeco.it
Contacts
Stefano BianchiResearch Team [email protected]. +39 010 6026 368Fax. +39 010 6026 350
www.softeco.it
Thanks for your attention!Thanks for your attention!
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 2626
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
About contextualization…About contextualization…
Why is Why is contextualizationcontextualization important? important?
http://www.sccwrp.org/view.php?id=82http://www.sccwrp.org/view.php?id=82
chlorophyll-achlorophyll-a
low oxygenlow oxygen
macroalgal bloomsmacroalgal blooms
aquatic vegetationaquatic vegetation
eutrophic estuaryeutrophic estuary
nutrient enrichmentnutrient enrichment
NICE IMRESSIONIST LANDSCAPE?NICE IMRESSIONIST LANDSCAPE?
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 2727
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
About contextualization…About contextualization…
How can How can contextualizationcontextualization help? help?
JUST A SHARK?JUST A SHARK?
Scientific classificationScientific classification
AnimaliaAnimalia Chordata Chordata Vertebrata Vertebrata Chondrichthyes Chondrichthyes Elasmobranchii Elasmobranchii Euselachii Euselachii Carcharhiniformes Carcharhiniformes Carcharhinidae Carcharhinidae Galeocerdo Galeocerdo Galeocerdo cuvierGaleocerdo cuvier
Human impactHuman impact
BYCATCHBYCATCHLINE-FISHINGLINE-FISHINGSHARK FISHERYSHARK FISHERYFINNINGFINNINGFISHERY REGULATIONSFISHERY REGULATIONSFISHERY MORTAL DATAFISHERY MORTAL DATAFISHERY DATAFISHERY DATAHISTORICAL FISHINGHISTORICAL FISHINGENDANGERED SPECIESENDANGERED SPECIES
Geographical areaGeographical area
EC PACIFICEC PACIFICWC PACIFICWC PACIFICEC ATLANTICEC ATLANTICWC ATLANTICWC ATLANTICINDIAN OCEAN INDIAN OCEAN < BACK
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 2828
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
Final demonstration prototypeFinal demonstration prototype
www.aquaringweb.eu
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 2929
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
Search engineSearch engine
RELATIONSRELATIONSto extend to extend and refine and refine the search the search on the basis on the basis of the of the AquaRing AquaRing ontologyontology
CONTENTSCONTENTS
Hippocampus sppHippocampus spp SEARCHSEARCH
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 3030
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
Tag cloudTag cloud
CONTENTSCONTENTS
TAGS (ontology concepts): “the larger the font, the more the contents”TAGS (ontology concepts): “the larger the font, the more the contents”
RELATIONSRELATIONSto ease to ease navigation navigation on the basis on the basis of the of the AquaRing AquaRing ontologyontology
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 3131
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
Virtual exhibitionsVirtual exhibitions
Build value-added learning paths on top of aggregated digital content collectionsBuild value-added learning paths on top of aggregated digital content collections
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 3232
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
Virtual exhibitionsVirtual exhibitions
NAVIGATIONNAVIGATION
ROOMSROOMS
CONTENTCONTENT
DESCRIPTIONDESCRIPTION
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 3333
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
VideoVideo
TOPIC BROWSERTOPIC BROWSERVIDEO BROWSERVIDEO BROWSER
FLASH PREVIEWFLASH PREVIEW
DOWNLOADDOWNLOAD
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 3434
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
MapMap
Web 2.0 GoogleMaps API based content visualizationWeb 2.0 GoogleMaps API based content visualization
< BACK
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 3535
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
Architecture detailsArchitecture details
CONTENTMANAGEMENT
Easily reachable from inside(to move contents from local PCs to the server – FTP facility)
Easily reachable from outside(to retrieve contents from the portal – HTTP facility)
Central Central systemsystem
Semantic Semantic repositoryrepository(metadata + (metadata + ontologies)ontologies)
PortalPortal
AdministratioAdministration servicesn services
User servicesUser services
Global Content Space Global Content Space NodeNode
Server
Contents
Content ManagementServices
INTER
NE
INTER
NETT
Local PCs
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 3636
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
Local digital collections and objects interfacingLocal digital collections and objects interfacing
CENTRAL SYSTEMCENTRAL SYSTEM GLOBAL CONTENT SPACE NODEGLOBAL CONTENT SPACE NODE
DMZDMZ
DMZDMZ
FTP SERVERFTP SERVER
SERVLET CONTAINERSERVLET CONTAINER
SERVLET CONTAINERHTTP SERVER
SERVLET CONTAINERHTTP SERVER
LAN
INTERNET
MINIMAL FTP CLIENT(DESKTOP APP)
MINIMAL FTP CLIENT(DESKTOP APP)
HOST (SERVER)HOST (SERVER)
FTP SERVERFTP SERVER
METADATA EDITORMETADATA EDITOR
2a2a2b2b
11
33
44
55
66““One-shot” procedure
One-shot” procedure
from content provider’s desktop
from content provider’s desktop
to AquaRing portal
to AquaRing portal
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 3737
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
Local digital collections and objects interfacingLocal digital collections and objects interfacing
FTP + HTTPFTP + HTTPSERVERSERVER
FTP + HTTPFTP + HTTPSERVERSERVER
FTP + HTTPFTP + HTTPSERVERSERVER
AQR FTPCLIENTAQR FTPCLIENT AQR FTPCLIENTAQR FTPCLIENT
http://adg.contents.aquaringweb.euhttp://naus.contents.aquaringweb.euhttp://lsm.contents.aquaringweb.eu http://rzoo.contents.aquaringweb.eu http://rbins.contents.aquaringweb.euhttp://new.contents.aquaringweb.eu
AQR FTPCLIENTAQR FTPCLIENTAQR FTPCLIENTAQR FTPCLIENT AQR FTPCLIENTAQR FTPCLIENT AQR FTPCLIENTAQR FTPCLIENT
??????
ftp://81.208.74.210 http://81.208.74.210
ftp://81.93.5.231 http://www.nausicaa.fr/aquaring
3 physical nodes 3 physical nodes < BACK
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 3838
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
Metadata annotation & collectionsMetadata annotation & collections
Metadata annotation = time consuming taskMetadata annotation = time consuming tasko Review reportReview report
• ““Think about alternative methods and technologies to address the time consuming Think about alternative methods and technologies to address the time consuming multilingual metadata annotation”multilingual metadata annotation”
• ““The metadata annotation requires a lot of labour-intensive effort and specialized The metadata annotation requires a lot of labour-intensive effort and specialized expertise: this might become a risk for the sustainability of the project”expertise: this might become a risk for the sustainability of the project”
““HierarchicalHierarchical collectionscollections””o Dublin Core: “A collection is an aggregation of items. The term collection Dublin Core: “A collection is an aggregation of items. The term collection
means that the resource is described as a group; its parts may be separately means that the resource is described as a group; its parts may be separately described and navigated.”described and navigated.”
Many contents with similar “meaning” aggregated in a Many contents with similar “meaning” aggregated in a collectioncollection (folder or even folder tree) (folder or even folder tree)o Iterative automatic creation of metadata for inner contentIterative automatic creation of metadata for inner content
Specific annotation can be specified at any levelSpecific annotation can be specified at any levelo Necessary requirement for annotation quality!!!Necessary requirement for annotation quality!!!
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 3939
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
CollectionsCollections
COLLECTION = FOLDER (or folder tree!)COLLECTION = FOLDER (or folder tree!)
COLLECTION METADATA
COLLECTION METADATA
CONTENT METADATACONTENT METADATA
General description, valid for all General description, valid for all contents included in the contents included in the collection (e.g. photo campaign collection (e.g. photo campaign on the same on the same area/species/individual etc.)area/species/individual etc.)
Specific description, valid for a Specific description, valid for a specific content (e.g. peculiarity, specific content (e.g. peculiarity, location etc.)location etc.)
++
==
Semantic description Semantic description (inheritance)(inheritance)
Faster annotation on large
Faster annotation on large
collections, detailed description
collections, detailed description
on specific contents!!!
on specific contents!!!
Milan, June 10th 2009Milan, June 10th 2009 DC Social Tagging Workshop 2009DC Social Tagging Workshop 2009 4040
ECP 2005 CULT 038261
www.aquaringweb.euwww.aquaringweb.eu
Collections & content managementsCollections & content managements
AUTOMATIC METADATA CREATIONAUTOMATIC METADATA CREATIONON CONTENT/FILE FTP TRANSFERON CONTENT/FILE FTP TRANSFER
Once created, metadata can be refined (E.g. subject)Once created, metadata can be refined (E.g. subject)
All available information on files (name, size, format etc.) can be All available information on files (name, size, format etc.) can be automatically detected and used for the annotationautomatically detected and used for the annotation
LOCAL PC FTP SERVER
AquaRing FTP clientAquaRing FTP client - simple Java standalone desktop - simple Java standalone desktop applicationapplication - file/folder selection from local pc- file/folder selection from local pc - one shot ftp transfer- one shot ftp transfer - add/remove file/folder on ftp server- add/remove file/folder on ftp server
Fully integrated & synchronized with Fully integrated & synchronized with Metadata Editor functionalitiesMetadata Editor functionalities- trigger system for metadata - trigger system for metadata creation/deletioncreation/deletion