The Vocbench Project

76
Presentations by Johannes Keizer is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License . Dr. Caterina Caracciolo Dr. Johannes Keizer Office of Knowledge Exchange, Research and Extension Food and Agriculture Organization of the UN The VocBench, a collaborative tool for Vocabulary management Presentation at NSTL, Beijing, China 2012-09-16

description

Presentation by Caterina Caracciolo and Johannes Keizer at the National Science and Technology Library

Transcript of The Vocbench Project

Page 1: The Vocbench Project

Presentations by Johannes Keizer is licensed under a Creative Commons Attribution-

NonCommercial-ShareAlike 3.0 Unported License.

Dr. Caterina CaraccioloDr. Johannes KeizerOffice of Knowledge Exchange, Research and ExtensionFood and Agriculture Organization of the UN

The VocBench, a collaborative tool for Vocabulary management

Presentation at NSTL, Beijing, China 2012-09-16

Page 2: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

www.fao.org

Page 3: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

Page 4: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

AGROVOC

Page 5: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

AGROVOC

•In 1980: • AGROVOC thesaurus was born• It was in 3 languages: English, Spanish, French• It was a paper publication

•In 2012: • AGROVOC is an RDF/SKOS resource• AGROVOC is available in 21 languages• 4 languages are under development

Page 6: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

Arabic

Chines

e

Czech

Englis

hFar

si

Frenc

h

Germ

anHind

i

Hunga

rian

Italia

n

Japa

nese

Korea

nLa

o

Polish

Portu

gues

e

Russia

n

Slovak

Spanis

h

Telugu

Thai

Turkis

h0

5000

10000

15000

20000

25000

30000

35000

40000

45000

Series1

Number of terms per language in AGROVOC

Page 7: The Vocbench Project

Linked Open Vocabularies

Page 8: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

AGROVOC LOD record

Page 9: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

The problem Thesaurus editors old fashioned, not viable

to edit SKOS

Ontology editors to complex one one side, not specific enough on the other

None of the existing tools had an integrated workflow environment

No possibility of collaborative editing, most tools were not web applications

Page 10: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

The solution: VocBench

Is a web-based working environment for managing the AGROVOC Concept Scheme

Facilitate the collaborative editing of multilingual terminology and semantic concept information

It includes administration and group management features

It includes workflows for maintenance, validation and quality assurance of the data pool

It is accessible freely to everybody to facilitates collaborative editing

10

Page 11: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

VocBench

11

Page 12: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

MODULES

Home - Recent Changes

Concept/Term Management

Relationship Management

Validation

Consistency Check

Export

User/Group Management

Statistics/Preferences

Search

Page 13: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

GROUP/MODULE PERMISSION

13

Modules\Group Administrators Publishers Validators Ontology

editorsTerm editors Guest

Home ✓ ✓ ✓ ✓ ✓ ✓

Concepts ✓ ✓ ✓ ✓ ✓ ✓

Relationships ✓ ✓ ✗ ✗ ✗ ✗

Classifications ✓ ✓ ✓ ✓ ✓ ✓

Export ✓ ✓ ✓ ✓ ✓ ✗

Search ✓ ✓ ✓ ✓ ✓ ✓

Validation ✓ ✓ ✓ ✗ ✗ ✗

Consistency ✓ ✓ ✓ ✓ ✓ ✗

Statistics ✓ ✓ ✓ ✓ ✗ ✗

Users ✓ ✓ ✗ ✗ ✗ ✗

Groups ✓ ✓ ✗ ✗ ✗ ✗

Preferences ✓ ✓ ✓ ✓ ✓ ✗

Page 14: The Vocbench Project

VocBench 2.0

Page 15: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

Coming features

• Any SKOS fileGeneric

• Adding a layer so VocBench can connect to most leading triple-storesConfiguration

• per-language• per-concept schemeComplex rights

• Module to align multiple SKOSAlignment

• Validation• Term changesBulk actions

Page 16: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

VocBench 2.0

VB 2.0 will natively support SKOS

Separation of the business and presentation layers from the data layer

This allows VB to be attached to any SPARQL-compliant triple-store database (Sesame, Jena, Allegrograph, OWLLim)

First release of VB 2.0 will retain the same interface as that of VB 1.0

VB 2.0 will incorporate all the functionalities available in VB 1.0

Multi-user support, roles and right administration, and publishing work flow management will inherited from VB 1.0.

Current Protégé implementation of VB 1.0 will be replaced by Semantic Turkey

All the business logic and data access layers related to ontology management will be handled by Semantic Turkey

Page 17: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

SYSTEM OVERVIEW

Version 2.0

Front end Back end

Administrative Database(MySQL)

Triple Store

Middleware

Hibernate Layer

SemanticTurkey/

OWLART API

Gilead

Service Wrapper

Layer

Google Web Toolkit(GWT)

Graph Visualization

GWT Incubator

Web services

Page 18: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

SYSTEM ARCHITECTURE

Version 2.0

Ontology repository (SKOS)System Data Repository

Semantic TurkeyJDBC (MYSQL)

Validation

Stati

stics

Use

r M

anag

emen

t

Gro

up

Man

agem

ent

Syst

em

Pref

eren

ce

GWT

Conc

ept

Man

agem

ent

Rela

tions

hip

Man

agem

ent

Sear

ch

Sche

me

Man

agem

ent

Impo

rt

Expo

rt

Cons

isten

cy

Chec

k

VOCBENCH INTERFACE

Page 19: The Vocbench Project

DEMO

Page 20: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

LOGIN PAGE

Page 21: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

Multi-lingual

21

Page 22: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

Managing change

Page 23: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

Top concepts

Page 24: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

Browse concepts

Page 25: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

Concept relationships

Page 26: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

Concept notes

Page 27: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

Browse relationships

Page 28: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

Validation

Page 29: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

Export

Formats

• SKOS• RDBMS-SQL• TBX• OWL-Simple • OWL-Complete

Criteria

• Sub-vocabulary• Scheme• Concept• Term• Date

Page 30: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

Export

Page 31: The Vocbench Project

johannes keizer/caterina caracciolohttp://aims.fao.org

Concept Graph view

Page 32: The Vocbench Project

Further Material

Page 33: The Vocbench Project

33

REGISTRATION

Page 34: The Vocbench Project

34

MENUBAR

• Link to different modules

• Sign out link to logout from the system.

• Post Comment link to send the comments about that specific module

• Help link to view the help for specific module

• Display Language link to choose the languages for data to be displayed

Page 35: The Vocbench Project

35

MultilingualFilter languages to display terms for each concepts

Page 36: The Vocbench Project

36

COMMENTS

Allows provision of feedbacks for each module

Page 37: The Vocbench Project

37

HELP

Steps by steps procedure for each module

Page 38: The Vocbench Project

38

HOME - RECENT CHANGES

• Lists all the changes in reverse chronological order

Page 39: The Vocbench Project

39

SEARCH

Page 40: The Vocbench Project

40

SEARCH

• by string: the user can specify if the system should search by exact match, beginning with, contains or fuzzy

• by URI or term code; or by range of term code (e.g. between 123 and 9876)

• by classification schemes

• by creation or modification date

• by specific relationships (e.g. search all concepts using the “has_pest”)

• by status, language

• by notes/attributes

Page 41: The Vocbench Project

41

SEARCH CRITERIA• Exact match (“water” will find only water)

• Starting with (“water” will find “water”, “water management”, and “watergate”)

• Contain (“water” will find “water”, “water management”, “inland water”, and “watergate”)

• Exact word (“water” will find “water”, “water management”, “inland water” but not “watergate”)

• Case Sensitive (“ABc” will find only for the word “ABc” and not “abc”

• Include Description

– Search also content of definitions, images, editorial notes, scope notes

• Include Spelling Variant

– Search also on different spelling variants

Page 42: The Vocbench Project

42

CONCEPT MANAGEMENT• Visualize / Navigate Concepts

– depending on the access level, user can edit

• Show selected concept details

– all terms and other information

• Show selected concept relationships

• Delete concept

• Can view the URI of the concept enabling ‘Show URI’ option

Page 43: The Vocbench Project

43

CONCEPT MANAGEMENT

Page 44: The Vocbench Project

44

ADD/DELETE CONCEPT

• Add term to the newly created concept in one language• Can delete the concept and all the relationship associated with that concept

Page 45: The Vocbench Project

45

CONCEPT RELATIONSHIP

• Can create the concept-concept relationship• Inverse relationship is also created

automatically– Ex: If we create A affect B, then B is affected by A

relationship is also created

Page 46: The Vocbench Project

46

CONCEPT IMAGE

• Name of the image with description

• URL will point to the image which will open in an external

• Provide the source of the image

• Can add more translation in different language

Page 47: The Vocbench Project

47

SCOPE/EDITORIAL NOTES• In Editorial Notes, we can post the modifications

made to the selected concepts• In Scope Notes, we can post information for indexing• Both can contain multiple values in multiple

languages

Page 48: The Vocbench Project

48

CONCEPT DEFINITION• Add definition to the selected concept • Add translation in different languages• Provide the source of the definition• Creation and modified date are set automatically

Page 49: The Vocbench Project

49

TERM MANAGEMENT

• Visualize / Navigate Terms – depending on the user access level and role

• Show selected term details• Show selected term relationships• Show term variances• Add/Edit/Delete term

Page 50: The Vocbench Project

50

ADD/EDIT/DELETE TERM• Term actions will be associated to the selected concept • Term can be created in different languages• Term can be set as preferred or non preferred• An option allows preferred terms visualization only in the concept tree

browser

Page 51: The Vocbench Project

51

CONCEPT GRAPH VISUALIZATION

Page 52: The Vocbench Project

52

RELATIONSHIP MANAGEMENT

Page 53: The Vocbench Project

53

LABEL/DEFINITION

• Can add/edit/delete the label for relationship in multiple languages

• Can add/edit/delete definition of the relationships in multiple languages

Page 54: The Vocbench Project

54

PROPERTIES/ DOMAIN & RANGE

Can assign functional, transitive, inverse & symmetric properties to the selected relationship

Only functional properties can be selected for data type properties

Can specify the domain & range

Page 55: The Vocbench Project

55

VALIDATION

• Allows to verify the new concept/term/definition/relationships/notes/attributes created, edited or deleted.

Page 56: The Vocbench Project

56

STATISTICS

Provides statistical data onconceptstermsusersrelationships

Page 57: The Vocbench Project

57

EXPORT

• Formats– SKOS– RDBMS-SQL– TBX– OWL

Page 58: The Vocbench Project

58

CLASSIFICATION SCHEMES

• Add/Edit/Delete different classification schemes

• Can create new category or assign existing domain concept to the classification schemes

• Mapping categories to domain concepts is possible

Page 59: The Vocbench Project

59

USER MANAGEMENT

• Manages user detail information.• Assign each user different group privilege and languages.

Page 60: The Vocbench Project

60

GROUP MANAGEMENT

• Manages different groups, their permission and members.• Assign allowed actions for each group.

Page 61: The Vocbench Project

61

CONSISTENCY CHECK

• Checks if there is any inconsistency in the model data.

Page 62: The Vocbench Project

62

SYSTEM PREFERENCES

• Manage user information/preferences• VocBench interface

Page 63: The Vocbench Project

63

RSS

• All the changes on the VocBench content is published through RSS.

• Helps people who regularly use VocBench to stay informed about the changes.

• Shows

– all the concept/terms/relationships created/edited/deleted

– Users/Groups created/edited/deleted

Page 64: The Vocbench Project

64

WEB SERVICES

VocBench maintains access

response

uses

SKOS

TripleStore

Other Applications

Page 65: The Vocbench Project

65

WEB SERVICES

• AGROVOC Concept Server is accessible via web services, which can be called from any client application.

• The web services are realized on Apache Axis running on Tomcat.

• Using web services, –helps third party application for easy access to WB content–changes on the content of AGROVOC Concept Server can be

accessed immediately–reduces the time and effort necessary to download and

incorporate the latest version of the AGROVOC Concept Server content into applications

Page 66: The Vocbench Project

66

WEB SERVICES

Page 67: The Vocbench Project

67

CURRENT AGROVOC RELEASES

• AGROVOC Home Page

– http://aims.fao.org/agrovoc

• Latest stable release version 1.0:

– http://agrovoc.mimos.my

• Web Services Test Client

– http://202.73.13.50:55381/ACSWWebserviceV1Client/sampleACSWWebServiceProxy/TestClient.jsp

Page 68: The Vocbench Project

68

TECHNICAL DETAILS

Page 69: The Vocbench Project

69

SYSTEM ARCHITECTUREVersion 1.3

Ontology repository (OWL)System Data Repository

Protégé OWL APIJDBC (MYSQL)

Validation

Stati

stics

Use

r M

anag

emen

t

Gro

up

Man

agem

ent

Syst

em

Pref

eren

ce

GWT

Conc

ept

Man

agem

ent

Rela

tions

hip

Man

agem

ent

Sear

ch

Sche

me

Man

agem

ent

Impo

rt

Expo

rt

Cons

isten

cy

Chec

k

VOCBENCH INTERFACE

Page 70: The Vocbench Project

70

SYSTEM OVERVIEWVersion 1.3

Front end Back end

Administrative Database(MySQL)

Protégé Triple Store

Middleware

Hibernate Layer

ProtégéOWL API

Gilead

Intermediate Layer

Google Web Toolkit(GWT)

Graph Visualization

GWT Incubator

Web services

Page 71: The Vocbench Project

71

VocBench 1.0 2.0

Version 1.x

GWT/Presentation Business logic

Protégé API 3.4.x MySQL

Version 2.0

GWT/Presentation Business logic

Semantic Turkey• OWL ART API Triple Store

Page 72: The Vocbench Project

72

SYSTEM OVERVIEW• Front End

– GWT/GWT Incubator• Google Web Toolkit

– Graph Visualization• Java Applet

• Middle ware– Service wrapper layer

• separates client and service layer• allows to switch between old Protégé implementation and Semantic Turkey

– Semantic Turkey/OWL Art API– Gilead

• integrating GWT and Hibernate

• Back end– Administrative DB : MySQL– Triple Store: Sesame/Jena/Allegrograph,..

Page 73: The Vocbench Project

73

Semantic Turkey/OWLArt API• Semantic Turkey provides

– business logic including OWL and SKOS support, – import for OWL and SKOS, – triple-store agnostic design and – an OSGi-compliant plug-in architecture.

• Semantic Turkey uses the OWLArt API which is already partially integrated within VocBench.

• OWLArt API offers– a basic RDF triple query and SPARQL access layer, which need to be

implemented for any specific triple store– methods more closely connected with operations related to the standard

vocabularies of the RDF family (from RDF to RDFS, OWL, SKOS and SKOS-XL).

Page 74: The Vocbench Project

74

Semantic Turkey Architecture

Page 75: The Vocbench Project

75

TOOLS/TECHNOLOGIES

Tools Description

EclipseEclipse an open source community, whose projects are focused on building an open development platform comprised of extensible frameworks, tools and runtimes for building, deploying and managing software across the lifecycle.

GWT Google Web Toolkit (GWT) is an open source Java software development framework that makes writing ajax web application.

Protégé OWL APIProtégé-OWL API is an open-source Java library for the Web Ontology Language and RDF(S) providing classes and methods to load and save OWL files, to query and manipulate OWL data models, and to perform reasoning.

OWL ART API OWL ART API offer an abstraction layer over different RDF triple store technologies. Currently available for Sesame or Jena Implementation.

Semantic TurkeySemantic Turkey is a platform for Semantic Bookmarking and Ontology Development realized by the ART Research Group at the University of Rome, Tor Vergata

MySQL MySQL is an open source relational database system

Hibernate Hibernate is a powerful, high performance object/relational persistence and query service.

Apache Tomcat Apache Tomcat is the servlet container that is used in the official Reference Implementation for the Java Servlet and JSP technologies.

Page 76: The Vocbench Project

76

TOOLS/TECHNOLOGIES VersionTool/Technology Version Reference

JAVA 1.5 or above http://java.sun.com/javase/downloads/index.jsp

Mysql 5 or above http://dev.mysql.com/downloads/

Hibernate 3.1 https://www.hibernate.org/

Gilead 1.2.3 http://noon.gilead.free.fr/gilead/

GWT 1.7 http://code.google.com/webtoolkit/versions.html

GWT Incubator gwt-incubator drop for gwt 1.7 Revision 1686

http://code.google.com/p/google-web-toolkit-incubator/

Protégé OWL API 3.4 Build 130 http://protege.cim3.net/download/old-releases/3.4%20betas/build-130/full/

OWL ART API 2.0.4 http://art.uniroma2.it/owlart/

Semantic Turkey 0.8.5.x http://semanticturkey.uniroma2.it/

Apache Tomcat 6 or above http://tomcat.apache.org/download-55.cgi