Ivan Herman, W3C, “Semantic Café”, organized by the W3C Brazil Office São Paulo, Brazil,...

Ivan Herman, W3C, Semantic Caf, organized by the W3C Brazil Office So Paulo, Brazil, 2010-10-15

(4) Site editors roam the Web for new facts may discover further links while roaming They update the site manually And the site gets soon out-of-date

(5) Editors roam the Web for new data published on Web sites Scrape the sites with a program to extract the information Ie, write some code to incorporate the new data Easily get out of date again

(6) Editors roam the Web for new data via API-s Understand those input, output arguments, datatypes used, etc Write some code to incorporate the new data Easily get out of date again

(7) Use external, public datasets Wikipedia, MusicBrainz, They are available as data not API-s or hidden on a Web site data can be extracted using, eg, HTTP requests or standard queries

(8) Use the Web of Data as a Content Management System Use the community at large as content editors

(10) There are more an more data on the Web government data, health related data, general knowledge, company information, flight information, restaurants, More and more applications rely on the availability of that data

(11) Photo credit nepatterson, Flickr

(12) A Web where documents are available for download on the Internet but there would be no hyperlinks among them

(14) We need a proper infrastructure for a real Web of Data data is available on the Web data are interlinked over the Web (Linked Data) I.e., data can be integrated over the Web

(15) Photo credit kxlly, Flickr

(16) We will use a simplistic example to introduce the main Semantic Web concepts

(17) Map the various data onto an abstract data representation make the data independent of its internal representation Merge the resulting representations Start making queries on the whole! queries not possible on the individual data sets

(19) IDAuthorTitlePublisherYear ISBN 0-00-6511409-Xid_xyzThe Glass Palaceid_qpr2000 IDNameHomepage id_xyzGhosh, Amitavhttp://www.amitavghosh.co m IDPublishers nameCity id_qprHarper CollinsLondon

(20) http://isbn/000651409X Ghosh, Amitav http://www.amitavghosh.com The Glass Palace 2000 London Harper Collins a:title a:year a:city a:p_name a:name a:homepage a:author a:publisher

(21) Data export does not necessarily mean physical conversion of the data relations can be generated on-the-fly at query time via SQL bridges scraping HTML pages extracting data from Excel sheets etc. One can export part of the data

(23) ABCD 1 IDTitreTraducteurOriginal 2 ISBN 2020286682Le Palais des Miroirs $A12$ISBN 0-00-6511409-X 3 4 5 6 IDAuteur 7 ISBN 0-00- 6511409-X $A11$ 8 9 10 Nom 11 Ghosh, Amitav 12 Besse, Christianne

(24) http://isbn/000651409X Ghosh, Amitav Besse, Christianne Le palais des miroirs f:original f:nom f:traducteur f:auteur f:titre http://isbn/2020386682 f:nom

(25) http://isbn/000651409X Ghosh, Amitav Besse, Christianne Le palais des miroirs f:original f:nom f:traducteu r f:auteur f:titre http://isbn/2020386682 f:nom http://isbn/000651409X Ghosh, Amitav http://www.amitavghosh.com The Glass Palace 2000 London Harper Collins a:title a:year a:city a:p_name a:name a:homepage a:author a:publisher

(26) http://isbn/000651409X Ghosh, Amitav Besse, Christianne Le palais des miroirs f:original f:nom f:traducteu r f:auteur f:titre http://isbn/2020386682 f:nom http://isbn/000651409X Ghosh, Amitav http://www.amitavghosh.com The Glass Palace 2000 London Harper Collins a:title a:year a:city a:p_name a:name a:homepage a:author a:publisher Same URI!

(27) a:title Ghosh, Amitav Besse, Christianne Le palais des miroirs f:original f:nom f:traducteu r f:auteur f:titre http://isbn/2020386682 f:nom Ghosh, Amitav http://www.amitavghosh.com The Glass Palace 2000 London Harper Collins a:year a:city a:p_name a:name a:homepage a:author a:publisher http://isbn/000651409X

(28) User of data F can now ask queries like: give me the title of the original well, donnes-moi le titre de loriginal This information is not in the dataset F but can be retrieved by merging with dataset A!

(29) We feel that a:author and f:auteur should be the same But an automatic merge doest not know that! Let us add some extra information to the merged data: a:author same as f:auteur both identify a Person a term that a community may have already defined: a Person is uniquely identified by his/her name and, say, homepage it can be used as a category for certain type of resources

(30) Besse, Christianne Le palais des miroirs f:original f:nom f:traducteu r f:auteur f:titre http://isbn/2020386682 f:nom Ghosh, Amitav http://www.amitavghosh.com The Glass Palace 2000 London Harper Collins a:title a:year a:city a:p_name a:name a:homepage a:author a:publisher http://isbn/000651409X http://foaf/Person r:type

(31) User of dataset F can now query: donnes-moi la page daccueil de lauteur de loriginal well give me the home page of the originals auteur The information is not in datasets F or A but was made available by: merging datasets A and datasets F adding three simple extra statements as an extra glue

(32) Using, e.g., the Person, the dataset can be combined with other sources For example, data in Wikipedia can be extracted using dedicated tools e.g., the dbpedia project can extract the infobox information from Wikipedia alreadydbpedia

(33) Besse, Christianne Le palais des miroirs f:original f:no m f:traducteu r f:auteur f:titre http://isbn/2020386682 f:nom Ghosh, Amitav http://www.amitavghosh.com The Glass Palace 2000 London Harper Collins a:title a:year a:city a:p_name a:name a:homepage a:author a:publisher http://isbn/000651409X http://foaf/Person r:type http://dbpedia.org/../Amitav_Ghosh r:type foaf:namew:reference

(34) Besse, Christianne Le palais des miroirs f:original f:nom f:traducteu r f:auteur f:titre http://isbn/2020386682 f:nom Ghosh, Amitav http://www.amitavghosh.com The Glass Palace 2000 London Harper Collins a:title a:year a:city a:p_name a:name a:homepage a:author a:publisher http://isbn/000651409X http://foaf/Person r:type http://dbpedia.org/../Amitav_Ghosh http://dbpedia.org/../The_Hungry_Tide http://dbpedia.org/../The_Calcutta_Chromosome http://dbpedia.org/../The_Glass_Palace r:type foaf:namew:reference w:author_of w:isbn

(35) Besse, Christianne Le palais des miroirs f:original f:nom f:traducteu r f:auteur f:titre http://isbn/2020386682 f:no m Ghosh, Amitav http://www.amitavghosh.com The Glass Palace 2000 London Harper Collins a:title a:year a:city a:p_name a:name a:homepage a:author a:publisher http://isbn/000651409X http://foaf/Person r:type http://dbpedia.org/../Amitav_Ghosh http://dbpedia.org/../The_Hungry_Tide http://dbpedia.org/../The_Calcutta_Chromosome http://dbpedia.org/../Kolkata http://dbpedia.org/../The_Glass_Palace r:type foaf:namew:reference w:author_of w:born_in w:isbn w:long w:lat

(36) It may look like it but, in fact, it should not be What happened via automatic means is done every day by Web users! The difference: a bit of extra rigour so that machines could do this, too

(37) We combined different datasets that are somewhere on the web are of different formats (mysql, excel sheet, etc) have different names for relations We could combine the data because some URI-s were identical (the ISBN-s in this case)

(38) We could add some simple additional information (the glue), also using common terminologies that a community has produced As a result, new relations could be found and retrieved

(39) We could add extra knowledge to the merged datasets e.g., a full classification of various types of library data geographical information etc. This is where ontologies, extra rules, etc, come in ontologies/rule sets can be relatively simple and small, or huge, or anything in between Even more powerful queries can be asked as a result

(40) Data in various formats Data represented in abstract format Applications Map, Expose, Manipulate Query

(41) The Semantic Web is a collection of technologies to make such integration of Linked Data possible!

(42) an abstract model for the relational graphs: RDF add/extract RDF information to/from XML, (X)HTML: GRDDL, RDFa a query language adapted for graphs: SPARQL characterize the relationships and resources: RDFS, OWL, SKOS, Rules applications may choose among the different technologies reuse of existing ontologies that others have produced (FOAF in our case)

(43) Data in various formats Data represented in RDF with extra knowledge (RDFS, SKOS, RIF, OWL,) Applications RDB RDF, GRDDL, RDFa, SPARQL, Inferences

(46) Datasets (e.g., MusicBrainz) are published in RDF Some simple vocabularies are involved Those datasets can be queried together via SPARQL The result can be displayed following the BBC style

(48) A set of core technologies are in place Lots of data (billions of relationships) are available in standard format see the Linked Open Data Cloud

(49) There is a vibrant community of academics: universities of Southampton, Oxford, Stanford, PUC small startups: Garlik, Talis, C&P, TopQuandrant, Cambridge Semantics, OpenLink, major companies: Oracle, IBM, SAP, users of Semantic Web data: Google, Facebook, Yahoo! publishers of Semantic Web data: New York Times, US Library of Congress, open governmental data (US, UK, France,)

(50) Companies, institutions begin to use the technology: BBC, Vodafone, Siemens, NASA, BestBuy, Tesco, Korean National Archives, Pfizer, Chevron, see http://www.w3.org/2001/sw/UseCases Truth must be said: we still have a way to go deployment may still be experimental, or on some specific places only

(53) Help in finding the best drug regimen for a specific case, per patient Integrate data from various sources (patients, physicians, Pharma, researchers, ontologies, etc) Data (eg, regulation, drugs) change often, but the tool is much more resistant against change Courtesy of Erick Von Schweber, PharmaSURVEYOR Inc., (SWEO Use Case)(SWEO Use Case)

(54) Integration of relevant data in Zaragoza Use rules to provide a proper itinerary Courtesy of Jess Fernndez, Mun. of Zaragoza, and Antonio Campos, CTIC (SWEO Use Case)(SWEO Use Case)

(55) Tools have to improve scaling for very large datasets quality check for data etc There is a lack of knowledgeable experts this makes the initial step tedious leads to a lack of understanding of the technology But we are getting there!

(56) A huge amount of data (information) is available on the Web Sites struggle with the dual task of: providing quality data providing usable and attractive interfaces to access that data

(57) Raw Data Now! Tim Berners-Lee, TED Talk, 2009 http://bit.ly/dg7H7Z Raw Data Now! Tim Berners-Lee, TED Talk, 2009 http://bit.ly/dg7H7Z Semantic Web technologies allow a separation of tasks: 1. publish quality, interlinked datasets 2. mash-up datasets for a better user experience

(58) The network effect is also valid for data There are unexpected usages of data that authors may not even have thought of Curating, using, exploiting the data requires a different expertise

(59) Thank you for your attention! These slides are also available on the Web: http://www.w3.org/2010/Talks/1015-SauPaulo-SemCafe-IH/

Ivan Herman, W3C, “Semantic Café”, organized by the W3C Brazil Office São Paulo, Brazil,...

Documents

Transcript of Ivan Herman, W3C, “Semantic Café”, organized by the W3C Brazil Office São Paulo, Brazil,...

Tutorial on the Semantic Web (Last update: 26 May 2009) adapted from (C) Ivan Herman, W3C

1 Introduction to the Semantic Web (tutorial) Johnson & Johnson Philadelphia, USA October 30, 2009 Ivan Herman, W3C ivan@w3.org.

Tutorial on the Semantic Web cntd. (Last update: 26 May 2009) adapted from (C) Ivan Herman, W3C

Ivan Herman W3C. (2) Current RDF has been published in 2004 Significant deployment since then ◦ implementation experiences ◦ users’ experiences

What is being done today? Ivan Herman, W3C Deutsche ... 6 or 7 geographically distributed databases, data sources, and web services… Michael Grove, Clark & Parsia, LLC, and Andrew

W3C Recent Activities 2014 Q4 : W3C最新活動 2014 Q4

1 Tutorial on the Semantic Web cntd. (Last update: 26 May 2009) adapted from (C) Ivan Herman, W3C Given at AAU @ WE course by Peter Dolog Adapted: October.

Embedding Knowledge in HTML Some content from a presentations by Ivan Herman of the W3c.

Semantic Web, Linked Data, and Semantic 3D Media Ivan Herman… · 2010. 2. 1. · 1 Semantic Web, Linked Data, and Semantic 3D Media Ivan Herman, W3C FOCUS 3D Conference, Sophia

W3C Current Activities1 W3C Current Activities Dagstuhl Perspective Workshop: Semantic Web Reflections and Future Directions Dagstuhl, Germany, June 30, 2009 Ivan Herman ivan@w3.org

Some W3C SW technologies to watch… Ivan Herman, W3C Semantic Web Gathering Amsterdam, 16 March, 2009.

1 Tutorial on the Semantic Web (Last update: 26 May 2009) adapted from (C) Ivan Herman, W3C Given at AAU @ WE course by Peter Dolog Adapted: October 2010.

Virtual Reality @W3C - VRIF › wp-content › uploads › VRSTD-03-W3C... · W3C Workshop on Web & Virtual Reality W3C Workshop on WebVR Authoring: ... Formats, Vocabularies, APIs

Ivan Herman - Semantic Web Activities @ W3C

SVG Tutorial - World Wide Web Consortium (W3C) · PDF fileSVG Tutorial David Duce *, Ivan Herman +, Bob Hopgood * * Oxford Brookes University, + World Wide Web Consortium Contents

Introduction to the Semantic Web · PDF file1 Introduction to the Semantic Web (tutorial) 2009 Semantic Technology Conference San Jose, California, USA June 15, 2009 Ivan Herman, W3C

Cambridge Semantic Web Gathering 2012-10-09, Cambridge, MA, USA Ivan Herman W3C, Semantic Web Activity Lead.

New i18n Developments at the W3C Richard Ishida W3C An overview of the W3C Internationalization Activity.

Introduction to the World Wide Web ConsortiumIntroduction to the World Wide Web Consortium (W3C) Ivan Herman Zheijiang University, Hangzhou, China September 2, 2009 2 A bit of history…

Introduction to Semantic Web and RDF RDF, Linked Data workshop at DANS The Hague, 28 th July, 2010, Ivan Herman, W3C.

SVG Tutorial - World Wide Web Consortium (W3C) · PDF fileSVG Tutorial David Duce , Ivan Herman +, Bob Hopgood * Oxford Brookes University, + World Wide Web Consortium Contents