The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share &...

22
The Semantic Web from 30.000ft Frank van Harmelen ative Commons License: owed to share & remix, must attribute & non-commercial

Transcript of The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share &...

Page 1: The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.

The Semantic Web from 30.000ft

Frank van Harmelen

Creative Commons License: allowed to share & remix,but must attribute & non-commercial

Page 2: The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.

http://www.youtube.com/watch?v=tBSdYi4EY3s

Page 3: The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.

The Semantic Web =a big engineering effort +

a set of information structuring principles

Page 4: The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.

People

Web

Machines

HOW?

Page 5: The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.

The Current Web of text and pictures

a web page in English

aboutFrank

And this page is about

LarKC

and another web page

aboutFrank

And this page is about

Stefano

This page is about the Vrije

Uniersitei

linked web-pages, written by people, written for people, used only by people...

Many of these pagesalready come from data,that is usable by computers!But we can’t link the data....

?

? ?

?

The Future Web of Data

?

linked data,usable by computers!useful for people!

Page 6: The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.

P1. Give all things a name

Page 7: The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.

P2. Relations form a graph between things

Page 8: The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.

P3. The names are addresses on the Web

x T

[<x> IsOfType <T>]

differentowners & locations

<village>

Page 9: The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.

P1+P2+P3 = Giant Global Graph

Page 10: The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.

P4. explicit & formal semantics

• assign types to things• assign types to relations• organise types in a hierarchy• empose constraints on

possible interpretations

Page 11: The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.

What’s it like to be a computer on the web?

Page 12: The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.

Examples of “semantics”

Semantics = predictable inference

Frank Lyndamarried-to

• Frank is male• married-to relates

males to females

• married-to relates 1 male to 1 female

• Lynda = Hazel

lowerbound upperbound

Hazelmarried-to

Page 13: The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.

Did you get anywhere? (1/2)

already many billions of facts & rules

Encyclopedia

Encyclopedia

Geographic names (millio

ns)

Geographic names (millio

ns)

names of artists

& art works

(10.000’s)

names of artists

& art works

(10.000’s)

scientific bibliographies

scientific bibliographies

hierarchical dictionaries

(UK, FR, NL)

hierarchical dictionaries

(UK, FR, NL)

life-sc

ience databases

life-sc

ience databases

any CD ever recorded (almost)

any CD ever recorded (almost)

May ‘09 estimate > 4.2 billion triples + 140 million interlinksMay ‘09 estimate > 4.2 billion triples + 140 million interlinks

basic facts on every country

on the planet

basic facts on every country

on the planet

common sense rules &

facts (100.000’s)

common sense rules &

facts (100.000’s)

Page 14: The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.

It gets bigger every month

25 billion facts & relations…

Page 15: The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.

Real life examples• handcrafted

– music: CDnow (2410/5), MusicMoz (1073/7)– biomedical: SNOMED (200k), GO (15k),

Emtree(45k+190kSystems biology

• ranging from lightweight – Yahoo, UNSPC, Open directory (400k)

to heavyweight (Cyc (300k))

• ranging from small (METAR) to large (UNSPC)

Did you get anywhere? (2/2)

Page 16: The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.

Did you get anywhere? (3/2)

used by• media: BBC, Reuters, New York Times, • governments• retail (10K companies):

BestBuy, Sears, Kmart, Volkswagen, Renault• IT: IBM, Oracle• Search: Bing, Yahoo, Google

(before May 2012, after May 2012)

Page 17: The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.

Any lessons?

Page 18: The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.

heterogeneityis unavoidable

Page 19: The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.

Much heterogeneityis solved socially

Page 20: The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.

knowledge obeysa long-tail distribution

Page 21: The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.

Types

Page 22: The Semantic Web from 30.000ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.

Semantic Web from 30.000ft ?

It’s not yet very well understood

It’s a surprisingly successful engineering effort

It’s a handful of principles