LiDIA: An integration architecture to query Linked Open Data from multiple datasets

21
LiDIA: An integration architecture to query Linked Open Data from multiple datasets ENC 2013 – Mexican International Conference on Computer Science MSc Cristian A. Rodríguez Enríquez - PhD Giner Alor Hernández – PhD Guillermo Cortés Robles Division of Research and Postgraduate Studies Instituto Tecnológico de Orizaba Presents: M.S.C. Cristian A. Rodríguez Enríquez

description

LiDIA: An integration architecture to query Linked Open Data from multiple datasets

Transcript of LiDIA: An integration architecture to query Linked Open Data from multiple datasets

Page 1: LiDIA: An integration architecture to query Linked Open Data from multiple datasets

LiDIA: An integration architecture to query Linked Open Data from multiple

datasets

ENC 2013 – Mexican International Conference on Computer Science

MSc Cristian A. Rodríguez Enríquez - PhD Giner Alor Hernández – PhD Guillermo Cortés Robles

Division of Research and Postgraduate Studies Instituto Tecnológico de Orizaba

Presents:M.S.C. Cristian A. Rodríguez Enríquez

Page 2: LiDIA: An integration architecture to query Linked Open Data from multiple datasets

Agenda

• Background• Linked Open Data• GoogleTM and Linked Data• Problem• LiDIA Search• LiDIA Architecture• Knowledge Transfer• Conclusions• Future Work

Slide 2 de 21ENC 2013 – Mexican International Conference on Computer Science

Page 3: LiDIA: An integration architecture to query Linked Open Data from multiple datasets

Background

Slide 3 de 21ENC 2013 – Mexican International Conference on Computer Science

The Web of Linked Data grows rapidly and contains data from a wide range of different domains, including life science data, geographic data, government data, library and media data, as well as cross-domain data sets such as DBpedia or Freebase.

Page 4: LiDIA: An integration architecture to query Linked Open Data from multiple datasets

Background

According to the Linked Open Data Cloud Diagram (LODCD), there are 295 data sets catalogued and classified under Linked Open Data format (public available data).

Slide 4 de 21ENC 2013 – Mexican International Conference on Computer Science

Page 5: LiDIA: An integration architecture to query Linked Open Data from multiple datasets

Background

According to the Linked Open Data Cloud Diagram (LODCD), there are 295 data sets catalogued and classified under Linked Open Data format (public available data).

Slide 5 de 21ENC 2013 – Mexican International Conference on Computer Science

Page 6: LiDIA: An integration architecture to query Linked Open Data from multiple datasets

Linked Open Data

• Public available Linked Data• Knowledge is: Organized and Accessible

Slide 6 de 21ENC 2013 – Mexican International Conference on Computer Science

Highlight: http://www.lod-cloud.net/

Page 7: LiDIA: An integration architecture to query Linked Open Data from multiple datasets

Slide 7 de 21ENC 2013 – Mexican International Conference on Computer Science

Page 8: LiDIA: An integration architecture to query Linked Open Data from multiple datasets

Google & Linked Data

Slide 8 de 21ENC 2013 – Mexican International Conference on Computer Science

Highlight: http://www.google.com/insidesearch/features/search/knowledge.html

Page 9: LiDIA: An integration architecture to query Linked Open Data from multiple datasets

Problem?

Slide 9 de 21ENC 2013 – Mexican International Conference on Computer Science

Linked Data applications that want to consume data from this global data space face the challenges that:

• Data sources use a wide range of different RDF vocabularies to represent data about the same type of entity.

• The same real-world entity, for instance a person or a place, is identified with different URIs within different data sources.

• Data about the same real-world entity coming from different sources may contain conflicting value.

Page 10: LiDIA: An integration architecture to query Linked Open Data from multiple datasets

LiDIA Search

Slide 10 de 21ENC 2013 – Mexican International Conference on Computer Science

Page 11: LiDIA: An integration architecture to query Linked Open Data from multiple datasets

LiDIA Search

Slide 11 de 21ENC 2013 – Mexican International Conference on Computer Science

Page 12: LiDIA: An integration architecture to query Linked Open Data from multiple datasets

Slide 12 de 21ENC 2013 – Mexican International Conference on Computer Science

Page 13: LiDIA: An integration architecture to query Linked Open Data from multiple datasets

Slide 13 de 21ENC 2013 – Mexican International Conference on Computer Science

LiDIA Architecture

Page 14: LiDIA: An integration architecture to query Linked Open Data from multiple datasets

Slide 14 de 21ENC 2013 – Mexican International Conference on Computer Science

Knowledge Transfer

Dr. Spence Silver, a 3M scientist, is busily researching adhesives in the laboratory. In the process, he discovers something peculiar: an adhesive that sticks lightly to surfaces but does not tightly bond to them.

Page 15: LiDIA: An integration architecture to query Linked Open Data from multiple datasets

Slide 15 de 21ENC 2013 – Mexican International Conference on Computer Science

Knowledge Transfer

While singing in his church choir, Art Fry, another 3M scientist, tires of losing his place in the hymnal. He dreams of a bookmark that's lightly adhesive. Then he remembers Silver's adhesive, and his dream begins to become real.

What would happen if knowledge could be transferred to other

domains?

Page 16: LiDIA: An integration architecture to query Linked Open Data from multiple datasets

Conclusions

Slide 16 de 21ENC 2013 – Mexican International Conference on Computer Science

• Linked Data as an alternative to keep the information structured and organized through the Web, is the present, not the future.

• Linked Open Data is an efficient way to increase knowledge sharing through the Web.

• LiDIA aims to achieve knowledge transfer and integration of multiple data sources.

Page 17: LiDIA: An integration architecture to query Linked Open Data from multiple datasets

Future Work

Slide 17 de 21ENC 2013 – Mexican International Conference on Computer Science

Use of user interface design patterns, and other technologies available in the human-computer interaction context in order to improve the user experience:

• Natural language processing• Speech recognition

Page 18: LiDIA: An integration architecture to query Linked Open Data from multiple datasets

Future Work

Slide 18 de 21ENC 2013 – Mexican International Conference on Computer Science

Achieve the integration for multiple domains:

• Process queries and provide them with context, regardless of the domain of knowledge in which the user makes the search in order to get better results

Page 19: LiDIA: An integration architecture to query Linked Open Data from multiple datasets

Slide 19 de 21ENC 2013 – Mexican International Conference on Computer Science

Page 20: LiDIA: An integration architecture to query Linked Open Data from multiple datasets

Questions?

Slide 20 de 21ENC 2013 – Mexican International Conference on Computer Science

Page 21: LiDIA: An integration architecture to query Linked Open Data from multiple datasets

eMail Social

Contact

Slide 21 de 21ENC 2013 – Mexican International Conference on Computer Science

[email protected]

[email protected]

[email protected]

kristian_are

crodriguezen

crodriguezen

Giner Alor Herná[email protected]

Guillermo Cortés [email protected]