FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

32
Chemnitz University of Technology Prof. Dr.-Ing. Martin Gaedke & Team 29.11.2016 FAME.Q A Formal Approach to Master Quality in Enterprise Linked Data André Langer and Martin Gaedke Semantic Web and XML @ ICWI2016 VSR://IntelligentInformationManagement/LEDS

Transcript of FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

Page 1: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

Chemnitz University of Technology Prof. Dr.-Ing. Martin Gaedke & Team 29.11.2016

FAME.Q A Formal Approach to Master Quality in Enterprise Linked Data

André Langer and Martin Gaedke

Semantic Web and XML @ ICWI2016

VSR://IntelligentInformationManagement/LEDS

Page 2: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

What we do

2

Page 3: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

VSR://IntelligentInformationManagement/LEDS/Background 3

Linked Open Data Corporate Data Social Networks

Data Lake

Knowledge Graphs

Management of Background Knowledge

Data Quality and Coherence

Knowledge Extraction

Search in Linked Data

E-Commerce Applications

Page 4: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

VSR://IntelligentInformationManagement/LEDS/Background 4

Linked Open Data Corporate Data Social Networks

Data Lake

Knowledge Graphs

Management of Background Knowledge

Data Quality and Coherence

Knowledge Extraction

Search in Linked Data

E-Commerce Applications

Main Focus

Page 5: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

Initial

Question

5

Page 6: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

What is Data Quality?

Which common definitions exist?

How can DQ be measured ?

VSR://IntelligentInformationManagement/LEDS/Motivation 6

Page 7: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

Data Quality

• Is a multi-dimensional concept

• data that is „fit for use“ by data consumers (Wang & Strong, 1996; Strong, Lee & Wang, 1997b)

• data that is „free of defects and posesses desired features“ (Redman, 2001)

VSR://IntelligentInformationManagement/LEDS/Motivation 7

Page 8: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

2.1 Simple Example

8

Page 9: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

VSR://IntelligentInformationManagement/LEDS/Motivation 9

Page 10: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

VSR://IntelligentInformationManagement/LEDS/Motivation 10

Page 11: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

VSR://IntelligentInformationManagement/LEDS/Motivation 11

Page 12: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

How can existing

definitions be formalized? ?

VSR://IntelligentInformationManagement/LEDS/Motivation 12

Page 13: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

Approach

13

Page 14: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

Data Quality characterizes

data to which degree it corresponds

to specific requirements

VSR://IntelligentInformationManagement/LEDS/Approach 14

Page 15: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

Data Quality characterizes

data to which degree it corresponds

to specific requirements

VSR://IntelligentInformationManagement/LEDS/Approach 15

Context

Page 16: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

Data Quality characterizes

data to which degree it corresponds

to specific requirements

VSR://IntelligentInformationManagement/LEDS/Approach 16

Context

metrics

Page 17: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

Data Quality characterizes

data to which degree it corresponds

to specific requirements

VSR://IntelligentInformationManagement/LEDS/Approach 17

Context

metrics a percentage

Page 18: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

VSR://IntelligentInformationManagement/LEDS/Approach 18

Context

metrics a percentage

Simplified Version

Page 19: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

VSR://IntelligentInformationManagement/LEDS/Approach 19

Context

metrics a percentage

Simplified Version

Page 20: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

VSR://IntelligentInformationManagement/LEDS/Approach 20

Common Quality dimensions and

appropriate metrics have already been

extensively classified by other authors

• Wang & Strong, 1996

• Zaveri et al, 2014

Page 21: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

VSR://IntelligentInformationManagement/LEDS/Approach 21

Common Quality dimensions and

appropriate metrics have already been

extensively classified by other authors

• Wang & Strong, 1996

• Zaveri et al, 2014

Zaveri, A. et al., 2014. Quality Assessment for Linked Open Data: A Survey. Semantic Web Journal, 1, p. 22

Page 22: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

VSR://IntelligentInformationManagement/LEDS/Approach 22

FAME.Q Quality Assessment Levels

Data Quality

Instance Level Schema Level Service Level

Page 23: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

VSR://IntelligentInformationManagement/LEDS/Approach 23

Example calculation 1

Page 24: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

VSR://IntelligentInformationManagement/LEDS/Approach 24

Example calculation 2

Page 25: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

VSR://IntelligentInformationManagement/LEDS/Approach 25

Summary: What is Data Quality?

„fatal“ „perfect“

Page 26: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

Conclusion

26

Page 27: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

VSR://IntelligentInformationManagement/LEDS/Conclusion 27

Data Quality can be interpreted as the degree to which data fits to current requirements • Build upon and reuse existing definitions • Apply it to the field of the Semantic Web • Set it in a formalized schema

Page 28: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

4.1 Future Steps

28

Page 29: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

VSR://IntelligentInformationManagement/LEDS/Conclusion 29

Page 30: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

VSR://IntelligentInformationManagement/LEDS/Conclusion 30

Several related Quality Measurement frameworks already exist(ed) with different result output capabilities • SWIQA (Fürber & Hepp, 2011a) • Luzzu (Debattista et al., 2015) • Roomba OpenData Checker (Assaf et al., 2015)

Page 31: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

VSR://IntelligentInformationManagement/LEDS/Conclusion 31

We output the results of our Quality Assessment tool with the means of the data quality vocabulary (dqv)

Page 32: FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data

VSR

Chemnitz University of Technology Prof. Dr.-Ing. Martin Gaedke & Team 29.11.2016

Inspired and Interested?

[email protected]

VSR.Informatik.TU-Chemnitz.de

@andrelanger @myVSR /myVSR