Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Private Data

Post on 16-Apr-2017


LEDS

WWW.LEDS-PROJEKT.DE

LINKED ENTERPRISE DATA SERVICES

DR. MICHAEL MARTIN
UNIVERSITÄT LEIPZIG / AKSW

November 8, 2016

PROJECT OVERVIEW

• Research institutions:
  • Universität Leipzig / AKSW research group
  • Technische Universität Chemnitz

• Companies:
  • brox IT-Solutions GmbH
  • Ontos GmbH
  • Netresearch GmbH & Co. KG
  • Lecos GmbH
  • eccenca GmbH

RESEARCH AREAS

• Open eGovernment Data

• Semantic eCommerce

• Natural Language Processing

• Linking and Knowledge Extraction

• Versioning and Co-Evolution

• Big Data / Linked Data Integration

[Figure: Corporate Memory architecture spanning Business Units 1 to 5, with inbound data sources, an inbound raw data store, Big Data and DWH infrastructure, a knowledge graph for metadata, KPI definitions and data models, frontends for relationship and KPI definition/documentation and for (ad hoc) reports, and outbound data delivery to target systems]

ECCENCA CORPORATE MEMORY

SEMANTICALLY INTEGRATED ENTERPRISE DATA LAKES

RENE PIETZSCH

November 8, 2016

MOTIVATION

Enterprise Data Management Objective:

“Ensure all data is aligned to a common meaning in order to achieve automation in performing complex analytics and generating trusted reports.”

Source: 2015 Data Management Industry Benchmark, EDM Council

In 2015, only 7% of respondents claimed to already use shared and unambiguous definitions of data across the firm and to have them accessible as operational metadata.

© eccenca GmbH 2016

Accounting, Regulatory Reporting, Risk Mgmt., Treasury, ...

Perspectives on data turn into silos of data being duplicated, annotated, or simply changed over time, making reconciliation and interpretation a challenge.

ARCHITECTURE

Business units: Management Accounting, Risk Management, Regulatory Reporting, Treasury, Marketing, Accounting

Corporate Memory

• Inbound Data Sources
• Inbound Raw Data Store
• Big Data
• DWH Infrastructure
• Knowledge Graph for Metadata, KPI Definition and Data Models
• Outbound and Consumption
• Frontend to Access Relationship and KPI Definition / Documentation
• Frontend to Access (ad hoc) Reports
• Outbound Data Delivery to Target Systems

Data Ingestion
• Files in the data lake (CSV, XML, Excel)
• (Relational) databases
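The ingestion sources named above (files and relational databases) can be illustrated with a minimal sketch. It is not the Corporate Memory implementation: the helper names, the sample data and the in-memory SQLite database are assumptions made for illustration, and Excel files are left out because they would need a third-party library.

```python
import csv
import io
import sqlite3
import xml.etree.ElementTree as ET


def read_csv_records(text):
    """Parse CSV text with a header row into a list of dicts."""
    return list(csv.DictReader(io.StringIO(text)))


def read_xml_records(text, record_tag):
    """Parse XML text; each <record_tag> element becomes a dict of its child tags."""
    root = ET.fromstring(text)
    return [
        {child.tag: (child.text or "").strip() for child in elem}
        for elem in root.iter(record_tag)
    ]


def read_table_records(conn, table):
    """Read all rows of one table from a sqlite3 connection as dicts."""
    cursor = conn.execute(f"SELECT * FROM {table}")
    names = [d[0] for d in cursor.description]
    return [dict(zip(names, row)) for row in cursor]


# Hypothetical sample inputs standing in for files in the lake and a database.
csv_text = "id,name\n1,Alice\n2,Bob\n"
xml_text = "<customers><customer><id>3</id><name>Carol</name></customer></customers>"
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (id TEXT, name TEXT)")
conn.execute("INSERT INTO customers VALUES ('4', 'Dave')")

# All three sources end up as the same plain record shape.
records = (
    read_csv_records(csv_text)
    + read_xml_records(xml_text, "customer")
    + read_table_records(conn, "customers")
)
```

Reading every source into the same list-of-dicts shape keeps later steps independent of the original file format.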

Data Lake
• Emerging approach to handle large amounts of data
• Cost-effective storage
• Data is held in its native format
• Good: Does not force an up-front integration of the ingested datasets
• Bad: Retaining an overview of disparate data silos in the lake without a coherent shared view is a challenging issue

Data Warehouses
• Existing infrastructure
• Typically relational databases

Metadata Layer
• Dataset Metadata
• Ontologies
• Integration Rules

Graphical User Interface

Customer Applications
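One way to picture the metadata layer is as a small catalog of dataset descriptions plus rules that tie source columns to terms of a shared ontology. The sketch below is an assumption for illustration only: the class names, fields, file path and the example.org property IRI are hypothetical and do not come from the slides.

```python
from dataclasses import dataclass, field


@dataclass
class DatasetMetadata:
    """Catalog entry describing one dataset in the lake (hypothetical fields)."""
    name: str
    location: str
    fmt: str
    columns: list = field(default_factory=list)


@dataclass
class IntegrationRule:
    """Maps one source column of a dataset to a property of a shared ontology."""
    dataset: str
    column: str
    ontology_property: str


# Hypothetical catalog content and rule set.
catalog = {
    "customers": DatasetMetadata(
        "customers", "/lake/crm/customers.csv", "csv", ["id", "name"]
    ),
}
rules = [
    IntegrationRule("customers", "name", "http://example.org/ontology#name"),
]


def mapped_properties(dataset_name):
    """Return {column: ontology property} for every rule defined on the dataset."""
    return {r.column: r.ontology_property for r in rules if r.dataset == dataset_name}
```

Here `mapped_properties("customers")` returns the columns of that dataset that already have a shared meaning; columns without a rule (such as `id` above) remain undescribed.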

DATASET MANAGEMENT

Dataset Management
• Catalog Datasets
• Catalog Ontologies
• Manage Metadata

Dataset Discovery
• Data Profiling
• Dataset Exploration

Dataset Integration
• Dataset Lifting
• Dataset Linking
• Data Quality Validation

Data Access
• Domain-Specific Consolidated Views
• Execution on Hadoop

DATASET DISCOVERY

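The Data Profiling step listed under Dataset Discovery could, in its simplest form, compute a few statistics per column. The following is a minimal sketch under that assumption; the function name, the chosen statistics and the sample rows are hypothetical.

```python
def profile(records):
    """Return per-column statistics for a list of dict records."""
    columns = {}
    for rec in records:
        for col, value in rec.items():
            stats = columns.setdefault(col, {"filled": 0, "values": set()})
            # Empty strings and None count as missing values.
            if value not in (None, ""):
                stats["filled"] += 1
                stats["values"].add(value)
    return {
        col: {
            "filled": s["filled"],
            "distinct": len(s["values"]),
            # True only if every non-empty value looks like an integer.
            "all_integer": bool(s["values"])
            and all(str(v).lstrip("-").isdigit() for v in s["values"]),
        }
        for col, s in columns.items()
    }


# Hypothetical sample rows.
rows = [
    {"id": "1", "city": "Leipzig"},
    {"id": "2", "city": ""},
    {"id": "3", "city": "Leipzig"},
]
report = profile(rows)
```

For the sample rows, `id` is reported as fully filled with three distinct integer values, while `city` has one missing value and a single distinct entry, which hints at a key column and a category column respectively.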

INTEGRATION PROCESS 1/2

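The Dataset Lifting and Dataset Linking steps listed under Dataset Integration can be sketched as follows, assuming that lifting means mapping raw records to graph statements (subject, predicate, object) and that linking means connecting entries that describe the same thing. The function names, the example.org IRIs and the naive matching rule are assumptions for illustration, not the method shown in the talk.

```python
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def lift(records, base_iri, key, mapping):
    """Turn each record into (subject, predicate, object) triples.

    mapping: {column name: predicate IRI}; columns without a mapping are skipped.
    """
    triples = []
    for rec in records:
        subject = f"{base_iri}{rec[key]}"
        for col, predicate in mapping.items():
            if rec.get(col):
                triples.append((subject, predicate, rec[col]))
    return triples


def link(triples_a, triples_b, predicate):
    """Link subjects from two datasets whose values for `predicate` match
    after trimming whitespace and lowercasing (a deliberately naive rule)."""
    index = {}
    for s, p, o in triples_b:
        if p == predicate:
            index.setdefault(o.strip().lower(), []).append(s)
    return [
        (s, OWL_SAME_AS, match)
        for s, p, o in triples_a
        if p == predicate
        for match in index.get(o.strip().lower(), [])
    ]


# Hypothetical ontology property and two source datasets with different schemas.
NAME = "http://example.org/ontology#name"
crm = lift([{"id": "1", "name": "ACME GmbH"}], "http://example.org/crm/", "id", {"name": NAME})
erp = lift([{"nr": "77", "title": " acme gmbh "}], "http://example.org/erp/", "nr", {"title": NAME})
links = link(crm, erp, NAME)
```

Because both datasets are lifted to the same property, the linking rule never needs to know that one source calls the column `name` and the other `title`.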

INTEGRATION PROCESS 2/2

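The Data Quality Validation step listed under Dataset Integration might check simple rules such as required values and unique keys. This is a minimal sketch under that assumption; the rule types, function name and sample rows are hypothetical.

```python
def validate(records, required, unique):
    """Return a list of (row index, message) for each violated rule.

    required: column names that must be non-empty in every record.
    unique:   one column name whose non-empty values must not repeat.
    """
    issues = []
    seen = set()
    for i, rec in enumerate(records):
        for col in required:
            if not rec.get(col):
                issues.append((i, f"missing value for '{col}'"))
        key = rec.get(unique)
        if key in seen:
            issues.append((i, f"duplicate '{unique}': {key}"))
        elif key:
            seen.add(key)
    return issues


# Hypothetical sample rows: the second row has an empty name and a repeated id.
rows = [
    {"id": "1", "name": "Alice"},
    {"id": "1", "name": ""},
    {"id": "2", "name": "Bob"},
]
issues = validate(rows, required=["id", "name"], unique="id")
```

Returning the row index with each message lets a user interface point back to the offending record instead of only reporting a count.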

DATA ACCESS


Contact
Rene Pietzsch
Tel: +491726940915
Email: rene.pietzsch@eccenca.com

eccenca
Command your Data!