Post on 16-Apr-2017
LEDS
WWW.LEDS-PROJEKT.DE
LINKED ENTERPRISE DATA SERVICES
DR. MICHAEL MARTINUNIVERSITÄT LEIPZIG / AKSW
8. November
20161
LEDS
• Forschungseinrichtungen:• Universität Leipzig / AKSW
Forschungsgruppe• Technische Universität Chemnitz
• Firmen:• brox IT-Solutions GmbH• Ontos GmbH• Netresearch GmbH & Co. KG• Lecos GmbH• eccenca GmbH
PROJEKTÜBERSICHT
November 8, 2016
2
LEDSFORSCHUNGSGEBIETE
November 8, 2016
3
• Open eGovernment Data
• Semantic eCommerce
• Natural Language Processing
• Linking and Knowledge Extraction
• Versioning and Co-Evolution
• Big Data / Linked Data Integration
LEDSFORSCHUNGSGEBIETE
November 8, 2016
4
• Open eGovernment Data
• Semantic eCommerce
• Natural Language Processing
• Linking and Knowledge Extraction
• Versioning and Co-Evolution
• Big Data / Linked Data Integration
LEDSFORSCHUNGSGEBIETE
November 8, 2016
5
• Open eGovernment Data
• Semantic eCommerce
• Natural Language Processing
• Linking and Knowledge Extraction
• Versioning and Co-Evolution
• Big Data / Linked Data Integration
LEDS
• Open eGovernment Data
• Semantic eCommerce
• Natural Language Processing
• Linking and Knowledge Extraction
• Versioning and Co-Evolution
• Big Data / Linked Data Integration
FORSCHUNGSGEBIETE
November 8, 2016
6
LEDSFORSCHUNGSGEBIETE
November 8, 2016
7
• Open eGovernment Data
• Semantic eCommerce
• Natural Language Processing
• Linking and Knowledge Extraction
• Versioning and Co-Evolution
• Big Data / Linked Data Integration
LEDSFORSCHUNGSGEBIETE
November 8, 2016
8
BusinessUnit2 BusinessUnit3 BusinessUnit4 BusinessUnit5BusinessUnit1
CorporateMemory
Inbound
DataSources
OutboundandConsumption
InboundRawDataStore
BigData DWH-Infrastructure
KnowledgeGraphforMetaData,KPIDefinitionand DataModels
FrontendtoAccessRelationshipandKPIDefinition/Documentation FrontendtoAccess(adhoc)Reports
OutboundDataDeliverytoTarget
Systems
• Open eGovernment Data
• Semantic eCommerce
• Natural Language Processing
• Linking and Knowledge Extraction
• Versioning and Co-Evolution
• Big Data / Linked Data Integration
LEDS
WWW.LEDS-PROJEKT.DE
ECCENCA CORPORATE MEMORY
SEMANTICALLY INTEGRATED ENTERPRISE DATA LAKES
RENE PIETZSCH
8. November
20169
LEDSMOTIVATION
Enterprise Data Management Objective:
“Ensure all data is aligned to a common meaning in order to achieve automation in performing complex analytics and generating trusted reports.”
Source:2015DataManagementIndustryBenchmark- EDMCouncil
November 8, 2016
10
In 2015 only 7% of respondents claim to already be using shared and unambiguous definitions of data across the firm and have it accessible as operational metadata.
7%
©eccencaGmbH2016
LEDS
Accounting RegulatoryReporting
RiskMgmt. Treasury...
PerspectivesonDataturnintosilosofdatabeingduplicated,annotated,simplychangedovertime,makingreconciliationandinterpretationachallenge
MOTIVATION
©eccencaGmbH2016
LEDSARCHITECTURE
November 8, 2016
12
ManagementAccounting
RiskManagementRegulatoryReporting
Treasury MarketingAccounting
CorporateMemory
Inbound
DataSources
OutboundandConsumption
InboundRawDataStore
KnowledgeGraphforMetaData,KPIDefinitionandDataModels
FrontendtoAccessRelationshipandKPIDefinition/Documentation FrontendtoAccess(adhoc)Reports OutboundDataDeliveryto
TargetSystems
BigData DWH-Infrastructure
©eccencaGmbH2016
LEDSARCHITECTURE
ManagementAccounting
RiskManagementRegulatoryReporting
Treasury MarketingAccounting
InboundRawDataStore
KnowledgeGraphforMetaData,KPIDefinitionandDataModels
FrontendtoAccessRelationshipandKPIDefinition/Documentation FrontendtoAccess(adhoc)Reports OutboundDataDeliveryto
TargetSystems
BigData DWH-Infrastructure
DataIngestion• Filesinthedatalake(CSV,XML,Excel)• (relational)Databases
©eccencaGmbH2016
LEDSARCHITECTURE
ManagementAccounting
RiskManagementRegulatoryReporting
Treasury MarketingAccounting
InboundRawDataStore
KnowledgeGraphforMetaData,KPIDefinitionandDataModels
FrontendtoAccessRelationshipandKPIDefinition/Documentation FrontendtoAccess(adhoc)Reports OutboundDataDeliveryto
TargetSystems
BigData
DWH-Infrastructure
DataLake• Emergingapproachtohandlelargeamounts
ofdata• Cost-effectivestorage• DataisheldintheirnativeformatsGoodDoesnotforceanup-frontintegrationoftheingesteddatasetsBadRetaininganoverviewofdisparatedatasilosinthelakewithouthavingacoherentsharedviewisachallengingissue
DataWarehouses• Existinginfrastucture• Typicallyrelationaldatabases
©eccencaGmbH2016
LEDSARCHITECTURE
ManagementAccounting
RiskManagementRegulatoryReporting
Treasury MarketingAccounting
InboundRawDataStore
KnowledgeGraphforMetaData,KPIDefinitionandDataModels
FrontendtoAccessRelationshipandKPIDefinition/Documentation FrontendtoAccess(adhoc)Reports OutboundDataDeliveryto
TargetSystems
BigData DWH-Infrastructure
Metadata Layer• DatasetMetadata• Ontologies• IntegrationRules
Graphical UserInterface
CustomerApplications
©eccencaGmbH2016
LEDSDATASET MANAGEMENT
DatasetManagement• CatalogDatasets• CatalogOntologies• ManageMetadata
DatasetDiscovery• DataProfiling• DatasetExploration
DatasetIntegration• DatasetLifting• DatasetLinking• DataQualityValidation
DataAccess• DomainSpecificConsolidatedViews
• ExecutiononHadoop
November 8, 2016
16
©eccencaGmbH2016
LEDSDATASET DISCOVERY
November 8, 2016
17
DatasetManagement• CatalogDatasets• CatalogOntologies• ManageMetadata
DatasetDiscovery• DataProfiling• DatasetExploration
DatasetIntegration• DatasetLifting• DatasetLinking• DataQualityValidation
DataAccess• DomainSpecificConsolidatedViews
• ExecutiononHadoop
©eccencaGmbH2016
LEDSINTEGRATION PROCESS 1/2
November 8, 2016
18
DatasetManagement• CatalogDatasets• CatalogOntologies• ManageMetadata
DatasetDiscovery• DataProfiling• DatasetExploration
DatasetIntegration• DatasetLifting• DatasetLinking• DataQualityValidation
DataAccess• DomainSpecificConsolidatedViews
• ExecutiononHadoop
©eccencaGmbH2016
LEDSINTEGRATION PROCESS 2/2
November 8, 2016
19
DatasetManagement• CatalogDatasets• CatalogOntologies• ManageMetadata
DatasetDiscovery• DataProfiling• DatasetExploration
DatasetIntegration• DatasetLifting• DatasetLinking• DataQualityValidation
DataAccess• DomainSpecificConsolidatedViews
• ExecutiononHadoop
©eccencaGmbH2016
LEDSDATA ACCESS
DatasetManagement• CatalogDatasets• CatalogOntologies• ManageMetadata
DatasetDiscovery• DataProfiling• DatasetExploration
DatasetIntegration• DatasetLifting• DatasetLinking• DataQualityValidation
DataAccess• DomainSpecificConsolidatedViews
• ExecutiononHadoop
©eccencaGmbH2016
LEDS
ContactRenePietzschTel:+491726940915email:rene.pietzsch@eccenca.com
eccencaCommand your Data!