EMSO Data Management Platform - AtlantOS · Pasquale Andriani asdasdasd Oceanology International...

12

Transcript of EMSO Data Management Platform - AtlantOS · Pasquale Andriani asdasdasd Oceanology International...

Page 1: EMSO Data Management Platform - AtlantOS · Pasquale Andriani asdasdasd Oceanology International 2018 London, Excel / March 15, 2018 Context HIGH-LEVEL ARCHITECTURE Data access DMP
Page 2: EMSO Data Management Platform - AtlantOS · Pasquale Andriani asdasdasd Oceanology International 2018 London, Excel / March 15, 2018 Context HIGH-LEVEL ARCHITECTURE Data access DMP

EMSODataManagementPlatform

PasqualeAndrianiasdasdasdOceanologyInternational2018London,Excel/March15,2018

Page 3: EMSO Data Management Platform - AtlantOS · Pasquale Andriani asdasdasd Oceanology International 2018 London, Excel / March 15, 2018 Context HIGH-LEVEL ARCHITECTURE Data access DMP

Context

Page 4: EMSO Data Management Platform - AtlantOS · Pasquale Andriani asdasdasd Oceanology International 2018 London, Excel / March 15, 2018 Context HIGH-LEVEL ARCHITECTURE Data access DMP

HIGH-LEVELARCHITECTURE

Dataaccess

DMPtools

DATAMANAGEMENTPLATFORM

Real-tim

eAsynch.

Dataingestion

Ingestionspeed

Page 5: EMSO Data Management Platform - AtlantOS · Pasquale Andriani asdasdasd Oceanology International 2018 London, Excel / March 15, 2018 Context HIGH-LEVEL ARCHITECTURE Data access DMP

PHYSICALINFRASTRUCTURE

EMSO DMP •  Production VO: vo.emsodev.eu •  Cloud Compute: 11 VMs (8 CPUs + 16GB RAM + 40GB) •  File Storage: 5 TB

Jun-2016«TestVO» Aug-2016«SLAdefinition» Nov-2016«ProductionVO»

Page 6: EMSO Data Management Platform - AtlantOS · Pasquale Andriani asdasdasd Oceanology International 2018 London, Excel / March 15, 2018 Context HIGH-LEVEL ARCHITECTURE Data access DMP

FROMSOLUTION… datause

dataprocessing

WebHDFSAPI

datapublishing

dataacquisition

EMSODEVDATAMANAGEMENTPLATFORM

datacuration

MOODA

Page 7: EMSO Data Management Platform - AtlantOS · Pasquale Andriani asdasdasd Oceanology International 2018 London, Excel / March 15, 2018 Context HIGH-LEVEL ARCHITECTURE Data access DMP

FROMSOLUTION…(DATAACQUISITION)

DataacquisitionPushTransferFlow:dataissenttoaDMPservice.PullTransferFlow:dataisretrievedviaAPIexposedbyanEGIMSOSserver.

dataacquisition

PUSHtransferflow

PULLtransferflowAPI

Page 8: EMSO Data Management Platform - AtlantOS · Pasquale Andriani asdasdasd Oceanology International 2018 London, Excel / March 15, 2018 Context HIGH-LEVEL ARCHITECTURE Data access DMP

FROMSOLUTION…(DATACURATION)Datacuration

•  Datamanipulation(datascrapingandmunging)•  Datacollection(datapersistintodifferentstorage)

dataacquisition

datacuration

Page 9: EMSO Data Management Platform - AtlantOS · Pasquale Andriani asdasdasd Oceanology International 2018 London, Excel / March 15, 2018 Context HIGH-LEVEL ARCHITECTURE Data access DMP

FROMSOLUTION…(DATAUSE)

TheDMPprovidesthescientificuserwithasetofdifferenttools(DMPToolS):•  EMSODEVAPIallowsscientificusersandotherEuropeaninitiativesintheoceansciencesto

interfacewiththedataavailablewithintheDMP.•  MOODA(ModuleforOceanObservatoryDataAnalysis)isacustomapplicationdevelopedfor

thescientificcommunity.•  GRAFANAisadashboardtoquery,visualiseandunderstandthemetricsstoredintotheDMP.

MOODA

Page 10: EMSO Data Management Platform - AtlantOS · Pasquale Andriani asdasdasd Oceanology International 2018 London, Excel / March 15, 2018 Context HIGH-LEVEL ARCHITECTURE Data access DMP

FROMSOLUTION…(DATAPROCESSING)

dataprocessing

•  ApacheSpark:fastandgeneralengineforlarge-scaledataprocessing.Itisabletoexecutebothbatchandstreamingdataprocessing.

•  ApacheZeppelin:interactivedataanalyticstool.Itisabletoperformdataanalysiswithdifferentprocessingsysteminterpreters.

ODVFileOceanDataView

Page 11: EMSO Data Management Platform - AtlantOS · Pasquale Andriani asdasdasd Oceanology International 2018 London, Excel / March 15, 2018 Context HIGH-LEVEL ARCHITECTURE Data access DMP

…TOOPERATION

12hoursofSeaWaterTemperature

Page 12: EMSO Data Management Platform - AtlantOS · Pasquale Andriani asdasdasd Oceanology International 2018 London, Excel / March 15, 2018 Context HIGH-LEVEL ARCHITECTURE Data access DMP

The EMSO-Link project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreements N° 731036.

Thankyouforyourattention.

[email protected]