Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iterative Quality...

20
DICE Horizon 2020 Project Grant Agreement no. 644869 http://www.dice-h2020.eu Funded by the Horizon 2020 Framework Programme of the European Union DICE: Developing Data- Intensive Cloud Applications with Iterative Quality Enhancements Giuliano Casale Imperial College London Project Coordinator

Transcript of Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iterative Quality...

Page 1: Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iterative Quality Enhancements

DICE Horizon 2020 Project Grant Agreement no. 644869http://www.dice-h2020.eu Funded by the Horizon 2020

Framework Programme of the European Union

DICE: Developing Data-Intensive Cloud Applications with Iterative Quality Enhancements

Giuliano CasaleImperial College LondonProject Coordinator

Page 2: Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iterative Quality Enhancements

DICE RIA - Overview 2

DICE Projecto Horizon 2020 Research & Innovation Action

Quality-Aware Development for Big Data applications Feb 2015 - Jan 2018, 4M Euros budget 9 partners (Academia & SMEs), 7 EU countries

©DICE 05/02/2023

Page 3: Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iterative Quality Enhancements

DICE RIA - Overview 3

o Software market rapidly shifting to Big Data 32% compound annual growth rate in EU through 2016 35% Big data projects are successful [CapGemini 2015]

o European call for software quality assurance (QA) ISTAG: call to define environments “for understanding the

consequences of different implementation alternatives (e.g. quality, robustness, performance, maintenance, evolvability, ...)”

o QA evolving too slowly compared to the trends in software development (Big data, Cloud, DevOps ...) Still crucial for competiveness!

Motivation

©DICE 05/02/2023

Page 4: Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iterative Quality Enhancements

DICE RIA - Overview 4

Platform-Indep. Model

Domain Models

Quality-Aware MDE Today

©DICE 05/02/2023

QAModels

ArchitectureModel

Platform-Specific Model

Codegeneration

C#JavaC++

PlatformDescription

MARTE

Analytical Models

Cost-Quality Models

Page 5: Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iterative Quality Enhancements

DICE RIA - Overview 5

Challenge 1: QA for Big Datao 5Vs:

o Volume, o Velocity, o Variety, o Veracity, o Value

o Problem: today no QA toolchain can reason on the quality of complex Big Data applications

o Heteregeous Big Data Technologies o NoSQL, Spark, Hadoop/MapReduce, Storm, CEP, ...

o Cloud infrastructure adds complexityo Cloud storage, auto-scaling, private/public/hybrid, ...

©DICE 05/02/2023

Page 6: Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iterative Quality Enhancements

DICE RIA - Overview 6

Challenge 2: Embracing DevOps

©DICE 05/02/2023

o QA must become lean as well Continuous quality checks and model versioning

o Modelling of the operations Dev needs awareness of infrastructure and costs

o Continuous feedback Forward and backward model synchronisation Tracking of self-adaptation events (e.g. auto-scaling)

o Big data coming from continuous monitoring QA has its own Big data, use machine learning?

Page 7: Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iterative Quality Enhancements

DICE RIA - Overview 7

Platform-Indep. Model

Domain Models

An Holistic Approach: DICE

©DICE 05/02/2023

ContinuousValidation

ContinuousMonitoring

DataAwareness

ArchitectureModel

Platform-Specific Model

PlatformDescription

DICE MARTE

Deployment &Continuous Integration

DICE IDE

Big Data

QAModels

Page 8: Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iterative Quality Enhancements

DICE RIA - Overview 8

Benefitso Tackling skill shortage and steep learning curves

Data-aware methods, models, and OSS toolso Shorter time to market for Big Data applications

Cost reduction, without sacrificing product qualityo Decrease development and testing costs

Select optimal architectures that can meet SLAso Reduce number and severity of quality incidents

Iterative refinement of application design

©DICE 05/02/2023

Page 9: Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iterative Quality Enhancements

DICE RIA - Overview 9

DICE QA: Quality Dimensions

o Reliability

o Efficiency

o Safety &Privacy

©DICE 05/02/2023

Risk of harm Privacy & data protection

Performance Time behaviour Costs

Availability Fault-tolerance

Page 10: Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iterative Quality Enhancements

Footer 10

DICE Platform Independent Model (DPIM)

©DICE 05/02/2023

Page 11: Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iterative Quality Enhancements

DICE RIA - Overview 11

DICE Profile: PIM Levelo Functional approach to data to be expanded

o Data dependencies graph relationships between data, archives and streams

o QA focuses on quantitative aspects of datao Static characteristics of data

volumes, value, storage location, replication pattern, consistency policies, data access costs, known schedules of data transfers, data access control / privacy, ...

o Dynamic characteristics of data cache hit/miss probabilities, read/write/update rates,

burstiness, ...

©DICE

Page 12: Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iterative Quality Enhancements

Footer 12

DICE Platform and Technology Specific Model (DTSM)

©DICE 05/02/2023

Page 13: Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iterative Quality Enhancements

Footer 13

DICE Platform, Technology and Deployment Specific Model (DDSM)

©DICE 05/02/2023

Page 14: Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iterative Quality Enhancements

DICE RIA - Overview 14

DICE Profile: PSM Levelo Need for technology-specific abstractions

Hadoop: Number of mappers and reducers , ... In-memory DBs: Peak memory and variable threading Streaming: merge/split/operators, networking, ... Storage: Supported operations, cost/byte , ... NoSQL: Consistency policies , ...

o Generation of deployment plan Proposed Chef + TOSCA extension

o Interest is both on private and public clouds Private clouds more relevant for batch processing Public clouds more relevant for streaming

©DICE

Page 15: Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iterative Quality Enhancements

DICE RIA - Overview 15

Demonstrators

©DICE 05/02/2023

Case study Domain Features & ChallengesDistributed data-intensive media system (ATC)

• News & Media• Social media

• Large-scale software• Data velocities• Data volumes• Data granularity• Multiple data sources and channels• Privacy

Big Data for e-Government(Netfective)

• E-Gov application

• Data volumes• Legacy data• Data consolidation • Data stores• Privacy• Forecasting and data analysis

Geo-fencing (Prodevelop)

• Maritime sector

• Vessels movements• Safety requirements• Streaming & CEP• Geographical information

Page 16: Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iterative Quality Enhancements

DICE RIA - Overview 16

Thanks!www.dice-h2020.eu

©DICE

Page 17: Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iterative Quality Enhancements

DICE RIA - Overview 17

Challenge 2: Embracing DevOpso Software development process is evolving

Developer: “I want to change my code” Operator: “I want systems to be stable”

o ...but code changes are the cause of most instabilities!o DevOps closes the gap between Dev and Ops

Lean release cycles with automated tests and tools Deep modelling of systems is the key to automation

©DICE 05/02/2023

AgileDevelopment DevOps

Business Dev Ops

Page 18: Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iterative Quality Enhancements

DICE RIA - Overview 18

Main Technical Outputs1. DICE Profile (WP2)

New UML profile to characterize data location, processing, transformation, and usage

Data-aware quality annotations Deployment models (output to TOSCA)

2. QA Tools (WP3/WP4) OSS tools (analysis, simulation, verification, feedback)

3. Integrated Development Environment (WP1) Guides through the DICE methodology

4. Delivery Tools (WP5) Deployment, continuous integration, testing

©DICE 05/02/2023

Page 19: Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iterative Quality Enhancements

DICE RIA - Overview 19

DICE QA: Possible Baselines UML MARTE

Performance Timing Verification

MODACloudML Cloud/PMI Not UML

UML DAM Dependability/ZAR, covers our quality dimensions

©DICE

UML DAM core package

Page 20: Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iterative Quality Enhancements

DICE RIA - Overview 20

Year 1 - Expected Achievements

©DICE 05/02/2023

Milestone DeliverablesBaseline andRequirements -July 2015

• State of the art analysis• Requirement specification• Dissemination, communication,

collaboration and standardisation report• Data management plan

ArchitectureDefinition -January 2016

• Design and quality abstractions• DICE simulation tools• DICE verification tools• Monitoring and data warehousing tools• DICE delivery tools• Architecture definition and integration plan• Exploitation plan