Post on 28-Dec-2015
PINCETTE Project: Validation of Changes and Upgrades in Large Software Systems
Unique Challenges and Suggested Solutions
Hana Chockler
Motivation: Challenges of Validating Evolving Software

Large software systems are usually built incrementally:
• Maintenance (fixing errors and flaws, hardware changes, etc.)
• Enhancements (new functionality, improved efficiency, extensions, new regulations, etc.)

Changes are made frequently during the lifetime of most systems and can introduce new software errors or expose old ones. Upgrades are done gradually, so the old and new versions have to co-exist in the same system. Changes often require re-certification of the system, especially for mission-critical systems.
"Upgrading a networked system is similar to upgrading the software of a car while the car's engine is running and the car is moving on a highway. Unfortunately, in networked systems we don't have the option of shutting the whole system down while we upgrade and verify a part of it."
PINCETTE Project – Validating Changes and Upgrades in Networked Software

[Project overview diagram: a front end feeding a static analysis component (white-box testing, verifying only the change, using function summaries) and a dynamic analysis component (black-box testing, checking for crashes), accompanied by a methodology book.]
What does it mean to validate a change in a software system?
• Equivalence checking – when the new version should be equivalent to the previous version in terms of functionality:
  • Changes in the underlying hardware
  • Optimizations
• No crashes – when several versions need to co-exist in the same system, and we need to ensure that the update will not crash the system:
  • When there is no correctness specification, this is often the only thing we can check
• Checking that a specific bug was fixed:
  • A counterexample trace can be viewed as a specification of a behavior that must be eliminated in the new version
• Validation of the new functionality:
  • If a correctness specification for the change exists, we can check whether the new (or changed) behaviors satisfy it
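The equivalence-checking scenario above can be sketched as differential testing of two versions of a routine. This is a minimal illustration, not the PINCETTE tooling; `old_scale` and `new_scale` are hypothetical stand-ins for a function before and after an optimization.

```python
# Sketch: equivalence checking of two versions by differential testing.
# The function names and the optimization are illustrative assumptions.

def old_scale(x):
    return x * 2 + x * 2          # original implementation

def new_scale(x):
    return x * 4                  # optimized implementation

def check_equivalence(old, new, inputs):
    """Return the inputs on which the two versions disagree."""
    return [x for x in inputs if old(x) != new(x)]

mismatches = check_equivalence(old_scale, new_scale, range(-100, 101))
assert mismatches == []           # the optimization preserves functionality
```

In practice the input set is symbolic rather than enumerated, but the check is the same: no input may distinguish the two versions.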
Why is validation of evolving software different from standard software validation?
• Software systems are too large to be formally verified or exhaustively tested all at once
• Even when validating the whole system is feasible, the process is often too long and expensive to fit into a schedule of small, frequent changes
• When validating the whole system, there is a danger of overlooking the change

How can we use the fact that we are validating evolving software?
• If the previous version was validated in some way, we can assume it is correct and avoid re-validating the parts that were not changed
• If the results of previous validation exist, we can use them as a basis for the current validation – especially useful when there are many versions that differ from each other only slightly
• The previous version can be used as a specification
After more than two years of PINCETTE, two questions:
1. How to verify an evolving system that does not have assertions?
2. How to verify only the change, without relying on results of previous verification?
1. How to verify an evolving system that does not have assertions?
Dynamically Discovering Assertions to Support Formal Verification

Motivation:
• "Gray-box" components (such as OTS components) – poor specifications, partial view of internal details
• Lack of specification complicates validation and debugging
• Lack of description of the correct behavior complicates integration

Idea: analyze gray-box components with dynamic analysis techniques:
• Monitor system executions by observing interactions at the component interface level and inside components
• Derive models of the expected behavior from the observed events
• Mark the model violations as symptoms of faults
©Leonardo Mariani, UniMiB
Dynamically Discovering Assertions with BCT (UniMiB tool)
• Combines dynamic analysis and model-based monitoring
• Combines classic dynamic analysis techniques (Daikon) with incremental finite-state generation techniques (kBehavior) to produce I/O models and interaction models
  • FSAs are produced and refined based on subsequent executions
• Extracts information about likely causes of failures by automatically relating the detected anomalies
• Filters false positives in two steps:
  • Identify and eliminate false positives by comparing failing and successful executions, using heuristics already tried in other contexts
  • Rank the remaining anomalies according to their mutual correlation, and use this information to push the related likely false positives away from the top of the list
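The core of Daikon-style dynamic analysis is inferring likely invariants from observed executions. A heavily simplified sketch (not the BCT or Daikon implementation; the trace format is an assumption made for illustration):

```python
# Sketch of Daikon-style likely-invariant inference: observe variable
# values over many executions and keep range invariants that were
# never violated. The trace representation is illustrative.

def infer_range_invariants(traces):
    """traces: list of dicts mapping variable name -> observed value.
    Returns likely invariants of the form 'lo <= var <= hi'."""
    bounds = {}
    for snapshot in traces:
        for var, val in snapshot.items():
            lo, hi = bounds.get(var, (val, val))
            bounds[var] = (min(lo, val), max(hi, val))
    return {var: f"{lo} <= {var} <= {hi}" for var, (lo, hi) in bounds.items()}

# Observations collected while monitoring a component's interface:
traces = [{"x": 1, "y": 10}, {"x": 3, "y": 10}, {"x": 2, "y": 10}]
print(infer_range_invariants(traces))
# y never varied, so "10 <= y <= 10" (i.e., y == 10) is a likely invariant
```

Real tools infer far richer properties (linear relations, FSA models of call sequences), but the principle is the same: an invariant that held on every observed run becomes a candidate assertion.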
Extracting a set of valid assertions for regression

[Workflow diagram:] Program V.0 is tested, producing a set of traces; analysis of the traces yields a set of candidate assertions; formal methods then establish the set of assertions that hold in V.0. Applying formal methods to program V.1 identifies the set of assertions that became invalid in V.1. Combined with user input, this gives efficient and automatic regression verification. (UniMiB + Oxford + USI)
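The regression flow above can be sketched end to end: assertions mined from V.0 are checked against V.1, and those that no longer hold become regression warnings. All names here are illustrative, and the "formal methods" step is replaced by exhaustive checking over a small input range; the intentional bug in `v1` exists only to trigger the warning.

```python
# Sketch of the regression-verification flow: candidate assertions that
# hold in V.0 are re-checked on V.1; failures flag the change.

def v0(x):
    return abs(x)

def v1(x):
    return x if x > 0 else -x - 1   # changed version with a bug at x == 0

candidates = [
    ("result >= 0",      lambda f, x: f(x) >= 0),
    ("result == abs(x)", lambda f, x: f(x) == abs(x)),
]

def surviving(f, assertions, inputs):
    """Assertions that hold on f for every input (stand-in for formal methods)."""
    return [name for name, pred in assertions
            if all(pred(f, x) for x in inputs)]

inputs = range(-5, 6)
holds_v0 = surviving(v0, candidates, inputs)          # both hold on V.0
invalid_in_v1 = [a for a in holds_v0
                 if a not in surviving(v1, candidates, inputs)]
print(invalid_in_v1)   # the change violates both assertions at x == 0
```

The key property of the pipeline is that the assertions act as an automatically derived specification: V.0 defines what "correct" means for V.1.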
Parameters of the assertion discovery
• Scope of monitoring:
  • All lines
  • Function invocations
  • Loops
  • Particular variables
• Output refinement:
  • Many violations – perhaps keep only those in the slice of the change
  • Look for a minimal set
• How to design the search algorithm?
  • Can we check properties of the form "if P_1, P_2, …, P_n are true then Q_1, Q_2, …, Q_m", where each property may hold at a different code location?
2. How to verify only the change, without relying on results of previous verification?

The main idea: verify only the change, ignoring the rest of the program, using ExpliSAT.

Joint work with Sitvanit Ruah
What is ExpliSAT?
• A model checker that combines explicit traversal of the control-flow graph (CFG) with symbolic representation of data:
  • Traverses the CFG
  • Ensures that only real paths are traversed, by computing a representative path with real data values
  • Builds a SAT formula for each path in the control-flow graph and invokes a SAT solver
  • Bug-hunting heuristics can be added to change the order of traversal of the graph, so that the places where bugs are more likely to occur are traversed first

This concolic (concrete + symbolic) approach is widely used in testing. Heuristics direct the search towards areas in which bugs are more likely to occur.
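The concolic idea can be sketched in a few lines: each control path is a conjunction of branch conditions, and a path is real iff some concrete input satisfies all of them. This toy version replaces the SAT solver with a naive search over a finite input range; the program and constraints are illustrative.

```python
# Minimal sketch of concolic path exploration: a control path is
# feasible iff a concrete "representative" satisfies its branch
# constraints. Constraint solving here is naive search, not SAT.

N = 10

# Branch constraints of the two control paths of:
#   x = input(); if (x > N) y = x; else y = N; assert(y >= N)
paths = {
    "then": [lambda x: x > N],
    "else": [lambda x: not (x > N)],
}

def find_representative(constraints, domain=range(-100, 101)):
    """Return a concrete input satisfying every constraint, or None."""
    for x in domain:
        if all(c(x) for c in constraints):
            return x
    return None

for name, cs in paths.items():
    print(name, "feasible with x =", find_representative(cs))

# An infeasible path (here: x > N and x < 0, with N = 10) has no representative
assert find_representative([lambda x: x > N, lambda x: x < 0]) is None
```

In ExpliSAT the representative is maintained incrementally as the CFG is traversed, so infeasible branches are pruned before a full path formula is ever built.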
The path representative – checking that the symbolic path is real
• ExpliSAT traverses the CFG and creates all possible control paths
• Data variables of all executions that follow the same control path are represented symbolically
• A path representative for a given control path is a valuation of every variable in the control path that satisfies the path constraint
• For every legal control path there exists a path representative; conversely, a control path with a representative is a legal control path
• The representative is computed iteratively: to decide which branch to take at a branching point A, use the representative of the prefix up to A, then update the representative and look at the command at A; the other branch may be infeasible, which is checked by trying to construct a representative for it
Ordering – heuristically deciding in which order to traverse the control paths

1: x=input();
2: if (x>N) then
3:   y=x;
   else
4:   y=N;
5: assert(y>=N);

[CFG diagram: the program has two control paths, 1→2→3→5 and 1→2→4→5.]

The particular heuristic depends on the verification goals – for updates, we use the update-checking heuristic.
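The example program from the slide can be made runnable to confirm that both control paths establish the assertion:

```python
# The slide's example program: on the "then" path y = x > N, and on the
# "else" path y = N, so assert(y >= N) holds on every path.

N = 10

def program(x):
    if x > N:
        y = x        # path 1→2→3→5: y = x > N
    else:
        y = N        # path 1→2→4→5: y = N
    assert y >= N    # holds on both paths
    return y

for x in range(-20, 21):
    program(x)       # no input violates the assertion
```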
Extracting the part of the control-flow graph affected by a change

[Diagram: backward reachability of the initial state from the changed node, together with the subgraph induced by the changed node, yields the part of the CFG affected by the change.]
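The extraction above amounts to two reachability sweeps on the CFG: backward from the changed node (to collect the paths leading to it from the initial state) and forward from it (the induced subgraph). A sketch on a hypothetical graph:

```python
# Sketch: nodes affected by a change = nodes backward-reachable from
# the changed node (paths from the entry to the change) plus nodes
# forward-reachable from it. The CFG below is illustrative.

from collections import deque

def reachable(graph, start):
    """All nodes reachable from start via BFS."""
    seen, work = {start}, deque([start])
    while work:
        n = work.popleft()
        for m in graph.get(n, []):
            if m not in seen:
                seen.add(m)
                work.append(m)
    return seen

cfg = {"entry": ["a", "b"], "a": ["change"], "b": ["c"],
       "change": ["exit"], "c": ["exit"]}

# Build the reversed graph for the backward sweep.
reverse = {}
for n, succs in cfg.items():
    for m in succs:
        reverse.setdefault(m, []).append(n)

affected = reachable(reverse, "change") | reachable(cfg, "change")
print(sorted(affected))   # node "b" lies on no path through the change
```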
ExpliSAT with update-checking

[Tool-chain diagram, built on the CProver framework:] the old and new versions are parsed and translated into a goto-program by goto-cc, with a syntactic marking of the changes; only the new version, with the marking, is passed on. ExpliSAT with update checking then runs on the goto-program and invokes a SAT solver, checking first the paths that go through a changed node.
Update-checking heuristics in ExpliSAT – turning the model checker into a bug hunter

Two options:
• Check only the change – very efficient
• Check the change first, then continue checking the whole program – new bugs are discovered quickly, and the whole process is complete
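The two options above are a path-ordering policy. A sketch, with illustrative paths and change marking:

```python
# Sketch of the update-checking heuristic: control paths through a
# changed node are explored first (or exclusively, in change-only
# mode). The paths and the marking are illustrative.

paths = [
    ["entry", "a", "exit"],
    ["entry", "b", "change", "exit"],
    ["entry", "c", "exit"],
]
changed = {"change"}

def order_paths(paths, changed, change_only=False):
    hits = [p for p in paths if changed & set(p)]
    if change_only:
        return hits                      # option 1: verify only the change
    misses = [p for p in paths if not (changed & set(p))]
    return hits + misses                 # option 2: change first, then the rest

print(order_paths(paths, changed))       # the changed path is explored first
print(order_paths(paths, changed, change_only=True))
```

Option 1 trades completeness for speed; option 2 keeps completeness while still surfacing update-related bugs early.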
Effect of a change on the functionality of the program
• Claim: in sequential programs, the algorithm from the previous slide finds all paths affected by a change (the proof is straightforward)
• Observation: in concurrent programs, a change can also affect other paths, e.g., through changes in global variables – future work
Experimental results
• We compared the performance of ExpliSAT on the whole program with its performance on the change only. Not surprisingly, ExpliSAT on the change only is much faster.
• Real-life example: a C++ program supplied by the VTT Technical Research Centre of Finland, computing the velocity and acceleration of a robot used in the European ITER project – a new type of reactor based on the energy of nuclear fusion. ExpliSAT found a bug in the update in several seconds; on the whole program, ExpliSAT does not terminate.
VTT (Finland) – the ITER EU project: the thermonuclear reactor

ITER is an international magnetic-confinement fusion experimental project with participation from the EU, India, Japan, China, Russia, South Korea, and the USA. The expected budget of ITER is €5B, with 50% of the funding provided by the EU. The reactor will be constructed in Cadarache, France, around 2016. ITER is one of the most important and complex research projects in the history of the EU.

[Photos: the divertor, which goes at the bottom of the reactor and must be changed periodically, and the robot that changes the divertor cassette.]