
Partially supported by: NSF, DHS, and US Air Force

Alessandro (Alex) Orso
School of Computer Science – College of Computing
Georgia Institute of Technology
http://www.cc.gatech.edu/~orso/

REPOSITORY MINING AND PROGRAM ANALYSIS & TESTING: BETTER TOGETHER?

MSR PAPERS AND PROGRAM ANALYSIS

[Bar chart: number of MSR papers that leverage static and/or dynamic analyses per year, 2004-2010; y-axis: # papers, 0-4]

Note: this is only for MSR!

• Mini-history of software archives

• < 1996 – Mostly small examples, limited evaluation

• 1996 – Siemens suite (<500 LOC)

• 2005 – Software-artifact Infrastructure Repository

• 2006 – Eclipse Bug Data

• 2007 – iBUGS

• In 2010, much (most?) research still uses the Siemens suite

PROGRAM ANALYSIS AND SOFTWARE ARCHIVES

ISSUE #1

Communication

[Venn diagram: ISSTA PCs (76) and MSR (72), with an overlap of only 4]

ISSUE #2

Mismatch in assumptions (or schisms)

• (Most) program analyses:
  • Complete programs
  • Single language
  • Restricted set of features
  • Soundness
  • False positives problematic

• Mining techniques:
  • Incomplete programs
  • Multiple languages
  • Complete languages
  • Noisy data
  • False positives acceptable

ISSUE #3

Infrastructure

• Program analysis tools:
  • Unavailable
  • Unusable
  • Limited

• Mining infrastructure:
  • No standard format
  • Complicated setup
  • Unusable

ISSUE #4

Narrow focus of some MSA research

LOOKING FOR GOLD...

LOOKING FOR KEYS...

Software archives

MAYBE IF WE TURN ON THE LIGHT

MINING MORE THAN ARCHIVES

• Software archives
• Program runs
• Program traces
• Static/dynamic metrics
• ...
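As a concrete example of what "more than archives" means, here is a minimal Python sketch of collecting per-run method coverage with a trace hook. It is only an illustration: Gamma instruments deployed binaries, whereas sys.settrace is a toy stand-in for that instrumentation, and the two methods and the function name collect_coverage are made up.

import sys

def collect_coverage(entry_point, *args):
    covered = set()
    def tracer(frame, event, arg):
        if event == "call":
            covered.add(frame.f_code.co_name)   # record each method entered
        return None                             # no per-line tracing needed
    sys.settrace(tracer)
    try:
        entry_point(*args)
    finally:
        sys.settrace(None)                      # always remove the hook
    return covered

def m2(x): return x + 1
def m1(x): return m2(x) * 2

print(sorted(collect_coverage(m1, 3)))          # ['m1', 'm2']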

GAMMA PROJECT

[Diagram: program instances run in the field; field execution data flows back to developers in house]

Maintenance tasks: impact analysis, regression testing, debugging, behavior classification, ...

"Gamma System: Continuous Evolution of Software after Deployment." Orso et al., ISSTA 2002.

IMPACT ANALYSIS

• Assess effects of changes on a software system

• Predictive: help decide which changes to perform and how to implement them

• Our approach

• Program-sensitive impact analysis

• User-sensitive impact analysis

IMPACT ANALYSIS USING FIELD DATA

[Figure: program P with methods m1-m6; execution data collected in the field is summarized as a coverage matrix with one row per execution (A1 and A2 from User A, B1 and B2 from User B, C1 from a third user) and one column per method, where an X marks a method covered by that execution]

"Leveraging Field Data for Impact Analysis and Regression Testing." Orso et al., ESEC-FSE 2003.

PROGRAM-SENSITIVE IMPACT ANALYSIS

Input:
1. Field execution data (the coverage matrix above)
2. Change C = {m2, m5}

Step 1
• Identify user executions through methods in C
• Identify methods covered by such executions
• Result: covered methods = {m1, m2, m3, m5, m6}

Step 2
• Compute the dynamic forward slice from C
• Result: dynamic fwd slice = {m2, m5, m6}

Output: Impact set = {m2, m5, m6}
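The two steps lend themselves to simple set operations. The following Python sketch is not the Gamma implementation: the coverage matrix is illustrative (chosen to reproduce the numbers on the slide), and the dynamic forward slice, which the real technique derives from runtime dependence data, is stubbed in as a precomputed set.

def impact_set(coverage, changed, fwd_slice):
    # Step 1: methods covered by executions that traverse a changed method.
    covered = set()
    for methods in coverage.values():
        if methods & changed:
            covered |= methods
    # Step 2: intersect the dynamic forward slice from the change with them.
    return fwd_slice & covered

coverage = {                      # illustrative, not the slide's actual matrix
    "A1": {"m1", "m2", "m3"},     # goes through m2 (changed)
    "B1": {"m5", "m6"},           # goes through m5 (changed)
    "C1": {"m1", "m4"},           # touches no changed method
}
changed = {"m2", "m5"}
fwd_slice = {"m2", "m5", "m6"}    # stub for the dynamic forward slice from C

print(sorted(impact_set(coverage, changed, fwd_slice)))   # ['m2', 'm5', 'm6']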

USER-SENSITIVE IMPACT ANALYSIS

Input:
1. Field execution data
2. Change C = {m5, m6}

1. Collective impact
• Percentage of executions through at least one changed method
• Result: 3/5 = 60%

2. Affected users
• Percentage of users that executed at least one changed method at least once
• Result: 3/3 = 100%

Output: collective impact = 60%, affected users = 100%
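Both metrics are straightforward to compute from the same coverage matrix. A minimal Python sketch follows; the function names are mine, and the data is illustrative, arranged to yield the slide's 60% and 100% (five executions from three users).

def collective_impact(coverage, changed):
    # Fraction of executions that traverse at least one changed method.
    hits = sum(1 for methods in coverage.values() if methods & changed)
    return hits / len(coverage)

def affected_users(coverage, changed, user_of):
    # Fraction of users with at least one execution through a changed method.
    users = {user_of(e) for e in coverage}
    hit_users = {user_of(e) for e, m in coverage.items() if m & changed}
    return len(hit_users) / len(users)

coverage = {
    "A1": {"m1", "m2"},
    "A2": {"m5"},
    "B1": {"m3", "m6"},
    "B2": {"m1", "m4"},
    "C1": {"m5", "m6"},
}
user = lambda execution: execution[0]              # "A1" -> user "A"

print(collective_impact(coverage, {"m5", "m6"}))          # 0.6
print(affected_users(coverage, {"m5", "m6"}, user))       # 1.0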

EMPIRICAL STUDY

• Subject:

• JABA: Java Architecture for Bytecode Analysis (60 KLOC, 500 classes, 3K methods)

• Data

• Field data: 1,100 executions (14 users, 12 weeks)

• In-house data: 195 test cases, 63% method coverage

• Changes: 20 real changes extracted from JABA’s CVS repository

• Research question: Does field data yield different results than in-house data?

• Experimental setup

• Computed impact sets for the 20 changes using field data and using in-house data

• Compared impact sets for the two datasets
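The comparison itself reduces to set arithmetic per change. A hedged Python sketch (the impact sets below are made up; in the study they come from applying the analysis above to each data source for each of the 20 changes):

def compare(field_set, inhouse_set):
    # The four quantities plotted per change in the results chart.
    return {
        "Field": len(field_set),
        "InHouse": len(inhouse_set),
        "Field - InHouse": len(field_set - inhouse_set),
        "InHouse - Field": len(inhouse_set - field_set),
    }

field = {"m1", "m2", "m5", "m6"}     # made-up impact set from field data
inhouse = {"m2", "m3", "m5"}         # made-up impact set from in-house data
print(compare(field, inhouse))
# {'Field': 4, 'InHouse': 3, 'Field - InHouse': 2, 'InHouse - Field': 1}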

RESULTS

[Bar chart: size of the impact set (# methods, y-axis 0-900) for each change C1-C20, with four bars per change: Field, InHouse, Field - InHouse, InHouse - Field]

[Venn diagram: overlap between the overall InHouse and Field impact sets]

"Gammatella: Visualizing Program-Execution Data for Deployed Software." Jones et al., Information Visualization, 2004.

DEMO

DEBUGGING FIELD FAILURES

FIELD FAILURES

Field failures: Anomalous behaviors (or crashes) of deployed software that occur on user machines

• Difficult to debug
• Relevant to users

CURRENT PRACTICE

Ask the user:

I opened my web browser. Specifically, I clicked on the dock icon. It bounced twice before crashing. Please help.

CURRENT PRACTICE

Gather static information:

• Difficult to reproduce the problem
• Only locations directly correlated with the failure

OUR SOLUTION

Record failing executions in the field + replay failing executions in house = debug field failures effectively
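To make the record/replay idea concrete, here is a toy Python sketch. The actual technique (Clause and Orso, ICSE 2007) captures all environment interactions of a binary, such as files, network, and user events; this sketch wraps a single input function, and all names in it are illustrative.

def make_recorder(get_input, log):
    def recorded():
        value = get_input()
        log.append(value)        # capture the interaction in the field
        return value
    return recorded

def make_replayer(log):
    it = iter(log)
    return lambda: next(it)      # in house: feed back the captured values

def program(read):               # the "deployed program", parameterized by its input source
    return read() + read()

log = []
inputs = iter([2, 3])
print(program(make_recorder(lambda: next(inputs), log)))  # 5, while recording
print(program(make_replayer(log)))                        # 5 again, replayed in house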

USAGE SCENARIO

[Diagram: develop → record in the field → captured failure → replay / debug in house]

CHALLENGES

• Captured executions are large in size → minimize
• Captured executions contain sensitive information → anonymize
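For the minimization step, a delta-debugging-style reduction loop conveys the idea. Note that this generic ddmin-like variant is a stand-in, not the paper's actual event-minimization algorithm, and the failure oracle below is a toy.

def minimize(events, still_fails):
    # Greedily drop chunks of recorded events while the replay still fails.
    n = 2
    while len(events) >= 2:
        chunk = max(1, len(events) // n)
        reduced = False
        for start in range(0, len(events), chunk):
            candidate = events[:start] + events[start + chunk:]
            if candidate and still_fails(candidate):
                events, n = candidate, max(n - 1, 2)
                reduced = True
                break
        if not reduced:
            if n >= len(events):
                break            # finest granularity reached; done
            n = min(n * 2, len(events))
    return events

# Toy oracle: the failure needs events 3 and 7 to reproduce.
fails = lambda evs: {3, 7} <= set(evs)
print(minimize(list(range(10)), fails))   # [3, 7]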

USAGE SCENARIO

[Diagram: develop → record in the field → captured failure → minimize / anonymize → replay / debug in house]

EVALUATION (PRACTICALITY)

Research question 1:
• does the technique impose an acceptable overhead?

Subjects:
• several CPU-intensive applications (e.g., bzip, gcc)

Results:
• negligible overhead (i.e., less than 10%)
• data size is acceptable (application dependent)

"A Technique for Enabling and Supporting Debugging of Field Failures." Clause and Orso, ICSE 2007.

EVALUATION (FEASIBILITY)

Research question 2:
• can the technique produce minimized executions that can be used to debug the original failure?

Subject: Pine email and news client
• two real field failures
• 20 failing executions, 10 per failure

Results:
• executions reduced to less than 10% of their original size
• all failures reproducible

"A Technique for Enabling and Supporting Debugging of Field Failures." Clause and Orso, ICSE 2007.

EVALUATION (EFFECTIVENESS)

Research question 3:
• how much information about the original inputs is revealed?

Subjects: NanoXML, htmlparser, Printtokens, Columba
• 20 faults overall
• inputs from 100 bytes to 5 MB in size
• all inputs considered sensitive

Results:
• anonymized inputs revealed between 2% and 60% of the information in the original inputs

"Camouflage: Automated Anonymization of Field Data." Clause and Orso, GT Tech Report, March 2010.

RQ3: EFFECTIVENESS (NANOXML)

Original input:

<!DOCTYPE Foo [
   <!ELEMENT Foo (ns:Bar)>
   <!ATTLIST Foo
       xmlns CDATA #FIXED 'http://nanoxml.n3.net/bar'
       a     CDATA #REQUIRED>
   <!ELEMENT ns:Bar (Blah)>
   <!ATTLIST ns:Bar
       xmlns:ns CDATA #FIXED 'http://nanoxml.n3.net/bar'>
   <!ELEMENT Blah EMPTY>
   <!ATTLIST Blah
       x    CDATA #REQUIRED
       ns:x CDATA #REQUIRED>
]>
<!-- comment -->
<Foo a='very' b='secret' c='stuff'>vaz
   <ns:Bar>
       <Blah x="1" ns:x="2"/>
   </ns:Bar>
</Foo>

RQ3: EFFECTIVENESS (NANOXML)

Anonymized input (sensitive names and values blanked out):

<!DOCTYPE [
   <! >
   <!ATTLIST
        #FIXED ' '
        >
   <!E >
   <!ATTLIST
        #FIXED ' '>
   <!E >
   <!ATTLIST
        #
        : # >
]>
<!-- -->
< =' ' =' ' =' '>
   < : >
       < =" " : =" "/>
   </ :

RQ3: EFFECTIVENESS (COLUMBA)

Original input (address book):

Wayne,Bartley,Bartley,Wayne,wbartly@acp.com,,Ronald,Kahle,Kahle,Ron,ron.kahle@kahle.com,,Wilma,Lavelle,Lavelle,Wilma,,lavelle678@aol.com,Jesse,Hammonds,Hammonds,Jesse,,hamj34@comcast.com,Amy,Uhl,Uhl,Amy,uhla@corp1,com,uhla@gmail.com,Hazel,Miracle,Miracle,Hazel,hazel.miracle@corp2.com,,Roxanne,Nealy,Nealy,Roxie,,roxie.nearly@gmail.com,Heather,Kane,Kane,Heather,kaneh@corp2.com,,Rosa,Stovall,Stovall,Rosa,,sstoval@aol.com,Peter,Hyden,Hyden,Pete,,peteh1989@velocity.net,Jeffrey,Wesson,Wesson,Jeff,jwesson@corp4.com,,Virginia,Mendoza,Mendoza,Ginny,gmendoza@corp4.com,,Richard,Robledo,Robledo,Ralph,ralphrobledo@corp1.com,,Edward,Blanding,Blanding,Ed,,eblanding@gmail.com,Sean,Pulliam,Pulliam,Sean,spulliam@corp2.com,,Steven,Kocher,Kocher,Steve,kocher@kocher.com,,Tony,Whitlock,Whitlock,Tony,,tw14567@aol.com,Frank,Earl,Earl,Frankie,,,Shelly,Riojas,Riojas,Shelly,srojas@corp6.com,,

Anonymized input:

, , , ,, , , , , ,,Wilma,Lavelle,Lavelle,Wilma,,lavelle678@aol.com,Jesse,Hammonds,Hammonds,Jesse,,hamj34@comcast.com,Amy,Uhl,Uhl,Amy,uhla@corp1,com,uhla@gmail.com,Hazel,Miracle,Miracle,Hazel,hazel.miracle@corp2.com,,Roxanne,Nealy,Nealy,Roxie,,roxie.nearly@gmail.com,Heather,Kane,Kane,Heather,kaneh@corp2.com,,Rosa,Stovall,Stovall,Rosa,,sstoval@aol.com,Peter,Hyden,Hyden,Pete,,peteh1989@velocity.net,Jeffrey,Wesson,Wesson,Jeff,jwesson@corp4.com,,Virginia,Mendoza,Mendoza,Ginny,gmendoza@corp4.com,,Richard,Robledo,Robledo,Ralph,ralphrobledo@corp1.com,,Edward,Blanding,Blanding,Ed,,eblanding@gmail.com,Sean,Pulliam,Pulliam,Sean,spulliam@corp2.com,,Steven,Kocher,Kocher,Steve,kocher@kocher.com,,Tony,Whitlock,Whitlock,Tony,,tw14567@aol.com,Frank,Earl,Earl,Frankie,,,Shelly,Riojas,Riojas,Shelly,srojas@corp6.com,,

RQ3: EFFECTIVENESS (HTMLPARSER)

<?xml version="1.0" encoding="UTF-8" ?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1//EN" "http://www.w3.org/TR/xhtml11/DTD/xhtml11.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en">
<head>
<title>james clause @ gatech | home</title>
<style type="text/css" media="screen" title=""><!--/*--><![CDATA[<!--*/
body { margin: 0px;
...
/*]]>*/--></style>
</head>
<body> ...</body>

The portions of the inputs that remain after anonymization tend to be structural in nature and therefore are safe to send to developers.
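To illustrate the structure-versus-content distinction, here is a naive Python scrubber that keeps XML element structure and blanks attribute values and text. Camouflage itself is far more careful: it uses the failing execution's path condition to decide what may be revealed, so this sketch (with made-up names and input) only conveys the intuition visible in the before/after slides.

import xml.etree.ElementTree as ET

def scrub(element):
    for attr in element.attrib:
        element.attrib[attr] = ""        # blank every attribute value
    if element.text and element.text.strip():
        element.text = ""                # blank non-whitespace text content
    for child in element:
        scrub(child)                     # recurse, preserving the tree shape
    return element

root = ET.fromstring("<Foo a='very' b='secret'>vaz<Bar x='1'/></Foo>")
print(ET.tostring(scrub(root)).decode())
# <Foo a="" b=""><Bar x="" /></Foo>  (structure kept, values hidden)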

CONCLUDING REMARKS

ADDRESSING THE ISSUES

• Issue #1: Communication
  • Reaching out
  • More common events
  • Challenge

• Issue #2: Mismatch in assumptions
  • Many similarities and potential synergies
  • Opportunity for defining new (or specialized) analyses
  • Opportunity for performing more thorough evaluations

• Issue #3: Infrastructure
  • Related to communication
  • Reciprocal help

• Issue #4: Narrow focus of some MSA research
  • Go beyond the analysis of “easy” information in the repositories
  • Consider all aspects of software, both static and dynamic
  • Consider both in-vitro and in-vivo data

IN CONCLUSION, BETTER TOGETHER?

Techniques for analyzing/mining a program in all of its aspects, static and dynamic, and throughout its lifetime

ACKNOWLEDGEMENTS

• Taweesup Apiwattanapong

• James Clause

• Mary Jean Harrold

• James Jones

• Donglin Liang

• Dick Lipton