2013 07 05 (uc3m) lasi emadrid grobles jgbarahona urjc lecciones aprendidas analitica software...


Page 1: 2013 07 05 (uc3m) lasi emadrid grobles jgbarahona urjc lecciones aprendidas analitica software analitica aprendizaje

About MSR

Types of analysis

Possible lessons to be learned

10+ years of software analytics

Lessons learned that may be useful for learning analytics

Gregorio Robles, Jesús M. González Barahona

{grex,jgb}@gsyc.urjc.es

GSyC/LibreSoft, Universidad Rey Juan Carlos, Madrid, Spain

LASI, July 5th, 2013

Gregorio Robles, Jesús M. González Barahona 10+ years of software analytics

Page 2:

© 2013 Gregorio Robles, Jesús M. González-Barahona

All figures are ours, except when the original source is specified.

Some rights reserved. This presentation is distributed under the “Attribution-ShareAlike 3.0” license, by Creative Commons, available at

http://creativecommons.org/licenses/by-sa/3.0/

Page 3:

What is this talk about?

Page 4:

About us

What we do

Main goal: Understanding free software

Main focus: empirical software engineering

Main method: retrieval of publicly available data

Part of the GSyC/LibreSoft research team

Spin-off company: Bitergia

The other author

Jesús M. González-Barahona

Member of eMadrid

Universidad Rey Juan Carlos

Madrid, Spain

Page 5:

Table of Contents

1 About MSR

2 Types of analysis

3 Possible lessons to be learned

Page 6:

Data in software development

Page 7:

Mining Software Repositories

Figure: Mining Software Repositories: http://msrconf.org

Page 8:

Publicly available data sources

Mainly from free software projects

Artifacts

Mainly source code

Repositories (with meta-data)

Versioning systems

Bug-tracking systems

Mailing lists, forums, etc.

Other: Twitter, chats, etc.
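To make "repositories (with meta-data)" concrete, here is a minimal sketch of the kind of meta-data a versioning system exposes: who committed, and when. It is not taken from the talk. It assumes log text in an `author|date` shape, such as `git log --pretty=format:'%an|%ad' --date=short` would produce. The sample data and the function names (`parse_log`, `commits_per_author`, `active_authors_per_month`) are hypothetical and serve only as illustration.

```python
from collections import Counter

# Hypothetical sample in the shape produced by, for example,
#   git log --pretty=format:'%an|%ad' --date=short
# The names and dates below are invented for illustration only.
SAMPLE_LOG = """\
alice|2013-01-03
bob|2013-01-03
alice|2013-02-11
carol|2013-02-20
alice|2013-03-01
"""


def parse_log(text):
    """Return a list of (author, 'YYYY-MM') pairs, one per commit line."""
    commits = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        # Split on the last '|' so author names containing '|' survive.
        author, date = line.rsplit("|", 1)
        commits.append((author, date[:7]))
    return commits


def commits_per_author(commits):
    """Count commits per author (a simple activity metric)."""
    return Counter(author for author, _ in commits)


def active_authors_per_month(commits):
    """Count distinct authors with at least one commit in each month."""
    months = {}
    for author, month in commits:
        months.setdefault(month, set()).add(author)
    return {month: len(authors) for month, authors in sorted(months.items())}


if __name__ == "__main__":
    commits = parse_log(SAMPLE_LOG)
    print(commits_per_author(commits).most_common())
    print(active_authors_per_month(commits))
```

The same pattern (parse a public log, then aggregate by person and by time window) could be applied to bug trackers or mailing-list archives, although each source would need its own parser.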

Page 9:

Artifacts vs. repositories

Figure: by Israel Herraiz

Page 10:

Table of Contents

1 About MSR

2 Types of analysis

3 Possible lessons to be learned

Page 11:

Important keywords

Page 12:

Recommender systems

Figure: Ahmed Lamkanfi et al., MSR 2010

Page 13:

Social Network Analysis

Figure: Interactions for Linux 1.0.

Page 14:

Developer territoriality (the toothbrush effect)

Figure: Daniel German, 2004

Page 15:

Entry patterns

Page 16:

Table of Contents

1 About MSR

2 Types of analysis

3 Possible lessons to be learned

Page 17:

Correlation and causation

Page 18:

Not the right data

Figure: Sometimes we have lots of data... but not the data required to answer the question! Picture: (c) StatusMind.com

Page 19:

Noise!

Figure: Tools have been designed for learning, not for analytics! Picture: (c) www.socialresearchmethods.net

Page 20:

Diversity

Figure: There is no one-size-fits-all. Every project has its own processes, peculiarities, history and culture.

Page 21:

Challenges for the research community

Figure: Challenges should not only address existing data sources (it is not only about the method!). Obtaining new data sources should count as a challenge per se. Picture: (c) Redeem the commute

Page 22:

Repositories

Figure: We need public repositories where data is shared. There is always a data source that you did not think about! Links among data in different sources are difficult to gather.

Page 23:

Replication

Figure: Have you thought about replicating your studies later? And having them replicated by an independent research group? Figure: (c) 2009 Archangel

Page 24:

To take away!

We’ve shown the experience of analytics in another field

There are plenty of ideas, concepts, models and algorithms that can be used

We may have lots of data to analyze... but

Be aware that correlation is not causation!

We need good (and the right) data

We need diverse data

We need to think about replicability
