SlideShare una empresa de Scribd logo
1 de 20
Are Change Metrics Good Predictors for an Evolving Software Product Line? Sandeep Krishnan, ISU Chris Strasburg, ISU & Ames Laboratory  Robyn R. Lutz, ISU & JPL, California Institute of Technology Katerina Goseva-Popstojanova, WVU 1 This research is supported by NSF grants 0916275 and 0916284 Dept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
Background Product line – “A family of products designed to take advantage of their common aspects and predicted variabilities” [Weiss and Lai 1999] e.g., Nokia cellphones, HP printers, etc. Products -  Commonalities – Shared by all products. e.g., Platform Variabilities – Differentiate the products ,[object Object]
JDT, PDE, Mylyn, Webtools, etc.
Reused in more than three products and for more than six years.
Low-reuse variation
CDT, Datatools, Java EE tools.
Reused in three or fewer products and for more than four years.2 Dept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
Related Work Eclipse as a product line. [Chastek, McGregor, and Northrop, 2007], [Linden, 2009], [Krishnan et al., 2011].  Summary of previous work 3 Failure-prone file  -A file with one or more non-trivial post-release bugs recorded in the Eclipse Bugzilla database. Important/Good predictor – Predictor providing high information gain for classification of failure-prone files Dept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
Product Line Evolution Product line evolution in two dimensions 4 New Releases P1 R1 P1 R2 P1 R3 P1 Rn P2 R1 P2 R2 P2 R3 P2 Rn New Products Pn R1 Pn R2 Pn R3 Pn Rn Dept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
Motivation Can we leverage the reducing amount of change in product lines to better predict failure-prone files? 5 Dept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
Eclipse case study 6 Blocker Eclipse Classic Critical Eclipse Java Major Eclipse C/C++ Normal Eclipse JavaEE Minor
Research Questions As a product evolves, do any change metrics serve as good predictors of failure-prone files? Is there a subset of change metrics which are good predictors across all product line members? Does our ability to predict failure-prone files improve as product line evolves? 7 Dept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
Findings The change metrics provide good classification of the failure-prone files in the Eclipse product line. As each product evolves, there is a stable set of change metrics that are prominent predictors of failure-prone files across its releases. There is a subset of change metrics that is among the prominent predictors of all the products across most of the releases. As the product line matures, prediction performance improves for each of the four Eclipse products. 8 Dept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
Data Source 9 Data Timeline Data Timeline Source of failure reports- Source of change reports – CVS repository of Eclipse. Dept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
Approach 10 Get prediction results + best predictors Weka      J48 decision tree learner Dept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
Replication Results 11 Learner performance compared to previous results Classification performance comparison for Eclipse Classic 2.0, 2.1, and 3.0 PC- Percentage of correctly classified instances TPR- True positive rate FPR- False positive rate Dept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
12 Top five predictors for earlier releases of Eclipse Classic Replication Results Top predictors from this study ,[object Object],Top predictors from previous study ,[object Object],Dept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
13 Learner performance improves as single product evolves Extension Results Dept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
14 Top five predictors for later releases of Eclipse Classic Extension Results Revisions is good predictor for later releases also. Max_changeset is a good predictor also. Dept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
15 Learner performance improves as product line evolves Extension Results Percentage of correctly classified instances increases across releases for each product Dept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011

Más contenido relacionado

Similar a Promise 2011: "Are Change Metrics Good Predictors for an Evolving Software Product Line?"

Kuchinsky_Cytoscape_BOSC2009
Kuchinsky_Cytoscape_BOSC2009Kuchinsky_Cytoscape_BOSC2009
Kuchinsky_Cytoscape_BOSC2009
bosc
 
[SAC2014]Splitting Approaches for Context-Aware Recommendation: An Empirical ...
[SAC2014]Splitting Approaches for Context-Aware Recommendation: An Empirical ...[SAC2014]Splitting Approaches for Context-Aware Recommendation: An Empirical ...
[SAC2014]Splitting Approaches for Context-Aware Recommendation: An Empirical ...
YONG ZHENG
 
Agile Methods Cost of Quality: Benefits of Testing Early & Often
Agile Methods Cost of Quality: Benefits of Testing Early & OftenAgile Methods Cost of Quality: Benefits of Testing Early & Often
Agile Methods Cost of Quality: Benefits of Testing Early & Often
David Rico
 
Exercise 3 Worksheet Create a Peer Reviewed ReferenceSave this .docx
Exercise 3 Worksheet Create a Peer Reviewed ReferenceSave this .docxExercise 3 Worksheet Create a Peer Reviewed ReferenceSave this .docx
Exercise 3 Worksheet Create a Peer Reviewed ReferenceSave this .docx
SANSKAR20
 
Enabling and Supporting the Debugging of Field Failures (Job Talk)
Enabling and Supporting the Debugging of Field Failures (Job Talk)Enabling and Supporting the Debugging of Field Failures (Job Talk)
Enabling and Supporting the Debugging of Field Failures (Job Talk)
James Clause
 

Similar a Promise 2011: "Are Change Metrics Good Predictors for an Evolving Software Product Line?" (20)

Software Sustainability Institute
Software Sustainability InstituteSoftware Sustainability Institute
Software Sustainability Institute
 
Kuchinsky_Cytoscape_BOSC2009
Kuchinsky_Cytoscape_BOSC2009Kuchinsky_Cytoscape_BOSC2009
Kuchinsky_Cytoscape_BOSC2009
 
Defect Prediction: Accomplishments and Future Challenges
Defect Prediction: Accomplishments and Future ChallengesDefect Prediction: Accomplishments and Future Challenges
Defect Prediction: Accomplishments and Future Challenges
 
Intensive metrics software evolution
Intensive metrics software evolutionIntensive metrics software evolution
Intensive metrics software evolution
 
From Bugs to Decision Support - Selected Research Highlights
From Bugs to Decision Support - Selected Research HighlightsFrom Bugs to Decision Support - Selected Research Highlights
From Bugs to Decision Support - Selected Research Highlights
 
Past, Present, and Future of Analyzing Software Data
Past, Present, and Future of Analyzing Software DataPast, Present, and Future of Analyzing Software Data
Past, Present, and Future of Analyzing Software Data
 
ANALYZABILITY METRIC FOR MAINTAINABILITY OF OBJECT ORIENTED SOFTWARE SYSTEM
ANALYZABILITY METRIC FOR MAINTAINABILITY OF OBJECT ORIENTED SOFTWARE SYSTEMANALYZABILITY METRIC FOR MAINTAINABILITY OF OBJECT ORIENTED SOFTWARE SYSTEM
ANALYZABILITY METRIC FOR MAINTAINABILITY OF OBJECT ORIENTED SOFTWARE SYSTEM
 
[SAC2014]Splitting Approaches for Context-Aware Recommendation: An Empirical ...
[SAC2014]Splitting Approaches for Context-Aware Recommendation: An Empirical ...[SAC2014]Splitting Approaches for Context-Aware Recommendation: An Empirical ...
[SAC2014]Splitting Approaches for Context-Aware Recommendation: An Empirical ...
 
Why Do Computational Scientists Trust Their So
Why Do Computational Scientists Trust Their SoWhy Do Computational Scientists Trust Their So
Why Do Computational Scientists Trust Their So
 
Resource Description Framework Approach to Data Publication and Federation
Resource Description Framework Approach to Data Publication and FederationResource Description Framework Approach to Data Publication and Federation
Resource Description Framework Approach to Data Publication and Federation
 
Software Defect Trend Forecasting In Open Source Projects using A Univariate ...
Software Defect Trend Forecasting In Open Source Projects using A Univariate ...Software Defect Trend Forecasting In Open Source Projects using A Univariate ...
Software Defect Trend Forecasting In Open Source Projects using A Univariate ...
 
Performance Analysis of Leading Application Lifecycle Management Systems for...
Performance Analysis of Leading Application Lifecycle  Management Systems for...Performance Analysis of Leading Application Lifecycle  Management Systems for...
Performance Analysis of Leading Application Lifecycle Management Systems for...
 
Agile methods cost of quality
Agile methods cost of qualityAgile methods cost of quality
Agile methods cost of quality
 
Agile Methods Cost of Quality: Benefits of Testing Early & Often
Agile Methods Cost of Quality: Benefits of Testing Early & OftenAgile Methods Cost of Quality: Benefits of Testing Early & Often
Agile Methods Cost of Quality: Benefits of Testing Early & Often
 
lecture24.ppt
lecture24.pptlecture24.ppt
lecture24.ppt
 
02 stein intro_4th-pv_modeling_workshop_2015-10-22_sand2015-8571_c
02 stein intro_4th-pv_modeling_workshop_2015-10-22_sand2015-8571_c02 stein intro_4th-pv_modeling_workshop_2015-10-22_sand2015-8571_c
02 stein intro_4th-pv_modeling_workshop_2015-10-22_sand2015-8571_c
 
Journal Club - Best Practices for Scientific Computing
Journal Club - Best Practices for Scientific ComputingJournal Club - Best Practices for Scientific Computing
Journal Club - Best Practices for Scientific Computing
 
Exercise 3 Worksheet Create a Peer Reviewed ReferenceSave this .docx
Exercise 3 Worksheet Create a Peer Reviewed ReferenceSave this .docxExercise 3 Worksheet Create a Peer Reviewed ReferenceSave this .docx
Exercise 3 Worksheet Create a Peer Reviewed ReferenceSave this .docx
 
Chemistry made mobile – the expanding world of chemistry in the hand
Chemistry made mobile – the expanding world of chemistry in the handChemistry made mobile – the expanding world of chemistry in the hand
Chemistry made mobile – the expanding world of chemistry in the hand
 
Enabling and Supporting the Debugging of Field Failures (Job Talk)
Enabling and Supporting the Debugging of Field Failures (Job Talk)Enabling and Supporting the Debugging of Field Failures (Job Talk)
Enabling and Supporting the Debugging of Field Failures (Job Talk)
 

Más de CS, NcState

Lexisnexis june9
Lexisnexis june9Lexisnexis june9
Lexisnexis june9
CS, NcState
 
Ai4se lab template
Ai4se lab templateAi4se lab template
Ai4se lab template
CS, NcState
 
Automated Software Enging, Fall 2015, NCSU
Automated Software Enging, Fall 2015, NCSUAutomated Software Enging, Fall 2015, NCSU
Automated Software Enging, Fall 2015, NCSU
CS, NcState
 
Dagstuhl14 intro-v1
Dagstuhl14 intro-v1Dagstuhl14 intro-v1
Dagstuhl14 intro-v1
CS, NcState
 

Más de CS, NcState (20)

Talks2015 novdec
Talks2015 novdecTalks2015 novdec
Talks2015 novdec
 
Future se oct15
Future se oct15Future se oct15
Future se oct15
 
GALE: Geometric active learning for Search-Based Software Engineering
GALE: Geometric active learning for Search-Based Software EngineeringGALE: Geometric active learning for Search-Based Software Engineering
GALE: Geometric active learning for Search-Based Software Engineering
 
Big Data: the weakest link
Big Data: the weakest linkBig Data: the weakest link
Big Data: the weakest link
 
Three Laws of Trusted Data Sharing: (Building a Better Business Case for Dat...
Three Laws of Trusted Data Sharing:(Building a Better Business Case for Dat...Three Laws of Trusted Data Sharing:(Building a Better Business Case for Dat...
Three Laws of Trusted Data Sharing: (Building a Better Business Case for Dat...
 
Lexisnexis june9
Lexisnexis june9Lexisnexis june9
Lexisnexis june9
 
Welcome to ICSE NIER’15 (new ideas and emerging results).
Welcome to ICSE NIER’15 (new ideas and emerging results).Welcome to ICSE NIER’15 (new ideas and emerging results).
Welcome to ICSE NIER’15 (new ideas and emerging results).
 
Icse15 Tech-briefing Data Science
Icse15 Tech-briefing Data ScienceIcse15 Tech-briefing Data Science
Icse15 Tech-briefing Data Science
 
Kits to Find the Bits that Fits
Kits to Find  the Bits that Fits Kits to Find  the Bits that Fits
Kits to Find the Bits that Fits
 
Ai4se lab template
Ai4se lab templateAi4se lab template
Ai4se lab template
 
Automated Software Enging, Fall 2015, NCSU
Automated Software Enging, Fall 2015, NCSUAutomated Software Enging, Fall 2015, NCSU
Automated Software Enging, Fall 2015, NCSU
 
Requirements Engineering
Requirements EngineeringRequirements Engineering
Requirements Engineering
 
172529main ken and_tim_software_assurance_research_at_west_virginia
172529main ken and_tim_software_assurance_research_at_west_virginia172529main ken and_tim_software_assurance_research_at_west_virginia
172529main ken and_tim_software_assurance_research_at_west_virginia
 
Automated Software Engineering
Automated Software EngineeringAutomated Software Engineering
Automated Software Engineering
 
Next Generation “Treatment Learning” (finding the diamonds in the dust)
Next Generation “Treatment Learning” (finding the diamonds in the dust)Next Generation “Treatment Learning” (finding the diamonds in the dust)
Next Generation “Treatment Learning” (finding the diamonds in the dust)
 
Tim Menzies, directions in Data Science
Tim Menzies, directions in Data ScienceTim Menzies, directions in Data Science
Tim Menzies, directions in Data Science
 
Goldrush
GoldrushGoldrush
Goldrush
 
Dagstuhl14 intro-v1
Dagstuhl14 intro-v1Dagstuhl14 intro-v1
Dagstuhl14 intro-v1
 
Know thy tools
Know thy toolsKnow thy tools
Know thy tools
 
The Art and Science of Analyzing Software Data
The Art and Science of Analyzing Software DataThe Art and Science of Analyzing Software Data
The Art and Science of Analyzing Software Data
 

Último

Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Peter Udo Diehl
 
Structuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessStructuring Teams and Portfolios for Success
Structuring Teams and Portfolios for Success
UXDXConf
 

Último (20)

FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
How we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfHow we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdf
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara Laskowska
 
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
 
Extensible Python: Robustness through Addition - PyCon 2024
Extensible Python: Robustness through Addition - PyCon 2024Extensible Python: Robustness through Addition - PyCon 2024
Extensible Python: Robustness through Addition - PyCon 2024
 
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
 
The Metaverse: Are We There Yet?
The  Metaverse:    Are   We  There  Yet?The  Metaverse:    Are   We  There  Yet?
The Metaverse: Are We There Yet?
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John Staveley
 
AI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří KarpíšekAI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří Karpíšek
 
ECS 2024 Teams Premium - Pretty Secure
ECS 2024   Teams Premium - Pretty SecureECS 2024   Teams Premium - Pretty Secure
ECS 2024 Teams Premium - Pretty Secure
 
Structuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessStructuring Teams and Portfolios for Success
Structuring Teams and Portfolios for Success
 
AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101
 
WebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceWebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM Performance
 
Syngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdfSyngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdf
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and Planning
 
THE BEST IPTV in GERMANY for 2024: IPTVreel
THE BEST IPTV in  GERMANY for 2024: IPTVreelTHE BEST IPTV in  GERMANY for 2024: IPTVreel
THE BEST IPTV in GERMANY for 2024: IPTVreel
 
Designing for Hardware Accessibility at Comcast
Designing for Hardware Accessibility at ComcastDesigning for Hardware Accessibility at Comcast
Designing for Hardware Accessibility at Comcast
 
Connecting the Dots in Product Design at KAYAK
Connecting the Dots in Product Design at KAYAKConnecting the Dots in Product Design at KAYAK
Connecting the Dots in Product Design at KAYAK
 
Oauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftOauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoft
 

Promise 2011: "Are Change Metrics Good Predictors for an Evolving Software Product Line?"

  • 1. Are Change Metrics Good Predictors for an Evolving Software Product Line? Sandeep Krishnan, ISU Chris Strasburg, ISU & Ames Laboratory Robyn R. Lutz, ISU & JPL, California Institute of Technology Katerina Goseva-Popstojanova, WVU 1 This research is supported by NSF grants 0916275 and 0916284 Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 2.
  • 3. JDT, PDE, Mylyn, Webtools, etc.
  • 4. Reused in more than three products and for more than six years.
  • 7. Reused in three or fewer products and for more than four years.2 Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 8. Related Work Eclipse as a product line. [Chastek, McGregor, and Northrop, 2007], [Linden, 2009], [Krishnan et al., 2011]. Summary of previous work 3 Failure-prone file -A file with one or more non-trivial post-release bugs recorded in the Eclipse Bugzilla database. Important/Good predictor – Predictor providing high information gain for classification of failure-prone files Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 9. Product Line Evolution Product line evolution in two dimensions 4 New Releases P1 R1 P1 R2 P1 R3 P1 Rn P2 R1 P2 R2 P2 R3 P2 Rn New Products Pn R1 Pn R2 Pn R3 Pn Rn Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 10. Motivation Can we leverage the reducing amount of change in product lines to better predict failure-prone files? 5 Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 11. Eclipse case study 6 Blocker Eclipse Classic Critical Eclipse Java Major Eclipse C/C++ Normal Eclipse JavaEE Minor
  • 12. Research Questions As a product evolves, do any change metrics serve as good predictors of failure-prone files? Is there a subset of change metrics which are good predictors across all product line members? Does our ability to predict failure-prone files improve as product line evolves? 7 Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 13. Findings The change metrics provide good classification of the failure-prone files in the Eclipse product line. As each product evolves, there is a stable set of change metrics that are prominent predictors of failure-prone files across its releases. There is a subset of change metrics that is among the prominent predictors of all the products across most of the releases. As the product line matures, prediction performance improves for each of the four Eclipse products. 8 Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 14. Data Source 9 Data Timeline Data Timeline Source of failure reports- Source of change reports – CVS repository of Eclipse. Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 15. Approach 10 Get prediction results + best predictors Weka J48 decision tree learner Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 16. Replication Results 11 Learner performance compared to previous results Classification performance comparison for Eclipse Classic 2.0, 2.1, and 3.0 PC- Percentage of correctly classified instances TPR- True positive rate FPR- False positive rate Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 17.
  • 18. 13 Learner performance improves as single product evolves Extension Results Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 19. 14 Top five predictors for later releases of Eclipse Classic Extension Results Revisions is good predictor for later releases also. Max_changeset is a good predictor also. Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 20. 15 Learner performance improves as product line evolves Extension Results Percentage of correctly classified instances increases across releases for each product Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 21. 16 Learner performance improves as product line evolves Extension Results Percentage of true positives shows improvement across releases for each product Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 22. 17 Learner performance improves as product line evolves Extension Results Percentage of false positives shows reduces across releases for each product Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 23. 18 Top five predictors for four products of Eclipse Product Line Extension Results No common set of predictors across each product and each release. Max_changeset, Revisions and Authors are prominent predictors for all products. Some predictors are prominent for only one product. Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 24. 19 Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 25. Thank You! 20 Our data is available at http://www.cs.iastate.edu/~lss/PROMISE11Data.tar.gz Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011