Start Over

An Empirical Examination of the Impact of Bias on Just-in-time Defect Prediction

Authors :: Iftekhar Ahmed
Jiawei Li
Jiri Gesi
Source :: ESEM
Publication Year :: 2021
Publisher :: ACM, 2021.
Abstract: Background: Just-In-Time (JIT) defect prediction models predict if a commit will introduce defects in the future. DeepJIT and CC2Vec are two state-of-the-art JIT Deep Learning (DL) techniques. Usually, defect prediction techniques are evaluated, treating all training data equally. However, data is usually imbalanced not only in terms of the overall class label (e.g., defect and non-defect) but also in terms of characteristics such as File Count, Edit Count, Multiline Comments, Inward Dependency Sum etc. Prior research has investigated the impact of class imbalance on prediction technique's performance but not the impact of imbalance of other characteristics. Aims: We aim to explore the impact of different commit related characteristic's imbalance on DL defect prediction. Method: We investigated different characteristic's impact on the overall performance of DeepJIT and CC2Vec. We also propose a Siamese network based few-shot learning framework for JIT defect prediction (SifterJIT) combining Siamese network and DeepJIT. Results: Our results show that DeepJIT and CC2Vec lose out on the performance by around 20% when trained and tested on imbalanced data. However, SifterJIT can outperform state-of-the-art DL techniques with an average of 8.65% AUC score, 11% precision, and 6% F1-score improvement. Conclusions: Our results highlight that dataset imbalanced in terms of commit characteristics can significantly impact prediction performance, and few-shot learning based techniques can help alleviate the situation.

Subjects :: Training set
Dependency (UML)
Computer science
business.industry
Deep learning
Commit
Machine learning
computer.software_genre
Class (biology)
Class imbalance
Empirical examination
Artificial intelligence
business
computer
Predictive modelling

Details

Database :: OpenAIRE
Journal :: Proceedings of the 15th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM)
Accession number :: edsair.doi...........7e8d86fbad4d2116d06cbc2bedb2242f
Full Text :: https://doi.org/10.1145/3475716.3475791

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

An Empirical Examination of the Impact of Bias on Just-in-time Defect Prediction

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

An Empirical Examination of the Impact of Bias on Just-in-time Defect Prediction

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources