Are duplicates really harmful? An empirical study on bug report summarization techniques.

Authors :
Hao, Rui
Li, Yuying
Feng, Yang
Chen, Zhenyu
Source :
Journal of Software: Evolution & Process. Nov 2023, Vol. 35 Issue 11, p1-25. 25p.
Publication Year :
2023

Abstract

Recent research works have proven that duplicate bug reports can provide helpful information to assist developers in software tasks such as fault localization and program fixing, while thoroughly reading duplicate bug reports is time‐consuming and inefficient. Summarization is a possible solution for gaining essential information quickly. However, there are many challenges when applying existing summarizing techniques on duplicate bug reports. Duplicate bug reports describe the same problem from different views and vary in quality, content, and writing style. Moreover, the code snippet understanding and the semantic gap between natural and programming languages make the summary generation even more difficult. Thus, in this paper, we want to investigate whether the state‐of‐the‐art summarization approaches can overcome the resistance and generate an effective summary for duplicate bug reports. We collected more than 8,000 groups of duplicate reports from GitHub and labeled 60 groups with 149 reports manually for the evaluation. Results showed that although the existing summarization approaches can work on duplicate bug reports, there are significant differences between them when it comes to code snippet summarization. Moreover, several methods can be very sluggish for summarizing long bug reports. Our study provides insights and guidelines for choosing proper summarization approaches in different scenarios. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
2047-7473
Volume :
35
Issue :
11
Database :
Academic Search Index
Journal :
Journal of Software: Evolution & Process
Publication Type :
Academic Journal
Accession number :
173397362
Full Text :
https://doi.org/10.1002/smr.2424