Back to Search Start Over

Comparing hierarchical dirichlet process with latent dirichlet allocation in bug report multiclass classification

Authors :
Hideaki Hata
Nachai Limsettho
Kenichi Matsumoto
Source :
SNPD
Publication Year :
2014
Publisher :
IEEE, 2014.

Abstract

Bug reports play essential roles in many software engineering tasks. Since validity and performance of these tasks definitely rely on the quality of bug reports, accurate information from bug reports is very important. However, as found in previous study, significant numbers of reports classified as bug are not really a bug. Recent studies proposed techniques to automatically classify bug reports into binary classes, yet there is still more to desire. These bug reports can be classified into multiple classes, which could help to identify what these reports are actually about. Moreover, previous study only looks into one possibility of topic modeling, that is, Latent Dirichlet Allocation (LDA). While LDA has its advantage, parameter tuning is required. In this paper, we propose a nonparametric approach to automatically classify bug reports with, another topic modeling method, Hierarchical Dirichlet Process (HDP). The result indicates that our nonparametric approach performance is comparable to the parametric one. We also examine various aspects of LDA to provide more thoroughly understanding of this process.<br />SNPD 2014 : 15th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, 30 June-2 July 2014, Las Vegas, NV, USA

Details

Language :
English
Database :
OpenAIRE
Journal :
SNPD
Accession number :
edsair.doi.dedup.....1766b7eb48ca1e23952f0346006fa5bc