Interactive Near Duplicate Search in Software Documentation

Authors :
A. I. Vasenina
Dmitry A. Grigoriev
D. V. Borovkov
A. N. Smirnova
George A. Chernishev
D. V. Luciv
Dmitry Koznov
Andrey Terekhov
K. Yu. Romanovsky
A. A. Shelikhovskii
Source :
Programming and Computer Software. 45:346-355
Publication Year :
2019
Publisher :
Pleiades Publishing Ltd, 2019.

Abstract

Various software features such as classes, methods, requirements, and tests often have similar functionality. This can lead to the emergence of duplicates in their descriptive documentation. Uncontrolled duplicates created via copy/paste hinder documentation maintenance. Therefore, the task of detecting duplicates in software documentation is important. Solving it enables planned reuse, as well as the creation and use of templates for unifying and automatically generating documentation. In this paper, we present an approach for interactive detection of near duplicates that involves the user in order to conduct a meaningful search. It includes a new formal definition of a near duplicate, a pattern-based algorithm, and the proof of its completeness. Moreover, we present experimental results on a collection of documents from several industrial projects.

Details

ISSN :
1608-3261 and 0361-7688
Volume :
45
Database :
OpenAIRE
Journal :
Programming and Computer Software
Accession number :
edsair.doi...........b7e9e99f5dc96cf4a91e223c75d6af9f
Full Text :
https://doi.org/10.1134/s0361768819060045