Interactive Near Duplicate Search in Software Documentation
- Source :
- Programming and Computer Software. 45:346-355
- Publication Year :
- 2019
- Publisher :
- Pleiades Publishing Ltd, 2019.
Abstract
- Various software features such as classes, methods, requirements, and tests often have similar functionality. This can lead to the emergence of duplicates in their descriptive documentation. Uncontrolled duplicates created via copy/paste hinder documentation maintenance. Therefore, the task of duplicate detection in software documentation is important. Solving it makes planned reuse possible, as well as the creation and use of templates for unification and automatic generation of documentation. In this paper, we present an approach for interactive detection of near duplicates that involves the user in order to conduct a meaningful search. It includes a new formal definition of a near duplicate, a pattern-based search algorithm, and the proof of its completeness. Moreover, we demonstrate the results of experiments on a collection of documents from several industrial projects.
- Subjects :
- Software documentation
Information retrieval
Unification
business.industry
Computer science
Process (engineering)
020207 software engineering
0102 computer and information sciences
02 engineering and technology
Reuse
01 natural sciences
Task (project management)
Software
Documentation
010201 computation theory & mathematics
Completeness (order theory)
0202 electrical engineering, electronic engineering, information engineering
business
Details
- ISSN :
- 1608-3261 and 0361-7688
- Volume :
- 45
- Database :
- OpenAIRE
- Journal :
- Programming and Computer Software
- Accession number :
- edsair.doi...........b7e9e99f5dc96cf4a91e223c75d6af9f
- Full Text :
- https://doi.org/10.1134/s0361768819060045