Back to Search Start Over

Code similarity detection through control statement and program features

Authors :
M. Sudhamani
Lalitha Rangarajan
Source :
Expert Systems with Applications. 132:63-75
Publication Year :
2019
Publisher :
Elsevier BV, 2019.

Abstract

Software clone detection is an emerging research area in the field of software engineering. Software systems are subjected to continuous modifications in source code to improve the performance of the software, which may lead to code redundancy. Duplicate code/code clone is a piece of code reworked several times in software programs due to copy paste activity or reusability of existing software. Code clone is a prime subject in software evolution. Detection of software clones at the time of software evolution may improve the performance of software and reduce the maintenance cost and effort. This paper proposes metric based methods to detect code clones, as software clone is a universal problem in large scale programming environment. This paper introduces two metric based approaches to detect code clones by comparing (i) Control Statement Features (ii) Program Features like different types of statements, operators and operands. In order to demonstrate the effectiveness of the proposed approaches, extensive experiments are conducted on two datasets, C projects of Bellon's benchmark dataset and student lab programs (SLP).The methods efficiently identify similar functional clones. Proposed models only find similarity of whole programs but intelligent enough to highlight similar code segments across program files.

Details

ISSN :
09574174
Volume :
132
Database :
OpenAIRE
Journal :
Expert Systems with Applications
Accession number :
edsair.doi...........763ab2be87126f8e471de86437103a10