Back to Search Start Over

JCF: joint coarse- and fine-grained similarity comparison for plagiarism detection based on NLP.

Authors :
Chang, Chih-Yung
Jhang, Syu-Jhih
Wu, Shih-Jung
Roy, Diptendu Sinha
Source :
Journal of Supercomputing; Jan2024, Vol. 80 Issue 1, p363-394, 32p
Publication Year :
2024

Abstract

Document similarity recognition is one of the most important problems in natural language processing. This paper proposes a plagiarism comparison mechanism called JCF. Initially, the TF–IDF scheme is applied to build a bag of words as the representation of the common features of all documents. Then, the plagiarism comparison is carried out in a coarse-grained manner, which speeds up the similarity comparison. Finally, the most similar documents can then be compared in detail based on a fine-grained approach. In addition, the JCF detects plagiarism at both syntax level and semantic-like level. To prevent the distortion of similarity comparison, this paper further develops a similarity restoration approach such that the proposed JCF can obtain both advantages of quickness and accuracy. Performance studies confirm that the proposed JCF outperforms existing studies in terms of precision, recall and F1 score. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09208542
Volume :
80
Issue :
1
Database :
Complementary Index
Journal :
Journal of Supercomputing
Publication Type :
Academic Journal
Accession number :
174659420
Full Text :
https://doi.org/10.1007/s11227-023-05472-0