Back to Search
Start Over
JCF: joint coarse- and fine-grained similarity comparison for plagiarism detection based on NLP.
- Source :
- Journal of Supercomputing; Jan2024, Vol. 80 Issue 1, p363-394, 32p
- Publication Year :
- 2024
-
Abstract
- Document similarity recognition is one of the most important problems in natural language processing. This paper proposes a plagiarism comparison mechanism called JCF. Initially, the TF–IDF scheme is applied to build a bag of words as the representation of the common features of all documents. Then, the plagiarism comparison is carried out in a coarse-grained manner, which speeds up the similarity comparison. Finally, the most similar documents can then be compared in detail based on a fine-grained approach. In addition, the JCF detects plagiarism at both syntax level and semantic-like level. To prevent the distortion of similarity comparison, this paper further develops a similarity restoration approach such that the proposed JCF can obtain both advantages of quickness and accuracy. Performance studies confirm that the proposed JCF outperforms existing studies in terms of precision, recall and F1 score. [ABSTRACT FROM AUTHOR]
- Subjects :
- PLAGIARISM
NATURAL language processing
Subjects
Details
- Language :
- English
- ISSN :
- 09208542
- Volume :
- 80
- Issue :
- 1
- Database :
- Complementary Index
- Journal :
- Journal of Supercomputing
- Publication Type :
- Academic Journal
- Accession number :
- 174659420
- Full Text :
- https://doi.org/10.1007/s11227-023-05472-0