Back to Search Start Over

Automatic Construction of Fine-Grained Paraphrase Corpora System Using Language Inference Model

Authors :
Ying Zhou
Xiaokang Hu
Vera Chung
Source :
Applied Sciences, Vol 12, Iss 1, p 499 (2022)
Publication Year :
2022
Publisher :
MDPI AG, 2022.

Abstract

Paraphrase detection and generation are important natural language processing (NLP) tasks. Yet the term paraphrase is broad enough to include many fine-grained relations. This leads to different tolerance levels of semantic divergence in the positive paraphrase class among publicly available paraphrase datasets. Such variation can affect the generalisability of paraphrase classification models. It may also impact the predictability of paraphrase generation models. This paper presents a new model which can use few corpora of fine-grained paraphrase relations to construct automatically using language inference models. The fine-grained sentence level paraphrase relations are defined based on word and phrase level counterparts. We demonstrate that the fine-grained labels from our proposed system can make it possible to generate paraphrases at desirable semantic level. The new labels could also contribute to general sentence embedding techniques.

Details

Language :
English
ISSN :
12010499 and 20763417
Volume :
12
Issue :
1
Database :
Directory of Open Access Journals
Journal :
Applied Sciences
Publication Type :
Academic Journal
Accession number :
edsdoj.f9d4353e078f45dda25dd320381cc65e
Document Type :
article
Full Text :
https://doi.org/10.3390/app12010499