Back to Search
Start Over
Automatic Construction of Fine-Grained Paraphrase Corpora System Using Language Inference Model
- Source :
- Applied Sciences, Vol 12, Iss 1, p 499 (2022)
- Publication Year :
- 2022
- Publisher :
- MDPI AG, 2022.
-
Abstract
- Paraphrase detection and generation are important natural language processing (NLP) tasks. Yet the term paraphrase is broad enough to include many fine-grained relations. This leads to different tolerance levels of semantic divergence in the positive paraphrase class among publicly available paraphrase datasets. Such variation can affect the generalisability of paraphrase classification models. It may also impact the predictability of paraphrase generation models. This paper presents a new model which can use few corpora of fine-grained paraphrase relations to construct automatically using language inference models. The fine-grained sentence level paraphrase relations are defined based on word and phrase level counterparts. We demonstrate that the fine-grained labels from our proposed system can make it possible to generate paraphrases at desirable semantic level. The new labels could also contribute to general sentence embedding techniques.
Details
- Language :
- English
- ISSN :
- 12010499 and 20763417
- Volume :
- 12
- Issue :
- 1
- Database :
- Directory of Open Access Journals
- Journal :
- Applied Sciences
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.f9d4353e078f45dda25dd320381cc65e
- Document Type :
- article
- Full Text :
- https://doi.org/10.3390/app12010499