Back to Search Start Over

BlendCSE: Blend contrastive learnings for sentence embeddings with rich semantics and transferability.

Authors :
Xu, Jiahao
Zhanyi, Charlie Soh
Xu, Liwen
Chen, Lihui
Source :
Expert Systems with Applications. Mar2024:Part E, Vol. 238, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

Sentence representation is one of the most fundamental research topics in natural language processing (NLP), as its quality directly affects various downstream task performances. Recent studies for sentence representations have established state-of-the-art (SOTA) performance on semantic representation tasks. However, embeddings by those approaches share unsatisfying transferability when applied to various specific applications. Seldom work studies the transferability of semantic sentence embeddings. In this paper, we first explore the transferability characteristic of the sentence embeddings, and present BlendCSE, a new sentence embedding model targeting rich semantics and transferability. BlendCSE blends three recent advanced NLP learning methodologies, namely, continue learning on masked language modeling (MLM), contrastive learning (CL) with data augmentations (DA), and semantic supervised learning. The main objectives of BlendCSE are to capture token/word level information, diversified linguistic properties, and sentence semantics, respectively. Empirical studies demonstrate that BlendCSE captures semantics comparably well on STS tasks, yet surpasses existing methods on various transfer tasks, yielding even stronger transferability on document-level applications. Ablation studies verified that the three learning objectives synergy well to capture semantics and transferability effectively. • Sentence representation for rich semantics and strong transferability. • Sentence representation via contrastive learning compatible with data augmentation. • Blends supervised, self-supervised learning and masked language modeling. • Achieves state-of-the-art for downstream tasks and document applications. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09574174
Volume :
238
Database :
Academic Search Index
Journal :
Expert Systems with Applications
Publication Type :
Academic Journal
Accession number :
173726906
Full Text :
https://doi.org/10.1016/j.eswa.2023.121909