1. Preserving Generalization of Language models in Few-shot Continual Relation Extraction
- Author
-
Tran, Quyen, Thanh, Nguyen Xuan, Anh, Nguyen Hoang, Hai, Nam Le, Le, Trung, Van Ngo, Linh, and Nguyen, Thien Huu
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Few-shot Continual Relations Extraction (FCRE) is an emerging and dynamic area of study where models can sequentially integrate knowledge from new relations with limited labeled data while circumventing catastrophic forgetting and preserving prior knowledge from pre-trained backbones. In this work, we introduce a novel method that leverages often-discarded language model heads. By employing these components via a mutual information maximization strategy, our approach helps maintain prior knowledge from the pre-trained backbone and strategically aligns the primary classification head, thereby enhancing model performance. Furthermore, we explore the potential of Large Language Models (LLMs), renowned for their wealth of knowledge, in addressing FCRE challenges. Our comprehensive experimental results underscore the efficacy of the proposed method and offer valuable insights for future work., Comment: Accepted to EMNLP 2024
- Published
- 2024