Back to Search Start Over

Keyphrase Identification Using Minimal Labeled Data with Hierarchical Context and Transfer Learning

Authors :
Rohan Goli
Nina Hubig
Hua Min
Yang Gong
Dean Sitting
Lior Rennert
David Robinson
Paul Biondich
Adam Wright
Christian Nohr
Timothy Law
Arild Faxvaag
Aneesa Weaver
Ronald Gimbel
Xia Jing
Source :
Goli, R, Hubig, N, Min, H, Gong, Y, Sittig, D F, Rennert, L, Robinson, D, Biondich, P, Wright, A, Nøhr, C G, Law, T, Faxvaag, A, Weaver, A, Gimbel, R & Jing, X 2023, ' Keyphrase Identification Using Minimal Labeled Data with Hierarchical Context and Transfer Learning ', medRxiv . https://doi.org/10.1101/2023.01.26.23285060, medRxiv
Publication Year :
2023

Abstract

Interoperable clinical decision support system (CDSS) rules are a pathway to achieving interoperability which is a well-recognized challenge in health information technology. Building an ontology facilitates the creation of interoperable CDSS rules, which can be achieved by identifying the keyphrases (KP) from the existing literature. However, KP identification for labeling the data requires human expertise, consensus, and contextual understanding. This paper aims to present a semi-supervised framework for the CDSS using minimal labeled data based on hierarchical attention over the documents fused with domain adaptation approaches. Then, evaluate the effectiveness of KP identification with this framework. In the view of semi-supervised learning, our methodology toward building this framework outperforms the prior neural architectures by learning with document-level context, no explicit hand-crafted features, knowledge transfer from pre-trained models (on unlabeled corpus), and post-fine-tuning with smaller gold standard-labeled data. To the best of our knowledge, this is the first functional framework for the CDSS sub-domain to identify the KP, which is trained on limited labeled data. It contributes to the general natural language processing (NLP) architectures in areas such as clinical NLP, where manual data labeling is challenging.

Subjects

Subjects :
health informatics
Article

Details

Language :
English
Database :
OpenAIRE
Journal :
Goli, R, Hubig, N, Min, H, Gong, Y, Sittig, D F, Rennert, L, Robinson, D, Biondich, P, Wright, A, Nøhr, C G, Law, T, Faxvaag, A, Weaver, A, Gimbel, R & Jing, X 2023, ' Keyphrase Identification Using Minimal Labeled Data with Hierarchical Context and Transfer Learning ', medRxiv . https://doi.org/10.1101/2023.01.26.23285060, medRxiv
Accession number :
edsair.doi.dedup.....1abaf81870114fd6fd00b2dbc6b1d172