Back to Search
Start Over
Exploring the effectiveness of instruction tuning in biomedical language processing.
- Source :
-
Artificial intelligence in medicine [Artif Intell Med] 2024 Nov 07; Vol. 158, pp. 103007. Date of Electronic Publication: 2024 Nov 07. - Publication Year :
- 2024
- Publisher :
- Ahead of Print
-
Abstract
- Large Language Models (LLMs), particularly those similar to ChatGPT, have significantly influenced the field of Natural Language Processing (NLP). While these models excel in general language tasks, their performance in domain-specific downstream tasks such as biomedical and clinical Named Entity Recognition (NER), Relation Extraction (RE), and Medical Natural Language Inference (NLI) is still evolving. In this context, our study investigates the potential of instruction tuning for biomedical language processing, applying this technique to two general LLMs of substantial scale. We present a comprehensive, instruction-based model trained on a dataset that consists of approximately 200,000 instruction-focused samples. This dataset represents a carefully curated compilation of existing data, meticulously adapted and reformatted to align with the specific requirements of our instruction-based tasks. This initiative represents an important step in utilising such models to achieve results on par with specialised encoder-only models like BioBERT and BioClinicalBERT for various classical biomedical NLP tasks. Our work includes an analysis of the dataset's composition and its impact on model performance, providing insights into the intricacies of instruction tuning. By sharing our codes, models, and the distinctively assembled instruction-based dataset, we seek to encourage ongoing research and development in this area. <superscript>2</superscript> .<br />Competing Interests: Declaration of competing interest The authors declare the following financial relationships: This research was partially funded by the National Institute for Health Research (NIHR) Oxford Biomedical Research Centre (BRC) and by an InnoHK Project at the Hong Kong Centre for Cerebro-cardiovascular Health Engineering (COCHE). Omid Rohanian has received grant support from the Medical Research Council (grant number MR/W01761X/ 1). David A. Clifton is supported by an NIHR Research Professorship, an RAEng Research Chair, the UKRI, COCHE, and the Pandemic Sciences Institute at the University of Oxford. None of the authors have any employment, consultancies, stock ownership, honoraria, paid expert testimony, or patent applications/ registrations that could be considered as potential conflicts of interest affecting this work. The funding sources had no involvement in the study design, in the collection, analysis, and interpretation of data, in the writing of the report, or in the decision to submit the article for publication. None beyond the funding sources mentioned.<br /> (Copyright © 2024. Published by Elsevier B.V.)
Details
- Language :
- English
- ISSN :
- 1873-2860
- Volume :
- 158
- Database :
- MEDLINE
- Journal :
- Artificial intelligence in medicine
- Publication Type :
- Academic Journal
- Accession number :
- 39541861
- Full Text :
- https://doi.org/10.1016/j.artmed.2024.103007