Back to Search
Start Over
Comparative analysis of generative LLMs for labeling entities in clinical notes.
- Source :
-
Genomics & informatics [Genomics Inform] 2025 Feb 06; Vol. 23 (1), pp. 3. Date of Electronic Publication: 2025 Feb 06. - Publication Year :
- 2025
-
Abstract
- This paper evaluates and compares different fine-tuned variations of generative large language models (LLM) in the zero-shot named entity recognition (NER) task for the clinical domain. As part of the 8th Biomedical Linked Annotation Hackathon, we examined Llama 2 and Mistral models, including base versions and those that have been fine-tuned for code, chat, and instruction-following tasks. We assess both the number of correctly identified entities and the models' ability to retrieve entities in structured formats. We used a publicly available set of clinical cases labeled with mentions of diseases, symptoms, and medical procedures for the evaluation. Results show that instruction fine-tuned models perform better than chat fine-tuned and base models in recognizing entities. It is also shown that models perform better when simple output structures are requested.<br />Competing Interests: Declarations. Consent for publication: Not applicable. Competing interests: The authors declare no competing interests.<br /> (© 2025. The Author(s).)
Details
- Language :
- English
- ISSN :
- 1598-866X
- Volume :
- 23
- Issue :
- 1
- Database :
- MEDLINE
- Journal :
- Genomics & informatics
- Publication Type :
- Academic Journal
- Accession number :
- 39915888
- Full Text :
- https://doi.org/10.1186/s44342-024-00036-x