Back to Search Start Over

Assessment of Pathology Domain-Specific Knowledge of ChatGPT and Comparison to Human Performance.

Authors :
Wang AY
Lin S
Tran C
Homer RJ
Wilsdon D
Walsh JC
Goebel EA
Sansano I
Sonawane S
Cockenpot V
Mukhopadhyay S
Taskin T
Zahra N
Cima L
Semerci O
Özamrak BG
Mishra P
Vennavalli NS
Chen PC
Cecchini MJ
Source :
Archives of pathology & laboratory medicine [Arch Pathol Lab Med] 2024 Oct 01; Vol. 148 (10), pp. 1152-1158.
Publication Year :
2024

Abstract

Context.—: Artificial intelligence algorithms hold the potential to fundamentally change many aspects of society. Application of these tools, including the publicly available ChatGPT, has demonstrated impressive domain-specific knowledge in many areas, including medicine.<br />Objectives.—: To understand the level of pathology domain-specific knowledge for ChatGPT using different underlying large language models, GPT-3.5 and the updated GPT-4.<br />Design.—: An international group of pathologists (n = 15) was recruited to generate pathology-specific questions at a similar level to those that could be seen on licensing (board) examinations. The questions (n = 15) were answered by GPT-3.5, GPT-4, and a staff pathologist who recently passed their Canadian pathology licensing exams. Participants were instructed to score answers on a 5-point scale and to predict which answer was written by ChatGPT.<br />Results.—: GPT-3.5 performed at a similar level to the staff pathologist, while GPT-4 outperformed both. The overall score for both GPT-3.5 and GPT-4 was within the range of meeting expectations for a trainee writing licensing examinations. In all but one question, the reviewers were able to correctly identify the answers generated by GPT-3.5.<br />Conclusions.—: By demonstrating the ability of ChatGPT to answer pathology-specific questions at a level similar to (GPT-3.5) or exceeding (GPT-4) a trained pathologist, this study highlights the potential of large language models to be transformative in this space. In the future, more advanced iterations of these algorithms with increased domain-specific knowledge may have the potential to assist pathologists and enhance pathology resident training.<br />Competing Interests: Chen is an employee of Need Inc and owns Need Inc equity. The other authors have no relevant financial interest in the products or companies described in this article.<br /> (© 2024 College of American Pathologists.)

Details

Language :
English
ISSN :
1543-2165
Volume :
148
Issue :
10
Database :
MEDLINE
Journal :
Archives of pathology & laboratory medicine
Publication Type :
Academic Journal
Accession number :
38244054
Full Text :
https://doi.org/10.5858/arpa.2023-0296-OA