Back to Search Start Over

Quality and Agreement With Scientific Consensus of ChatGPT Information Regarding Corneal Transplantation and Fuchs Dystrophy.

Authors :
Barclay KS
You JY
Coleman MJ
Mathews PM
Ray VL
Riaz KM
De Rojas JO
Wang AS
Watson SH
Koo EH
Eghrari AO
Source :
Cornea [Cornea] 2024 Jun 01; Vol. 43 (6), pp. 746-750. Date of Electronic Publication: 2023 Nov 28.
Publication Year :
2024

Abstract

Purpose: ChatGPT is a commonly used source of information by patients and clinicians. However, it can be prone to error and requires validation. We sought to assess the quality and accuracy of information regarding corneal transplantation and Fuchs dystrophy from 2 iterations of ChatGPT, and whether its answers improve over time.<br />Methods: A total of 10 corneal specialists collaborated to assess responses of the algorithm to 10 commonly asked questions related to endothelial keratoplasty and Fuchs dystrophy. These questions were asked from both ChatGPT-3.5 and its newer generation, GPT-4. Assessments tested quality, safety, accuracy, and bias of information. Chi-squared, Fisher exact tests, and regression analyses were conducted.<br />Results: We analyzed 180 valid responses. On a 1 (A+) to 5 (F) scale, the average score given by all specialists across questions was 2.5 for ChatGPT-3.5 and 1.4 for GPT-4, a significant improvement ( P < 0.0001). Most responses by both ChatGPT-3.5 (61%) and GPT-4 (89%) used correct facts, a proportion that significantly improved across iterations ( P < 0.00001). Approximately a third (35%) of responses from ChatGPT-3.5 were considered against the scientific consensus, a notable rate of error that decreased to only 5% of answers from GPT-4 ( P < 0.00001).<br />Conclusions: The quality of responses in ChatGPT significantly improved between versions 3.5 and 4, and the odds of providing information against the scientific consensus decreased. However, the technology is still capable of producing inaccurate statements. Corneal specialists are uniquely positioned to assist users to discern the veracity and application of such information.<br />Competing Interests: The authors have no funding or conflicts of interest to disclose.<br /> (Copyright © 2023 Wolters Kluwer Health, Inc. All rights reserved.)

Details

Language :
English
ISSN :
1536-4798
Volume :
43
Issue :
6
Database :
MEDLINE
Journal :
Cornea
Publication Type :
Academic Journal
Accession number :
38016014
Full Text :
https://doi.org/10.1097/ICO.0000000000003439