Back to Search Start Over

GPT-4 outperforms ChatGPT in answering non-English questions related to cirrhosis

Authors :
Yee Hui Yeo
Jamil S. Samaan
Wee Han Ng
Xiaoyan Ma
Peng-Sheng Ting
Min-Sun Kwak
Arturo Panduro
Blanca Lizaola-Mayo
Hirsh Trivedi
Aarshi Vipani
Walid Ayoub
Ju Dong Yang
Omer Liran
Brennan Spiegel
Alexander Kuo
Publication Year :
2023
Publisher :
Cold Spring Harbor Laboratory, 2023.

Abstract

Background and ObjectivesArtificial intelligence is increasingly being employed in healthcare, raising concerns about the exacerbation of disparities. This study evaluates ChatGPT and GPT-4’s ability to comprehend and respond to cirrhosis-related questions in English, Korean, Mandarin, and Spanish, addressing language barriers that may impact patient care.MethodsA set of 36 cirrhosis-related questions were translated into Korean, Mandarin, and Spanish and prompted to both ChatGPT and GPT-4 models. Non-English responses were graded by native-speaking hepatologists on accuracy and similarity to English responses. Chi-square tests were used to compare the proportions of grading between ChatGPT and GPT-4.ResultsGPT-4 showed a marked improvement in the proportion of comprehensive and correct answers compared to ChatGPT across all four languages (pConclusionsGPT-4 exhibited significantly higher accuracy in English and non-English cirrhosis-related questions, highlighting its potential for more accurate and reliable language model applications in diverse linguistic contexts. These advancements have important implications for patients with language discordance, contributing to equalizing health literacy on a global scale.

Details

Database :
OpenAIRE
Accession number :
edsair.doi...........026668830fb9cffa5453d66c3dd168ea
Full Text :
https://doi.org/10.1101/2023.05.04.23289482