1. A comparative study of GPT-4o and human ophthalmologists in glaucoma diagnosis
- Author
-
Junxiu Zhang, Yao Ma, Rong Zhang, Yanhua Chen, Mengyao Xu, Su Rina, and Ke Ma
- Subjects
Medicine ,Science - Abstract
Abstract Artificial intelligence (AI), particularly large language models like GPT-4o, holds promise for enhancing diagnostic accuracy in healthcare. This study evaluates the diagnostic performance of GPT-4o compared to human ophthalmologists in glaucoma cases. A prospective, observational study was conducted at a tertiary care ophthalmology center. Twenty-six glaucoma cases, including both primary and secondary types, were selected from publicly available databases and institutional records. The cases were analyzed by GPT-4o and three ophthalmologists with varying levels of experience. The accuracy and completeness of primary and differential diagnoses were assessed using 10-point and 6-point Likert scales, respectively. Statistical analyses were performed using nonparametric methods, including the Kruskal–Wallis and Mann–Whitney U tests. GPT-4o was significantly less accurate in primary diagnosis compared to human ophthalmologists. Specifically, GPT-4o achieved a mean score of 5.500 (p
- Published
- 2024
- Full Text
- View/download PDF