1. Comparison of artificial intelligence large language model chatbots in answering frequently asked questions in anaesthesia
- Author
-
Teresa P. Nguyen, Brendan Carvalho, Hannah Sukhdeo, Kareem Joudi, Nan Guo, Marianne Chen, Jed T. Wolpaw, Jesse J. Kiefer, Melissa Byrne, Tatiana Jamroz, Allison A. Mootz, Sharon C. Reale, James Zou, and Pervez Sultan
- Subjects
anaesthesia frequently asked questions ,artificial intelligence ,Bing Chat ,chatbot ,Google Bard ,GPT ,Anesthesiology ,RD78.3-87.3 - Abstract
Background: Patients are increasingly using artificial intelligence (AI) chatbots to seek answers to medical queries. Methods: Ten frequently asked questions in anaesthesia were posed to three AI chatbots: ChatGPT4 (OpenAI), Bard (Google), and Bing Chat (Microsoft). Each chatbot's answers were evaluated in a randomised, blinded order by five residency programme directors from 15 medical institutions in the USA. Three medical content quality categories (accuracy, comprehensiveness, safety) and three communication quality categories (understandability, empathy/respect, and ethics) were scored between 1 and 5 (1 representing worst, 5 representing best). Results: ChatGPT4 and Bard outperformed Bing Chat (median [inter-quartile range] scores: 4 [3–4], 4 [3–4], and 3 [2–4], respectively; P
- Published
- 2024
- Full Text
- View/download PDF