Back to Search
Start Over
Assessing the Performance of Chatbots on the Taiwan Psychiatry Licensing Examination Using the Rasch Model.
- Source :
- Healthcare (2227-9032); Nov2024, Vol. 12 Issue 22, p2305, 11p
- Publication Year :
- 2024
-
Abstract
- Background/Objectives: The potential and limitations of chatbots in medical education and clinical decision support, particularly in specialized fields like psychiatry, remain unknown. By using the Rasch model, our study aimed to evaluate the performance of various state-of-the-art chatbots on psychiatry licensing exam questions to explore their strengths and weaknesses. Methods: We assessed the performance of 22 leading chatbots, selected based on LMArena benchmark rankings, using 100 multiple-choice questions from the 2024 Taiwan psychiatry licensing examination, a nationally standardized test required for psychiatric licensure in Taiwan. Chatbot responses were scored for correctness, and we used the Rasch model to evaluate chatbot ability. Results: Chatbots released after February 2024 passed the exam, with ChatGPT-o1-preview achieving the highest score of 85. ChatGPT-o1-preview showed a statistically significant superiority in ability (p < 0.001), with a 1.92 logits improvement compared to the passing threshold. It demonstrated strengths in complex psychiatric problems and ethical understanding, yet it presented limitations in up-to-date legal updates and specialized psychiatry knowledge, such as recent amendments to the Mental Health Act, psychopharmacology, and advanced neuroimaging. Conclusions: Chatbot technology could be a valuable tool for medical education and clinical decision support in psychiatry, and as technology continues to advance, these models are likely to play an increasingly integral role in psychiatric practice. [ABSTRACT FROM AUTHOR]
- Subjects :
- STATISTICAL models
MEDICAL education
PSYCHIATRY
CRONBACH'S alpha
DATA analysis
CLINICAL decision support systems
QUESTIONNAIRES
PROFESSIONAL licensure examinations
DESCRIPTIVE statistics
PROFESSIONS
ABILITY
STATISTICS
U.S. states
DATA analysis software
PSYCHOPHARMACOLOGY
NEURORADIOLOGY
MEDICAL ethics
GOVERNMENT regulation
Subjects
Details
- Language :
- English
- ISSN :
- 22279032
- Volume :
- 12
- Issue :
- 22
- Database :
- Complementary Index
- Journal :
- Healthcare (2227-9032)
- Publication Type :
- Academic Journal
- Accession number :
- 181166680
- Full Text :
- https://doi.org/10.3390/healthcare12222305