Back to Search Start Over

Diagnostic Accuracy of a Mobile AI-Based Symptom Checker and a Web-Based Self-Referral Tool in Rheumatology: Multicenter Randomized Controlled Trial

Authors :
Johannes Knitza
Koray Tascilar
Franziska Fuchs
Jacob Mohn
Sebastian Kuhn
Daniela Bohr
Felix Muehlensiepen
Christina Bergmann
Hannah Labinsky
Harriet Morf
Elizabeth Araujo
Matthias Englbrecht
Wolfgang Vorbrüggen
Cay-Benedict von der Decken
Stefan Kleinert
Andreas Ramming
Jörg H W Distler
Peter Bartz-Bazzanella
Nicolas Vuillerme
Georg Schett
Martin Welcker
Axel Hueber
Source :
Journal of Medical Internet Research, Vol 26, p e55542 (2024)
Publication Year :
2024
Publisher :
JMIR Publications, 2024.

Abstract

BackgroundThe diagnosis of inflammatory rheumatic diseases (IRDs) is often delayed due to unspecific symptoms and a shortage of rheumatologists. Digital diagnostic decision support systems (DDSSs) have the potential to expedite diagnosis and help patients navigate the health care system more efficiently. ObjectiveThe aim of this study was to assess the diagnostic accuracy of a mobile artificial intelligence (AI)–based symptom checker (Ada) and a web-based self-referral tool (Rheport) regarding IRDs. MethodsA prospective, multicenter, open-label, crossover randomized controlled trial was conducted with patients newly presenting to 3 rheumatology centers. Participants were randomly assigned to complete a symptom assessment using either Ada or Rheport. The primary outcome was the correct identification of IRDs by the DDSSs, defined as the presence of any IRD in the list of suggested diagnoses by Ada or achieving a prespecified threshold score with Rheport. The gold standard was the diagnosis made by rheumatologists. ResultsA total of 600 patients were included, among whom 214 (35.7%) were diagnosed with an IRD. Most frequent IRD was rheumatoid arthritis with 69 (11.5%) patients. Rheport’s disease suggestion and Ada’s top 1 (D1) and top 5 (D5) disease suggestions demonstrated overall diagnostic accuracies of 52%, 63%, and 58%, respectively, for IRDs. Rheport showed a sensitivity of 62% and a specificity of 47% for IRDs. Ada’s D1 and D5 disease suggestions showed a sensitivity of 52% and 66%, respectively, and a specificity of 68% and 54%, respectively, concerning IRDs. Ada’s diagnostic accuracy regarding individual diagnoses was heterogenous, and Ada performed considerably better in identifying rheumatoid arthritis in comparison to other diagnoses (D1: 42%; D5: 64%). The Cohen κ statistic of Rheport for agreement on any rheumatic disease diagnosis with Ada D1 was 0.15 (95% CI 0.08-0.18) and with Ada D5 was 0.08 (95% CI 0.00-0.16), indicating poor agreement for the presence of any rheumatic disease between the 2 DDSSs. ConclusionsTo our knowledge, this is the largest comparative DDSS trial with actual use of DDSSs by patients. The diagnostic accuracies of both DDSSs for IRDs were not promising in this high-prevalence patient population. DDSSs may lead to a misuse of scarce health care resources. Our results underscore the need for stringent regulation and drastic improvements to ensure the safety and efficacy of DDSSs. Trial RegistrationGerman Register of Clinical Trials DRKS00017642; https://drks.de/search/en/trial/DRKS00017642

Details

Language :
English
ISSN :
14388871
Volume :
26
Database :
Directory of Open Access Journals
Journal :
Journal of Medical Internet Research
Publication Type :
Academic Journal
Accession number :
edsdoj.2d37aa700fa44b96886fc7d0fb1095ed
Document Type :
article
Full Text :
https://doi.org/10.2196/55542