Back to Search Start Over

Simulating A/B testing versus SMART designs for LLM-driven patient engagement to close preventive care gaps.

Authors :
Basu S
Schillinger D
Patel SY
Rigdon J
Source :
NPJ digital medicine [NPJ Digit Med] 2024 Nov 18; Vol. 7 (1), pp. 322. Date of Electronic Publication: 2024 Nov 18.
Publication Year :
2024

Abstract

Population health initiatives often rely on cold outreach to close gaps in preventive care, such as overdue screenings or immunizations. Tailoring messages to diverse patient populations remains challenging, as traditional A/B testing requires large sample sizes to test only two alternative messages. With increasing availability of large language models (LLMs), programs can utilize tiered testing among both LLM and manual human agents, presenting the dilemma of identifying which patients need different levels of human support to cost-effectively engage large populations. Using microsimulations, we compared both the statistical power and false positive rates of A/B testing and Sequential Multiple Assignment Randomized Trials (SMART) for developing personalized communications across multiple effect sizes and sample sizes. SMART showed better cost-effectiveness and net benefit across all scenarios, but superior power for detecting heterogeneous treatment effects (HTEs) only in later randomization stages, when populations were more homogeneous and subtle differences drove engagement differences.<br />Competing Interests: Competing interests S.B. and S.Y.P. receive stock options and salaries from Waymark, which engages in patient outreach. The remaining authors declare no competing interests.<br /> (© 2024. The Author(s).)

Details

Language :
English
ISSN :
2398-6352
Volume :
7
Issue :
1
Database :
MEDLINE
Journal :
NPJ digital medicine
Publication Type :
Academic Journal
Accession number :
39558021
Full Text :
https://doi.org/10.1038/s41746-024-01330-2