Back to Search Start Over

Voice EHR: introducing multimodal audio data for health

Authors :
James Anibal
Hannah Huth
Ming Li
Lindsey Hazen
Veronica Daoud
Dominique Ebedes
Yen Minh Lam
Hang Nguyen
Phuc Vo Hong
Michael Kleinman
Shelley Ost
Christopher Jackson
Laura Sprabery
Cheran Elangovan
Balaji Krishnaiah
Lee Akst
Ioan Lina
Iqbal Elyazar
Lenny Ekawati
Stefan Jansen
Richard Nduwayezu
Charisse Garcia
Jeffrey Plum
Jacqueline Brenner
Miranda Song
Emily Ricotta
David Clifton
C. Louise Thwaites
Yael Bensoussan
Bradford Wood
Source :
Frontiers in Digital Health, Vol 6 (2025)
Publication Year :
2025
Publisher :
Frontiers Media S.A., 2025.

Abstract

IntroductionArtificial intelligence (AI) models trained on audio data may have the potential to rapidly perform clinical tasks, enhancing medical decision-making and potentially improving outcomes through early detection. Existing technologies depend on limited datasets collected with expensive recording equipment in high-income countries, which challenges deployment in resource-constrained, high-volume settings where audio data may have a profound impact on health equity.MethodsThis report introduces a novel protocol for audio data collection and a corresponding application that captures health information through guided questions.ResultsTo demonstrate the potential of Voice EHR as a biomarker of health, initial experiments on data quality and multiple case studies are presented in this report. Large language models (LLMs) were used to compare transcribed Voice EHR data with data (from the same patients) collected through conventional techniques like multiple choice questions. Information contained in the Voice EHR samples was consistently rated as equally or more relevant to a health evaluation.DiscussionThe HEAR application facilitates the collection of an audio electronic health record (“Voice EHR”) that may contain complex biomarkers of health from conventional voice/respiratory features, speech patterns, and spoken language with semantic meaning and longitudinal context–potentially compensating for the typical limitations of unimodal clinical datasets.

Details

Language :
English
ISSN :
2673253X
Volume :
6
Database :
Directory of Open Access Journals
Journal :
Frontiers in Digital Health
Publication Type :
Academic Journal
Accession number :
edsdoj.fb6e033d1c7467db8d98021c3e6d383
Document Type :
article
Full Text :
https://doi.org/10.3389/fdgth.2024.1448351