Back to Search
Start Over
Is the patient speaking or the nurse? Automatic speaker type identification in patient-nurse audio recordings.
- Source :
-
Journal of the American Medical Informatics Association : JAMIA [J Am Med Inform Assoc] 2023 Sep 25; Vol. 30 (10), pp. 1673-1683. - Publication Year :
- 2023
-
Abstract
- Objectives: Patient-clinician communication provides valuable explicit and implicit information that may indicate adverse medical conditions and outcomes. However, practical and analytical approaches for audio-recording and analyzing this data stream remain underexplored. This study aimed to 1) analyze patients' and nurses' speech in audio-recorded verbal communication, and 2) develop machine learning (ML) classifiers to effectively differentiate between patient and nurse language.<br />Materials and Methods: Pilot studies were conducted at VNS Health, the largest not-for-profit home healthcare agency in the United States, to optimize audio-recording patient-nurse interactions. We recorded and transcribed 46 interactions, resulting in 3494 "utterances" that were annotated to identify the speaker. We employed natural language processing techniques to generate linguistic features and built various ML classifiers to distinguish between patient and nurse language at both individual and encounter levels.<br />Results: A support vector machine classifier trained on selected linguistic features from term frequency-inverse document frequency, Linguistic Inquiry and Word Count, Word2Vec, and Medical Concepts in the Unified Medical Language System achieved the highest performance with an AUC-ROC = 99.01 ± 1.97 and an F1-score = 96.82 ± 4.1. The analysis revealed patients' tendency to use informal language and keywords related to "religion," "home," and "money," while nurses utilized more complex sentences focusing on health-related matters and medical issues and were more likely to ask questions.<br />Conclusion: The methods and analytical approach we developed to differentiate patient and nurse language is an important precursor for downstream tasks that aim to analyze patient speech to identify patients at risk of disease and negative health outcomes.<br /> (© The Author(s) 2023. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.)
- Subjects :
- Humans
Communication
Linguistics
Machine Learning
Sound Recordings
Language
Subjects
Details
- Language :
- English
- ISSN :
- 1527-974X
- Volume :
- 30
- Issue :
- 10
- Database :
- MEDLINE
- Journal :
- Journal of the American Medical Informatics Association : JAMIA
- Publication Type :
- Academic Journal
- Accession number :
- 37478477
- Full Text :
- https://doi.org/10.1093/jamia/ocad139