Back to Search Start Over

Assessing the Performance of Machine Learning Methods Trained on Public Health Observational Data: A Case Study From COVID-19.

Authors :
Pigoli D
Baker K
Budd J
Butler L
Coppock H
Egglestone S
Gilmour SG
Holmes C
Hurley D
Jersakova R
Kiskin I
Koutra V
Mellor J
Nicholson G
Packham J
Patel S
Payne R
Roberts SJ
Schuller BW
Tendero-Cañadas A
Thornley T
Titcomb A
Source :
Statistics in medicine [Stat Med] 2024 Nov 10; Vol. 43 (25), pp. 4861-4871. Date of Electronic Publication: 2024 Sep 05.
Publication Year :
2024

Abstract

From early in the coronavirus disease 2019 (COVID-19) pandemic, there was interest in using machine learning methods to predict COVID-19 infection status based on vocal audio signals, for example, cough recordings. However, early studies had limitations in terms of data collection and of how the performances of the proposed predictive models were assessed. This article describes how these limitations have been overcome in a study carried out by the Turing-RSS Health Data Laboratory and the UK Health Security Agency. As part of the study, the UK Health Security Agency collected a dataset of acoustic recordings, SARS-CoV-2 infection status and extensive study participant meta-data. This allowed us to rigorously assess state-of-the-art machine learning techniques to predict SARS-CoV-2 infection status based on vocal audio signals. The lessons learned from this project should inform future studies on statistical evaluation methods to assess the performance of machine learning techniques for public health tasks.<br /> (© 2024 The Author(s). Statistics in Medicine published by John Wiley & Sons Ltd.)

Details

Language :
English
ISSN :
1097-0258
Volume :
43
Issue :
25
Database :
MEDLINE
Journal :
Statistics in medicine
Publication Type :
Academic Journal
Accession number :
39237100
Full Text :
https://doi.org/10.1002/sim.10211