Back to Search Start Over

Building the Model: Challenges and Considerations of Developing and Implementing Machine Learning Tools for Clinical Laboratory Medicine Practice.

Authors :
He S. Yang
Rhoads, Daniel D.
Sepulveda, Jorge
Chengxi Zang
Chadburn, Amy
Fei Wang
Source :
Archives of Pathology & Laboratory Medicine. Jul2023, Vol. 147 Issue 7, p826-836. 11p. 2 Diagrams, 1 Chart, 1 Graph.
Publication Year :
2023

Abstract

* Context.--Machine learning (ML) allows for the analysis of massive quantities of high-dimensional clinical laboratory data, thereby revealing complex patterns and trends. Thus, ML can potentially improve the efficiency of clinical data interpretation and the practice of laboratory medicine. However, the risks of generating biased or unrepresentative models, which can lead to misleading clinical conclusions or overestimation of the model performance, should be recognized. Objectives.--To discuss the major components for creating ML models, including data collection, data preprocessing, model development, and model evaluation. We also highlight many of the challenges and pitfalls in developing ML models, which could result in misleading clinical impressions or inaccurate model performance, and provide suggestions and guidance on how to circumvent these challenges. Data Sources.--The references for this review were identified through searches of the PubMed database, US Food and Drug Administration white papers and guidelines, conference abstracts, and online preprints. Conclusions.--With the growing interest in developing and implementing ML models in clinical practice, laboratorians and clinicians need to be educated in order to collect sufficiently large and high-quality data, properly report the data set characteristics, and combine data from multiple institutions with proper normalization. They will also need to assess the reasons for missing values, determine the inclusion or exclusion of outliers, and evaluate the completeness of a data set. In addition, they require the necessary knowledge to select a suitable ML model for a specific clinical question and accurately evaluate the performance of the ML model, based on objective criteria. Domain-specific knowledge is critical in the entire workflow of developing ML models. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00039985
Volume :
147
Issue :
7
Database :
Academic Search Index
Journal :
Archives of Pathology & Laboratory Medicine
Publication Type :
Academic Journal
Accession number :
164738850
Full Text :
https://doi.org/10.5858/arpa.2021-0635-RA