1. Using random forest to detect multiple inherited metabolic diseases simultaneously based on GC-MS urinary metabolomics.
- Author
-
Chen, Nan, Wang, Hai-Bo, Wu, Ben-Qing, Jiang, Jian-Hui, Yang, Jiang-Tao, Tang, Li-Juan, He, Hong-Qin, and Linghu, Dan-Dan
- Subjects
- *
RANDOM forest algorithms , *GENETIC disorders , *INBORN errors of metabolism , *METABOLIC disorders , *SUDDEN infant death syndrome , *URINE - Abstract
Inborn errors of metabolism, also known as inherited metabolic diseases (IMDs), are related to genetic mutations and cause corresponding biochemical metabolic disorder of newborns and even sudden infant death. Timely detection and diagnosis of IMDs are of great significance for improving survival of newborns. Here we propose a strategy for simultaneously detecting six types of IMDs via combining GC-MS technique with the random forest algorithm (RF). Clinical urine samples from IMD and healthy patients are analyzed using GC-MS for acquiring metabolomics data. Then, the RF model is established as a multi-classification tool for the GC-MS data. Compared with the models built by artificial neural network and support vector machine, the results demonstrated the RF model has superior performance of high specificity, sensitivity, precision, accuracy, and matthews correlation coefficients on identifying all six types of IMDs and normal samples. The proposed strategy can afford a useful method for reliable and effective identification of multiple IMDs in clinical diagnosis. [Display omitted] • A strategy was proposed for identifying multiple inherited metabolic diseases (IMDs). • We used combined GC-MS for acquiring metabolomics data with random forest for data handling. • The encouraging results implied the promise of the strategy for robust and reliable identification of multiple IMDs. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF