Back to Search Start Over

Opening the Random Forest Black Box of 1H NMR Metabolomics Data by the Exploitation of Surrogate Variables

Authors :
Soeren Wenck
Thorsten Mix
Markus Fischer
Thomas Hackl
Stephan Seifert
Source :
Metabolites, Vol 13, Iss 10, p 1075 (2023)
Publication Year :
2023
Publisher :
MDPI AG, 2023.

Abstract

The untargeted metabolomics analysis of biological samples with nuclear magnetic resonance (NMR) provides highly complex data containing various signals from different molecules. To use these data for classification, e.g., in the context of food authentication, machine learning methods are used. These methods are usually applied as a black box, which means that no information about the complex relationships between the variables and the outcome is obtained. In this study, we show that the random forest-based approach surrogate minimal depth (SMD) can be applied for a comprehensive analysis of class-specific differences by selecting relevant variables and analyzing their mutual impact on the classification model of different truffle species. SMD allows the assignment of variables from the same metabolites as well as the detection of interactions between different metabolites that can be attributed to known biological relationships.

Details

Language :
English
ISSN :
22181989
Volume :
13
Issue :
10
Database :
Directory of Open Access Journals
Journal :
Metabolites
Publication Type :
Academic Journal
Accession number :
edsdoj.f92461e4eea14bd78dce0f041dc82941
Document Type :
article
Full Text :
https://doi.org/10.3390/metabo13101075