1. Supervised and Unsupervised Classification of Cocoa Bean Origin and Processing using Liquid Chromatography-Mass Spectrometry
- Author
-
Matthias S. Ullrich, Britta Behrends, Marc-Thorsten Huett, Santhust Kumar, Nikolai Kuhnert, Marcello Corno, and Roy N. Dsouza
- Subjects
business.industry ,Gaussian ,Pattern recognition ,COCOA BEAN ,Decomposition analysis ,food.food ,symbols.namesake ,Metabolomics ,food ,Liquid chromatography–mass spectrometry ,Principal component analysis ,symbols ,Artificial intelligence ,business ,Cluster analysis ,Multivariate classification ,Mathematics - Abstract
Liquid Chromatography-Mass Spectrometry (LC-MS) provides an unprecedented wealth of metabolomics information for food products, including insights into compositional changes during food processing. Here, we employed the largest available LC-MS dataset of around 300 cocoa bean samples to assess the capability of two popular multivariate classification methods, principal component analysis (PCA) and linear decomposition analysis (LDA), for studying bean geographic origin and responsible characteristic compounds.The unsupervised method, PCA, only provides a limited separation in bean origin. Expectedly, the supervised method, LDA, provides a better origin clustering. However, it suffers from a strong, nonlinear dependence on the set of compounds used in the analysis. We show that for LDA a compound filtering criterion based on Gaussian intensity distributions dramatically enhances origin clustering of samples, thus increasing its predictive efficiency. In this form, the supervised method of LDA holds the possibility to identify potential markers of a specific origin.
- Published
- 2020
- Full Text
- View/download PDF