Back to Search Start Over

Improving prediction model robustness with virtual sample construction for near-infrared spectra analysis.

Authors :
Hao, Yong
Li, Xiyan
Zhang, Chengxiang
Source :
Analytica Chimica Acta. Oct2023, Vol. 1279, pN.PAG-N.PAG. 1p.
Publication Year :
2023

Abstract

In a qualitative analysis of near-infrared spectroscopy (NIRS), when the samples to be analyzed are difficult to obtain or there are few counterexamples, the robustness of the models is poor, resulting in the decline of the generalization ability of the models. In this case, the effective method is to construct virtual samples to achieve the balance of categories. In this contribution, three virtual spectrum construction strategies including Synthetic Minority Oversampling Technique (SMOTE), Adaptive Synthetic Sampling (ADASYN), and Deep Convolutional Generative Adversarial Network (DCGAN) were explored to deal with the problem of insufficient or imbalanced sample numbers in NIRS analysis. The strategies were tested with the melamine and Yali pears two spectral datasets. The PLS-DA and Correct Recognition Rate (CRR) were used for discriminant model construction and accuracy evaluation, respectively. The results show that SMOTE, ADASYN, and DCGAN processing strategies can all improve the global CRR (CRR glob). The SMOTE and ADASYN can improve the CRR for majority class sample (CRR maj), but the CRR for minority class sample (CRR min) has decreased. For the DCGAN method, the CRR glob , CRR maj , and CRR min were all improved. The standard deviation of the results of the multiple parallel calculations demonstrates the robustness of DCGAN generation method. Therefore, the DCGAN method has good reliability and practicability, and can increase the robustness and generalization ability of the NIRS model. [Display omitted] • Exploring virtual spectrum construction strategies. • Evaluate the plausibility and diversity of the virtual spectrum. • Improve the robustness and generalization ability of the NIRS model. • Test model performance on public datasets and experimental data. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00032670
Volume :
1279
Database :
Academic Search Index
Journal :
Analytica Chimica Acta
Publication Type :
Academic Journal
Accession number :
172917213
Full Text :
https://doi.org/10.1016/j.aca.2023.341763