Back to Search Start Over

A Stacking Ensemble Learning Framework for Genomic Prediction

Authors :
Mang Liang
Tianpeng Chang
Bingxing An
Xinghai Duan
Lili Du
Xiaoqiao Wang
Jian Miao
Lingyang Xu
Xue Gao
Lupei Zhang
Junya Li
Huijiang Gao
Source :
Frontiers in Genetics, Vol 12 (2021)
Publication Year :
2021
Publisher :
Frontiers Media S.A., 2021.

Abstract

Machine learning (ML) is perhaps the most useful tool for the interpretation of large genomic datasets. However, the performance of a single machine learning method in genomic selection (GS) is currently unsatisfactory. To improve the genomic predictions, we constructed a stacking ensemble learning framework (SELF), integrating three machine learning methods, to predict genomic estimated breeding values (GEBVs). The present study evaluated the prediction ability of SELF by analyzing three real datasets, with different genetic architecture; comparing the prediction accuracy of SELF, base learners, genomic best linear unbiased prediction (GBLUP) and BayesB. For each trait, SELF performed better than base learners, which included support vector regression (SVR), kernel ridge regression (KRR) and elastic net (ENET). The prediction accuracy of SELF was, on average, 7.70% higher than GBLUP in three datasets. Except for the milk fat percentage (MFP) traits, of the German Holstein dairy cattle dataset, SELF was more robust than BayesB in all remaining traits. Therefore, we believed that SEFL has the potential to be promoted to estimate GEBVs in other animals and plants.

Details

Language :
English
ISSN :
16648021
Volume :
12
Database :
Directory of Open Access Journals
Journal :
Frontiers in Genetics
Publication Type :
Academic Journal
Accession number :
edsdoj.7a7b1fab9e34cc2bbb5aebb8efc3b48
Document Type :
article
Full Text :
https://doi.org/10.3389/fgene.2021.600040