Integrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattle

Authors :
Toshimi Baba
Lúcio Flávio Macedo Mota
Giovanni Bittante
Sara Pegolo
Alessio Cecchinato
Francisco Peñagaricano
Gota Morota
Virginia Polytechnic Institute and State University [Blacksburg]
Dipartimento di Agronomia Animali Alimenti Risorse Naturali e Ambiente
Universita degli Studi di Padova
University of Wisconsin-Madison
Source :
Genetics Selection Evolution (GSE), Vol 53, Iss 1, Article 29, Pp 1-14 (2021), BioMed Central. doi:10.1186/s12711-021-00620-7
Publication Year :
2021
Publisher :
BMC, 2021.

Abstract

Background: Over the past decade, Fourier transform infrared (FTIR) spectroscopy has been used to predict novel milk protein phenotypes. Genomic data might help predict these phenotypes when integrated with milk FTIR spectra. The objective of this study was to investigate prediction accuracy for milk protein phenotypes when heterogeneous on-farm, genomic, and pedigree data were integrated with the spectra. To this end, we used the records of 966 Italian Brown Swiss cows with milk FTIR spectra, on-farm information, medium-density genetic markers, and pedigree data. True and total whey protein, five casein traits, and two whey protein traits were analyzed. Multiple kernel learning constructed from spectral and genomic (pedigree) relationship matrices and multilayer BayesB assigning separate priors for FTIR and markers were benchmarked against a baseline partial least squares (PLS) regression. Seven combinations of covariates were considered, and their predictive abilities were evaluated by repeated random sub-sampling cross-validation (CV) and herd CV.

Results: Adding on-farm effects such as herd, days in milk, and parity to the spectral data improved predictions compared with those obtained using the spectra alone. Integrating genomics and/or the top three markers with a large effect further enhanced the predictions. Pedigree data also improved prediction, but to a lesser extent than genomic data. Multiple kernel learning and multilayer BayesB increased predictive performance, whereas PLS did not. Overall, multilayer BayesB provided better predictions than multiple kernel learning, and prediction performance was lower in herd CV than in repeated random sub-sampling CV.

Conclusions: Integration of genomic information with milk FTIR spectra can enhance milk protein trait predictions by 25% and 7% on average for repeated random sub-sampling and herd CV, respectively. Multiple kernel learning and multilayer BayesB outperformed PLS when used to integrate heterogeneous data for phenotypic predictions.
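
The kernel-integration idea and the two validation schemes named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the data are simulated, the kernels are plain linear kernels, and the fixed kernel weight (0.5) and ridge penalty (1.0) are assumptions chosen for illustration, whereas the study estimated the contribution of each data layer from the data. All function and variable names are hypothetical.

```python
import numpy as np

def linear_kernel(features):
    """Standardize columns and return an n x n similarity (relationship) matrix."""
    centered = features - features.mean(axis=0)
    scale = centered.std(axis=0)
    scale[scale == 0] = 1.0  # guard against constant columns
    standardized = centered / scale
    return standardized @ standardized.T / features.shape[1]

def kernel_ridge_predict(kernel, y, train, test, ridge=1.0):
    """Fit kernel ridge regression on the train rows and predict the test rows."""
    y_mean = y[train].mean()
    k_train = kernel[np.ix_(train, train)]
    alpha = np.linalg.solve(k_train + ridge * np.eye(len(train)), y[train] - y_mean)
    return kernel[np.ix_(test, train)] @ alpha + y_mean

def random_subsampling_splits(n, n_repeats, test_fraction, rng):
    """Repeated random sub-sampling: a fresh random train/test split per repeat."""
    n_test = int(round(test_fraction * n))
    for _ in range(n_repeats):
        perm = rng.permutation(n)
        yield perm[n_test:], perm[:n_test]

def herd_splits(herds):
    """Herd CV: each herd is held out in turn, so test cows come from unseen herds."""
    for herd in np.unique(herds):
        yield np.flatnonzero(herds != herd), np.flatnonzero(herds == herd)

def mean_accuracy(kernel, y, splits):
    """Average Pearson correlation between predictions and held-out phenotypes."""
    cors = []
    for train, test in splits:
        pred = kernel_ridge_predict(kernel, y, train, test)
        cors.append(np.corrcoef(pred, y[test])[0, 1])
    return float(np.mean(cors))

# Simulated stand-ins for spectra, marker genotypes (0/1/2), herds, and a phenotype.
rng = np.random.default_rng(0)
n_cows, n_wavenumbers, n_markers, n_herds = 120, 50, 200, 6
spectra = rng.normal(size=(n_cows, n_wavenumbers))
markers = rng.integers(0, 3, size=(n_cows, n_markers)).astype(float)
herds = np.repeat(np.arange(n_herds), n_cows // n_herds)
phenotype = (spectra[:, :5].sum(axis=1) + markers[:, :5].sum(axis=1)
             + rng.normal(size=n_cows))

# Combine the spectral and genomic kernels with an assumed fixed weight.
k_spectra = linear_kernel(spectra)
k_genomic = linear_kernel(markers)
weight = 0.5  # assumption: equal weighting, not estimated from the data
k_combined = weight * k_spectra + (1 - weight) * k_genomic

acc_random = mean_accuracy(
    k_combined, phenotype, random_subsampling_splits(n_cows, 10, 0.2, rng))
acc_herd = mean_accuracy(k_combined, phenotype, herd_splits(herds))
print(f"random sub-sampling CV: {acc_random:.2f}, herd CV: {acc_herd:.2f}")
```

Because the simulated phenotype carries no herd effect, the two schemes need not differ here; with real records, herd CV is the stricter test because herd-specific information is unavailable for the held-out cows.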

Details

Language :
English
ISSN :
1297-9686 and 0999-193X
Volume :
53
Issue :
1
Database :
OpenAIRE
Journal :
Genetics Selection Evolution
Accession number :
edsair.doi.dedup.....1350db0db8a654ab2aee504777ebb452