Integrating heterogeneous across-country data for proxy-based random forest prediction of enteric methane in dairy cattle

Authors :
European Commission
Negussie, Enyew [0000-0003-4892-9938]
González Recio, Oscar [0000-0002-9106-4063]
Battagin, Mara [0000-0001-7309-6793]
Bayat, Ali-Reza [0000-0002-4894-0662]
Boland, Tommy [0000-0002-7433-130X]
de Haas, Yvette [0000-0002-4331-4101]
García-Rodríguez, Aser [0000-0001-5519-6766]
Garnsworthy, Philip C [0000-0001-5131-3398]
Gengler, Nicolas [0000-0002-5981-5509]
Kreuzer, Michael [0000-0002-9978-1171]
Kuhla, Björn [0000-0002-2032-5502]
Lassen, Jan [0000-0002-1338-8644]
Peiren, Nico [0000-0001-5500-1607]
Pszczola, Marcin [0000-0003-2833-5083]
Schwarm, Angela [0000-0002-5750-2111]
Soyeurt, Hélène [0000-0001-9883-9047]
Vanlierde, Amélie [0000-0002-4619-1936]
Yan, Tianhai [0000-0002-1994-5202]
Biscarini, Filippo [0000-0002-3901-2354]
Publication Year :
2022

Abstract

Direct measurements of methane (CH4) from individual animals are difficult and expensive. Predictions based on proxies for CH4 are a viable alternative. Most prediction models are based on multiple linear regression (MLR) and on predictor variables that are not routinely available on commercial farms, such as dry matter intake (DMI) and diet composition. The use of machine learning (ML) algorithms to predict CH4 emissions from across-country heterogeneous data sets has not been reported. The objectives were to compare the performance of the ML ensemble algorithm random forest (RF) with that of MLR models in predicting CH4 emissions from proxies in dairy cows, and to assess the effects of imputing missing data points on prediction accuracy. Data on CH4 emissions and proxies for CH4 from 20 herds were provided by 10 countries. The integrated data set contained 43,519 records from 3,483 cows, with 18.7% missing data points imputed using k-nearest neighbor imputation. Three data sets were created: 3k (no missing records), 21k (missing DMI imputed from milk, fat, protein, and body weight), and 41k (missing DMI, milk fat, and protein records imputed). These data sets were used to test scenarios (with or without DMI; imputed vs. nonimputed DMI, milk fat, and protein) and prediction models (RF vs. MLR). Model predictive ability was evaluated within and between herds through 10-fold cross-validation. Prediction accuracy was measured as the correlation between observed and predicted CH4, root mean squared error (RMSE), and mean normalized discounted cumulative gain (NDCG). Compared with models without DMI, including DMI improved within- and between-herd prediction accuracy to 0.77 (RMSE = 23.3%) and 0.58 (RMSE = 31.9%), respectively, in RF and to 0.50 (RMSE = 0.327) and 0.13 (RMSE = 42.71), respectively, in MLR. When missing DMI records were imputed, within- and between-herd accuracy increased to 0.84 (RMSE = 18.5%) and 0.63 (RMSE = 29.9%), respectively. In all scenarios, RF models out-per
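
The three accuracy measures named in the abstract (correlation between observed and predicted CH4, RMSE, and NDCG) can be sketched in a few lines of Python. This is a minimal illustration, not the authors' code. The abstract does not say how RMSE was expressed as a percentage or how gains were defined for NDCG, so dividing RMSE by the observed mean and using observed CH4 as the gain are assumptions, and the function names `pearson`, `rmse_percent`, and `ndcg` are hypothetical.

```python
import math


def pearson(obs, pred):
    """Pearson correlation between observed and predicted values."""
    n = len(obs)
    mean_o = sum(obs) / n
    mean_p = sum(pred) / n
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(obs, pred))
    sd_o = math.sqrt(sum((o - mean_o) ** 2 for o in obs))
    sd_p = math.sqrt(sum((p - mean_p) ** 2 for p in pred))
    return cov / (sd_o * sd_p)


def rmse_percent(obs, pred):
    """RMSE as a percentage of the observed mean (assumed normalization)."""
    mse = sum((o - p) ** 2 for o, p in zip(obs, pred)) / len(obs)
    return 100.0 * math.sqrt(mse) / (sum(obs) / len(obs))


def ndcg(obs, pred, k=None):
    """NDCG of the ranking induced by pred, with observed values as gains.

    Using the observed CH4 value itself as the gain is an assumption.
    """
    k = len(obs) if k is None else k

    def dcg(gains):
        # Position i (0-based) is discounted by log2(i + 2).
        return sum(g / math.log2(i + 2) for i, g in enumerate(gains[:k]))

    # Observed values ordered by decreasing predicted value.
    ranked = [o for _, o in sorted(zip(pred, obs), key=lambda t: t[0], reverse=True)]
    ideal = sorted(obs, reverse=True)
    return dcg(ranked) / dcg(ideal)


# Toy values for illustration only; these are not data from the study.
observed = [300.0, 400.0, 500.0]
predicted = [310.0, 390.0, 520.0]
print(pearson(observed, predicted))
print(rmse_percent(observed, predicted))
print(ndcg(observed, predicted))
```

Correlation and RMSE measure how close each prediction is to its observed value, whereas NDCG scores only how well the predictions rank the animals, which is why the three are reported side by side.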

Details

Database :
OAIster
Notes :
English
Publication Type :
Electronic Resource
Accession number :
edsoai.on1333185370
Document Type :
Electronic Resource