1. Data Synthesis for Alfalfa Biomass Yield Estimation.
- Author
-
Vance, Jonathan, Rasheed, Khaled, Missaoui, Ali, and Maier, Frederick W.
- Subjects
- *
BIOMASS estimation , *MACHINE learning , *ALFALFA , *CROP yields , *MACHINE performance , *DECISION trees - Abstract
Alfalfa is critical to global food security, and its data is abundant in the U.S. nationally, but often scarce locally, limiting the potential performance of machine learning (ML) models in predicting alfalfa biomass yields. Training ML models on local-only data results in very low estimation accuracy when the datasets are very small. Therefore, we explore synthesizing non-local data to estimate biomass yields labeled as high, medium, or low. One option to remedy scarce local data is to train models using non-local data; however, this only works about as well as using local data. Therefore, we propose a novel pipeline that trains models using data synthesized from non-local data to estimate local crop yields. Our pipeline, synthesized non-local training (SNLT pronounced like sunlight), achieves a gain of 42.9% accuracy over the best results from regular non-local and local training on our very small target dataset. This pipeline produced the highest accuracy of 85.7% with a decision tree classifier. From these results, we conclude that SNLT can be a useful tool in helping to estimate crop yields with ML. Furthermore, we propose a software application called Predict Your CropS (PYCS pronounced like Pisces) designed to help farmers and researchers estimate and predict crop yields based on pretrained models. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF