Back to Search Start Over

Identifying optimal variables for machine-learning-based fish distribution modeling.

Authors :
Xu, Shaohua
Wang, Jintao
Chen, Xinjun
Zhu, Jiangfeng
Source :
Canadian Journal of Fisheries & Aquatic Sciences. 2024, Vol. 81 Issue 6, p687-698. 12p.
Publication Year :
2024

Abstract

Machine learning occupies a central position in the modeling of fish distribution patterns. The augmentation of explanatory variables in fish habitat through many kinds of observational methodologies necessitates the discernment of an optimal combination of these variables for fish distribution modeling. We proposed a feature selection technique, recursive feature elimination with cross-validation (RFECV), to determine optimal variables combinations for yellowfin tuna distribution in the Pacific Ocean. Four tree-based models, random forest, eXtreme Gradient Boosting, Light Gradient Boosting Machine, and categorical boosting driven by RFECV, were developed using comprehensive fisheries and biotic/abiotic data. Habitat variables including sea temperature, dissolved oxygen concentration, chlorophyll-a concentration, sea salinity, and sea surface height were identified as significant features by all models. The models were trained using the corresponding selected variables, and these trained models were employed to predict the spatiotemporal distribution of yellowfin tuna from 1995 to 2019. The results obtained could inform useful knowledge for the sustainable exploitation of yellowfin tuna in the Pacific Ocean and furnish a benchmark of feature selection for machine-learning-based distribution modeling of other pelagic species. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
0706652X
Volume :
81
Issue :
6
Database :
Academic Search Index
Journal :
Canadian Journal of Fisheries & Aquatic Sciences
Publication Type :
Academic Journal
Accession number :
177719797
Full Text :
https://doi.org/10.1139/cjfas-2023-0197