1. Developing workflow and diagnostics for model selection of a vector autoregressive spatiotemporal (VAST) model in comparison to design-based indices.
- Author
-
Cacciapaglia, Christopher, Brooks, Elizabeth N., Adams, Charles F., Legault, Christopher M., Perretti, Charles T., and Hart, Deborah
- Subjects
- *
FISH populations , *GAMMA distributions , *WORKFLOW , *POISSON distribution - Abstract
Modeling spatiotemporal variability of fish stocks is important for interpreting changes in magnitude and distribution over time. Biomass indices for four stocks with differences in life history, spatial extent, proportion of zero observations, biomass trajectory, and stock status were modeled using the Vector Autoregressive Spatio-Temporal (VAST) model. We iteratively tested model settings following a workflow developed to find the best model for each stock in the context of developing an absolute index of abundance from fisheries independent data. The best model was determined based on AIC, RMSE, cross-validation, and several metrics of deviation from design-based estimates if one is available for comparison. Biomass indices from the best VAST model had good agreement with design-based indices in three of the stocks. Improvements in model fit tended to asymptote at around 1000 knots for all stocks. The generalized gamma error distribution with a logit link was the best model for two of the stocks, and the gamma error distribution with Poisson link was the best model for the other two. Occasionally a model was selected with much larger estimated biomass compared to the design-based estimates, and metrics based on comparisons with design-based estimates helped resolve model selection in these instances. Simulation self-tests were performed from the best model and fitted with several estimation models, and the model selection tools comparing fitted models to design-based indices calculated from the simulated data were found to be robust in recovering the specifications of the original model. Additional visualizations for spatial residuals and clustering were used to aid interpretation of model fit and to highlight latent spatial patterns. These metrics may be useful in identifying important covariates. We added a depth covariate to the best model for each stock to demonstrate how future model building might proceed. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF