1. A Review of Spatiotemporal Models for Count Data in R Packages. A Case Study of COVID-19 Data
- Author
-
Marina Martinez-Garcia, María Ibáñez, and Amelia Simó
- Subjects
FOS: Computer and information sciences ,2019-20 coronavirus outbreak ,spatiotemporal models ,Coronavirus disease 2019 (COVID-19) ,Computer science ,General Mathematics ,Bayesian probability ,Machine learning ,computer.software_genre ,Statistics - Applications ,01 natural sciences ,010104 statistics & probability ,03 medical and health sciences ,0302 clinical medicine ,R packages ,Computer Science (miscellaneous) ,QA1-939 ,Applications (stat.AP) ,030212 general & internal medicine ,0101 mathematics ,Engineering (miscellaneous) ,Flexibility (engineering) ,Point (typography) ,business.industry ,COVID-19 ,count data ,Range (mathematics) ,Artificial intelligence ,Focus (optics) ,business ,computer ,Mathematics ,Count data - Abstract
Spatio-temporal models for count data are required in a wide range of scientific fields and they have become particularly crucial nowadays because of their ability to analyse COVID-19-related data. Models for count data are needed when the variable of interest take only non-negative integer values and these integers arise from counting occurrences. Several R-packages are currently available to deal with spatiotemporal areal count data. Each package focuses on different models and/or statistical methodologies. Unfortunately, the results generated by these models are rarely comparable due to differences in notation and methods. The main objective of this paper is to present a review describing the most important approaches that can be used to model and analyse count data when questions of scientific interest concern both their spatial and their temporal behaviour and we monitor their performance under the same data set. For this review, we focus on the three R-packages that can be used for this purpose and the different models assessed are representative of the two most widespread methodologies used to analyse spatiotemporal count data: the classical approach (based on Penalised Likelihood or Estimating Equations) and the Bayesian point of view. A case study is analysed as an illustration of these different methodologies. In this case study, these packages are used to model and predict daily hospitalisations from COVID-19 in 24 health regions within the Valencian Community (Spain), with data corresponding to the period from 28 June to 13 December 2020. Because of the current urgent need for monitoring and predicting data in the COVID-19 pandemic, this case study is, in itself, of particular importance and can be considered the secondary objective of this work. Satisfactory and promising results have been obtained in this second goal.
- Published
- 2021