1. Dual Reconstructive Autoencoder for Crowd Localization and Estimation in Density and FIDT Maps
- Author
-
Felipe I. Lamas, Jorge E. Pezoa, Sebastian E. Godoy, Gabriel A. Saavedra, Sergio N. Torres, Gonzalo A. Montalva, and Weixiao Meng
- Subjects
Deep learning ,convolutional neural networks ,artificial intelligence ,cascaded autoencoders ,crowd localization/counting ,FIDT maps ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
This paper proposes crowd estimation technology to help authorities make the right decisions in times of crisis. Specifically, deep learning models have faced these challenges, achieving excellent results. In particular, the trend of using single-column Fully Convolutional Networks (FCNs) has increased in recent years. A typical architecture that meets these characteristics is the autoencoder. However, this model presents an intrinsic difficulty: the search for the optimal dimensionality of the latent space. In order to alleviate such difficulty, we propose a dual architecture consisting of two cascaded autoencoders. The first autoencoder is responsible for carrying out the masked reconstruction of the original images, whereas the second obtains crowd maps from the outputs of the first one. In this way, our architecture improves the location of people and crowds in Focal Inverse Distance Transform (FIDT) maps, resulting in more accurate count estimates than estimates obtained through a single autoencoder architecture.
- Published
- 2022
- Full Text
- View/download PDF