Back to Search Start Over

Sparsity Driven People Localization with a Heterogeneous Network of Cameras

Authors :
Alexandre Alahi
Yannick Boursier
Pierre Vandergheynst
Laurent Jacques
UCL - SST/ICTM/ELEN - Pôle en ingénierie électrique
Ecole Polytechnique Fédérale de Lausanne - LTS2
Université Aix-Marseille 2 - CPPM
Laboratoire de Traitement du signal [EPFL] / Signal Processing Laboratories (SP Lab)
Ecole Polytechnique Fédérale de Lausanne (EPFL)
Transportation and Mobility Laboratory
Laboratoire de Télécommunications et Télédétection [Louvain] (TELE)
Université Catholique de Louvain = Catholic University of Louvain (UCL)
Centre de Physique des Particules de Marseille (CPPM)
Aix Marseille Université (AMU)-Institut National de Physique Nucléaire et de Physique des Particules du CNRS (IN2P3)-Centre National de la Recherche Scientifique (CNRS)
LTS2 - EPFL
European Project: 225913,EC:FP7:ICT,FP7-ICT-2007-C,SMALL(2009)
Source :
Journal of Mathematical Imaging and Vision, Vol. 41, no. 1-2, p. 39-58 (2011), Journal of Mathematical Imaging and Vision, Journal of Mathematical Imaging and Vision, 2011, 41 (1-2), ⟨10.1007/s10851-010-0258-7⟩, Journal of Mathematical Imaging and Vision, Springer Verlag, 2011, 41 (1-2), ⟨10.1007/s10851-010-0258-7⟩
Publication Year :
2011
Publisher :
Springer New York LLC, 2011.

Abstract

In this paper, we propose to study the problem of localization of a dense set of people with a network of heterogeneous cameras. We propose to recast the problem as a linear inverse problem. The proposed framework is generic to any scene, scalable in the number of cameras and versatile with respect to their geometry, e.g. planar or omnidirectional. It relies on deducing an \emph {occupancy vector}, i.e. the discretized occupancy of people on the ground, from the noisy binary silhouettes observed as foreground pixels in each camera. This inverse problem is regularized by imposing a sparse occupancy vector, i.e. made of few non- zero elements, while a particular dictionary of silhouettes linearly maps these non-empty grid locations to the multiple silhouettes viewed by the cameras network. This constitutes a linearization of the problem, where non- linearities, such as occlusions, are treated as additional noise on the observed silhouettes. Mathematically, we express the final inverse problem either as Basis Pursuit DeNoise or Lasso convex optimization programs. The sparsity measure is reinforced by iteratively re-weighting the $\ell_1$-norm of the occupancy vector for better approximating its $\ell_0$ ``norm'', and a new kind of ``repulsive'' sparsity is used to adapt further the Lasso procedure to the occupancy reconstruction. Practically, an adaptive sampling process is proposed to reduce the computation cost and monitor a large occupancy area. Qualitative and quantitative results are presented on a basketball game. The proposed algorithm successfully detects people occluding each other given severely degraded extracted features, while outperforming state-of-the-art people localization techniques.

Details

Language :
English
ISSN :
09249907 and 15737683
Database :
OpenAIRE
Journal :
Journal of Mathematical Imaging and Vision, Vol. 41, no. 1-2, p. 39-58 (2011), Journal of Mathematical Imaging and Vision, Journal of Mathematical Imaging and Vision, 2011, 41 (1-2), ⟨10.1007/s10851-010-0258-7⟩, Journal of Mathematical Imaging and Vision, Springer Verlag, 2011, 41 (1-2), ⟨10.1007/s10851-010-0258-7⟩
Accession number :
edsair.doi.dedup.....adee3da4fc15743691256222a779e716
Full Text :
https://doi.org/10.1007/s10851-010-0258-7⟩