1. A New Dataset and Performance Evaluation of a Region-Based CNN for Urban Object Detection
- Author
-
Miguel Cazorla, Alex Dominguez-Sanchez, Jose Garcia-Rodriguez, Sergio Orts-Escolano, Universidad de Alicante. Departamento de Ciencia de la Computación e Inteligencia Artificial, and Robótica y Visión Tridimensional (RoViT)
- Subjects
0209 industrial biotechnology ,Computer Networks and Communications ,Computer science ,Feature extraction ,lcsh:TK7800-8360 ,02 engineering and technology ,Convolutional neural network ,Kernel (linear algebra) ,020901 industrial engineering & automation ,convolutional neural networks ,0202 electrical engineering, electronic engineering, information engineering ,Traffic sign recognition ,Computer vision ,Electrical and Electronic Engineering ,real-time object detection ,business.industry ,Deep learning ,Autonomous driving assistance system ,Detector ,lcsh:Electronics ,Ciencia de la Computación e Inteligencia Artificial ,urban object detector ,Object (computer science) ,Urban object detector ,Object detection ,autonomous driving assistance system ,Hardware and Architecture ,Control and Systems Engineering ,Signal Processing ,020201 artificial intelligence & image processing ,Real-time object detection ,Convolutional neural networks ,Artificial intelligence ,business - Abstract
In recent years, we have seen a large growth in the number of applications which use deep learning-based object detectors. Autonomous driving assistance systems (ADAS) are one of the areas where they have the most impact. This work presents a novel study evaluating a state-of-the-art technique for urban object detection and localization. In particular, we investigated the performance of the Faster R-CNN method to detect and localize urban objects in a variety of outdoor urban videos involving pedestrians, cars, bicycles and other objects moving in the scene (urban driving). We propose a new dataset that is used for benchmarking the accuracy of a real-time object detector (Faster R-CNN). Part of the data was collected using an HD camera mounted on a vehicle. Furthermore, some of the data is weakly annotated so it can be used for testing weakly supervised learning techniques. There already exist urban object datasets, but none of them include all the essential urban objects. We carried out extensive experiments demonstrating the effectiveness of the baseline approach. Additionally, we propose an R-CNN plus tracking technique to accelerate the process of real-time urban object detection. This work has been partially funded by the Spanish Government TIN2016-76515-R grant for the COMBAHO project, supported with Feder funds. It has also been supported by the University of Alicante project GRE16-19.
- Published
- 2018