Back to Search Start Over

MobileNetV2 with Spatial Attention module for traffic congestion recognition in surveillance images.

Authors :
Lin, Chenxi
Hu, Xiaojian
Zhan, Yunfei
Hao, Xiatong
Source :
Expert Systems with Applications. Dec2024:Part D, Vol. 255, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

Traffic congestion recognition is essential for road traffic condition monitoring and improving transportation operation efficiency. Recent works have proposed using computer vision to develop cost-effective traffic congestion detection systems. This paper proposed a custom framework for recognizing traffic congestion using a trained advanced CNNs model on highway surveillance images. The proposed CNNs model uses an attention mechanism that significantly helps the network performing better. Specifically, a spatial attention module is proposed to aggregate model features and increase representational power. Exploring the optimal model choice for edge device deployment: MobileNetV2 is considered the best model choice for this task due to its lowest FLOPs and params. Furthermore, the accuracy of MobileNetV2 with attention module on the test set exceeds that of several advanced CNNs models such as VGG16, GoogLeNet, ResNet50, Efficientnetb0, DenseNet121. For a selected dataset of real-world traffic congestion images, test accuracy of 98.58% strongly supports the model's efficiency in recognizing traffic congestion rendering image samples. In addition, proposed model also achieves 98.82% on the UCSD dataset, which as same as sort-of-the-art method. In summary, we have developed a congestion detection framework equipped with spatial attention module. Extensive experimental validation has demonstrated that using MobileNetV2 as the backbone network achieves state-of-the-art performance. The framework also uses the Grad-CAM approach for visual interpretability and localization of traffic congestion in surveillance images. Judging from the results of Grad-CAM, attention-integrated models can localize traffic congestion in surveillance images better. Our model is developed under the Pytorch framework using python3.8. • A deep learning framework for recognizing traffic congestion in real-world images. • A spatial attention module is proposed to extract features and improve accuracy. • Grad-CAM is used to highlight regions of the image. • High recall supports the reliability of model for congestion recognition task. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09574174
Volume :
255
Database :
Academic Search Index
Journal :
Expert Systems with Applications
Publication Type :
Academic Journal
Accession number :
179323133
Full Text :
https://doi.org/10.1016/j.eswa.2024.124701