1. Visual Object Detection and Tracking for Internet of Things Devices Based on Spatial Attention Powered Multidomain Network
- Author
-
Yinling Wang, Lei Yu, Yong Yang, Imran Khan, Hongdan Shen, and Gao Haining
- Subjects
Class (computer programming) ,Computer Networks and Communications ,Computer science ,business.industry ,Distributed computing ,Frame (networking) ,Tracking (particle physics) ,Convolutional neural network ,Object detection ,Computer Science Applications ,Cross entropy ,Hardware and Architecture ,Software deployment ,Signal Processing ,Internet of Things ,business ,Information Systems - Abstract
Internet of Things (IoT) has brought changes in many fields by joining physical space with the cyber space. The IoT devices are becoming increasingly complex. With the rapid deployment of cameras, tasks in IoT like visual information are more important, but IoT devices have limited computing resources, including power, computing ability, storage, etc. Some tasks that might be perfectly normal to perform on a computer would be rather challenging on an IoT device. Therefore, how to maintain acceptable performance while minimizing resources is becoming a more consequential part in IoT. In the paper, we aim to solve the problem of object detection and tracking in IoT while minimizing resources. The traditional algorithms need to use convolutional neural network (CNN) to identify different objects in each frame, and then determine the tracking target from identified objects, which typically requires a lot of computing resources. By incorporating spatial attention, and multi-domain network, we proposed a novel algorithm named as Spatial Attention Powered Multi-Domain Network (SA-MDNet). By adding spatial attention mechanism to the original MDNet model, and using multi class cross entropy loss, we are able to distinguish the background and the target in different video sequences effectively and efficiently. This novel algorithm achieves similar performance on the OTB 50/100/2013 datasets compared to several state-of-art models, while uses only a fraction of the memory compared to MDNet.
- Published
- 2023