Back to Search
Start Over
AdaPyramid: Adaptive Pyramid for Accelerating High-Resolution Object Detection on Edge Devices
- Source :
- IEEE Transactions on Mobile Computing; August 2024, Vol. 23 Issue: 8 p8208-8224, 17p
- Publication Year :
- 2024
-
Abstract
- Deep convolutional neural network (NN)-based object detectors are not appropriate for straightforward inference on high-resolution videos at edge devices, as maintaining high accuracy often brings about prohibitively long latency. Although existing solutions have attempted to reduce on-device inference latency by selecting a cheaper configuration (e.g., choosing a more lightweight NN or scaling a frame to a smaller size before inference) or eliminating a background containing no object, they often ignore various high-resolution features and fail to optimize for those videos. We thus present AdaPyramid, a framework to reduce as much on-device inference latency as possible, especially for high-resolution videos, while achieving the accuracy demand approximately. We observe that the cheapest configuration to achieve the accuracy demand varies significantly across both different frames and different regions in a frame. The underlying reason is that object features (e.g., the location, size and category of objects) are more uneven in high-resolution videos, both temporally and spatially. Moreover, we observe that the object size presents a prominent hierarchical distribution in high-resolution frames. AdaPyramid thus partitions each frame hierarchically just like a pyramid and chooses a content-aware configuration for each region, which is adapted online based on the feedback. We evaluate the performance of AdaPyramid on a public dataset and our collected real-world videos. The obtained results show that under comparable accuracy to the state-of-the-art solutions, AdaPyramid can decrease inference latency by 40% on average, with up to 2.5× speed-up.
Details
- Language :
- English
- ISSN :
- 15361233
- Volume :
- 23
- Issue :
- 8
- Database :
- Supplemental Index
- Journal :
- IEEE Transactions on Mobile Computing
- Publication Type :
- Periodical
- Accession number :
- ejs66892220
- Full Text :
- https://doi.org/10.1109/TMC.2023.3343448