Back to Search Start Over

Regularized Bottleneck with Early Labeling

Authors :
Castellano, Gabriele
Pianese, Fabio
Carra, Damiano
Zhang, Tianzhu
Neglia, Giovanni
Nokia Bell Labs
Università degli studi di Verona = University of Verona (UNIVR)
Network Engineering and Operations (NEO )
Inria Sophia Antipolis - Méditerranée (CRISAM)
Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)
Source :
ITC 2022-34th International Teletraffic Congress, ITC 2022-34th International Teletraffic Congress, Sep 2022, Shenzhen, China
Publication Year :
2022
Publisher :
HAL CCSD, 2022.

Abstract

International audience; Small IoT devices, such as drones and lightweight battery-powered robots, are emerging as a major platform for the deployment of AI/ML capabilities. Autonomous and semiautonomous device operation relies on the systematic use of deep neural network models for solving complex tasks, such as image classification. The challenging restrictions of these devices in terms of computing capabilities, network connectivity, and power consumption are the main limits to the accuracy of latencysensitive inferences. This paper presents ReBEL, a split computing architecture enabling the dynamic remote offload of partial computations or, in alternative, a local approximate labeling based on a jointly-trained classifier. Our approach combines elements of head network distillation, early exit classification, and bottleneck injection with the goal of reducing the average endto-end latency of AI/ML inference on constrained IoT devices.

Details

Language :
English
Database :
OpenAIRE
Journal :
ITC 2022-34th International Teletraffic Congress, ITC 2022-34th International Teletraffic Congress, Sep 2022, Shenzhen, China
Accession number :
edsair.od.......165..009c8f66790389bbb91bd38492f3a387