1. AutoRank: Automated Rank Selection for Effective Neural Network Customization
- Author
-
Mohammad Samragh, Mojan Javaheripi, and Farinaz Koushanfar
- Subjects
Artificial neural network ,Computer science ,business.industry ,Deep learning ,Distributed computing ,Rank (computer programming) ,Inference ,Personalization ,Task (project management) ,Memory footprint ,Decomposition (computer science) ,Artificial intelligence ,Electrical and Electronic Engineering ,business - Abstract
Tensor decomposition is a promising approach for low-power and real-time application of neural networks on resource-constrained embedded devices. This paper proposes AutoRank, an end-to-end framework for customizing neural network decomposition using cross-layer rank-selection. For many-layer networks, determining the optimal decomposition ranks is a cumbersome task. To overcome this challenge, we establish a state-action-reward system that effectively absorbs inference accuracy and platform specifications into the rank-selection policy. Our proposed framework brings platform characteristics and performance in the customization loop to enable direct incorporation of hardware cost, e.g., runtime and memory footprint. By means of this hardware-awareness, AutoRank customization engine delivers high accuracy decomposed deep neural networks with low execution cost. Our framework minimizes the engineering cost associated with rank selection by providing an automated API for AutoRank that is compatible with popular deep learning libraries and can be readily used by developers.
- Published
- 2021