Back to Search Start Over

Learning to Quantize Deep Neural Networks: A Competitive-Collaborative Approach.

Authors :
Faysal Khan, Md Fahim
Kamani, Mohammad Mahdi
Mahdavi, Mehrdad
Narayanan, Vijaykrishnan
Source :
DAC: Annual ACM/IEEE Design Automation Conference; 2020, Issue 57, p287-292, 6p
Publication Year :
2020

Abstract

Reducing the model size and computation costs for dedicated AI accelerator designs, neural network quantization methods have attracted momentous attention recently. Unfortunately, merely minimizing quantization loss using constant discretization causes accuracy deterioration. In this paper, we propose an iterative accuracy-driven learning framework of competitive-collaborative quantization (CCQ) to gradually adapt the bit-precision of each individual layer. Orthogonal to prior quantization policies working with full precision for the first and last layers of the network, CCQ offers layer-wise competition for any target quantization policy with holistic layer fine-tuning to recover accuracy, where the state-of-the-art networks can be entirely quantized without any significant accuracy degradation. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
0738100X
Issue :
57
Database :
Complementary Index
Journal :
DAC: Annual ACM/IEEE Design Automation Conference
Publication Type :
Conference
Accession number :
155539683