Back to Search Start Over

A 3.77TOPS/W Convolutional Neural Network Processor With Priority-Driven Kernel Optimization

Authors :
Jinshan Yue
Zhe Yuan
Chengmo Yang
Qingwei Guo
Huazhong Yang
Zhibo Wang
Jinyang Li
Yongpan Liu
Source :
IEEE Transactions on Circuits and Systems II: Express Briefs. 66:277-281
Publication Year :
2019
Publisher :
Institute of Electrical and Electronics Engineers (IEEE), 2019.

Abstract

Convolutional neural network (CNN) has become very popular in image classification tasks. With the increasing demand on intelligent classification on battery-powered devices, energy-efficient ASICs for CNN are badly needed. While previous CNN ASIC processors support operations of different kernel sizes, they sacrifice efficiency to support flexible convolution operations. In fact, convolution operations with a certain kernel size are dominating in many real-case CNNs. This brief proposes a kernel-optimized architecture for $3\,{\times }\,3$ kernels (KOP3), which are dominating operations in mainstream image classification CNNs. Although KOP3 aims at $3\,{\times }\,3$ kernel operations, it also provides programmability to support arbitrary kernel sizes. KOP3 achieves average energy efficiency of 3.77TOPS/W, which is $4.01{ \times }$ better than the best state-of-the-art CNN ASIC processor.

Details

ISSN :
15583791 and 15497747
Volume :
66
Database :
OpenAIRE
Journal :
IEEE Transactions on Circuits and Systems II: Express Briefs
Accession number :
edsair.doi...........4c9aaee0eb17e45aaa555fdc8196023a
Full Text :
https://doi.org/10.1109/tcsii.2018.2846698