Back to Search Start Over

Accurate Inference With Inaccurate RRAM Devices: A Joint Algorithm-Design Solution

Authors :
Rajiv V. Joshi
Gouranga Charan
Xiaocong Du
Gokul Krishnan
Abinash Mohanty
Yu Cao
Source :
IEEE Journal on Exploratory Solid-State Computational Devices and Circuits, Vol 6, Iss 1, Pp 27-35 (2020)
Publication Year :
2020
Publisher :
Institute of Electrical and Electronics Engineers (IEEE), 2020.

Abstract

Resistive random access memory (RRAM) is a promising technology for energy-efficient neuromorphic accelerators. However, when a pretrained deep neural network (DNN) model is programmed to an RRAM array for inference, the model suffers from accuracy degradation due to RRAM nonidealities, such as device variations, quantization error, and stuck-at-faults. Previous solutions involving multiple read–verify–write (R-V-W) to the RRAM cells require cell-by-cell compensation and, thus, an excessive amount of processing time. In this article, we propose a joint algorithm-design solution to mitigate the accuracy degradation. We first leverage knowledge distillation (KD), where the model is trained with the RRAM nonidealities to increase the robustness of the model under device variations. Furthermore, we propose random sparse adaptation (RSA), which integrates a small on-chip memory with the main RRAM array for postmapping adaptation. Only the on-chip memory is updated to recover the inference accuracy. The joint algorithm-design solution achieves the state-of-the-art accuracy of 99.41% for MNIST (LeNet-5) and 91.86% for CIFAR-10 (VGG-16) with up to 5% parameters as overhead while providing a 15– $150\times $ speedup compared with R-V-W.

Details

ISSN :
23299231
Volume :
6
Database :
OpenAIRE
Journal :
IEEE Journal on Exploratory Solid-State Computational Devices and Circuits
Accession number :
edsair.doi.dedup.....e5cadc87b9249320474618473990cb31
Full Text :
https://doi.org/10.1109/jxcdc.2020.2987605