Back to Search Start Over

Static Scheduling of Weight Programming for DNN Acceleration with Resource Constrained PIM.

Authors :
Gao, Xin
Wang, Hongyue
Chen, Yiyan
Zhang, Yuhao
Shen, Zhaoyan
Ju, Lei
Source :
ACM Transactions on Embedded Computing Systems; Nov2024, Vol. 23 Issue 6, p1-22, 22p
Publication Year :
2024

Abstract

Most existing architectural studies on ReRAM-based processing-in-memory (PIM) DNN accelerators assume that all weights of the DNN can be mapped to the crossbar at once. However, these studies are over-idealized. ReRAM crossbar resources for calculation are limited because of technological limitations, so multiple weight mapping procedures are required during the inference process. In this article, we propose a static scheduling framework which generates the mapping between DNN weights and ReRAM cells with minimum runtime weight programming cost. We first build a ReRAM crossbar programming latency model by simultaneously considering the DNN weight patterns, ReRAM programming operations, and PIM architecture characteristics. Then, the model is used in the searching process to obtain an optimized weight-to-OU mapping table with minimum online programming latency. Finally, an OU scheduler is used to coordinate the activation sequences of OUs in the crossbars to perform the inference computation correctly. Evaluation results show the proposed framework significantly reduces the weight programming overhead and the overall inference latency for various DNN models with different input datasets. [ABSTRACT FROM AUTHOR]

Subjects

Subjects :
SCHEDULING
COST

Details

Language :
English
ISSN :
15399087
Volume :
23
Issue :
6
Database :
Complementary Index
Journal :
ACM Transactions on Embedded Computing Systems
Publication Type :
Academic Journal
Accession number :
179806976
Full Text :
https://doi.org/10.1145/3615657