Back to Search Start Over

SWAP: A Server-Scale Communication-Aware Chiplet-Based Manycore PIM Accelerator.

Authors :
Sharma, Harsh
Mandal, Sumit K.
Doppa, Janardhan Rao
Ogras, Umit Y.
Pande, Partha Pratim
Source :
IEEE Transactions on Computer-Aided Design of Integrated Circuits & Systems; Nov2022, Vol. 41 Issue 11, p4145-4156, 12p
Publication Year :
2022

Abstract

Processing-in-memory (PIM) is a promising technique to accelerate deep learning (DL) workloads. Emerging DL workloads (e.g., ResNet with 152 layers) consist of millions of parameters, which increase the area and fabrication cost of monolithic PIM accelerators. The fabrication cost challenge can be addressed by 2.5-D systems integrating multiple PIM chiplets connected through a network-on-package (NoP). However, server-scale scenarios simultaneously execute multiple compute-heavy DL workloads, leading to significant interchiplet data volume. State-of-the-art NoP architectures proposed in the literature do not consider the nature of DL workloads. In this article, we propose a novel server scale 2.5-D manycore architecture called SWAP that accounts for the traffic characteristics of DL applications. Comprehensive experimental evaluations with different system sizes as well as diverse emerging DL workloads demonstrate that SWAP achieves significant performance and energy consumption improvements with much lower fabrication cost than state-of-the-art NoP topologies. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
02780070
Volume :
41
Issue :
11
Database :
Complementary Index
Journal :
IEEE Transactions on Computer-Aided Design of Integrated Circuits & Systems
Publication Type :
Academic Journal
Accession number :
160652687
Full Text :
https://doi.org/10.1109/TCAD.2022.3197500