Back to Search Start Over

PETRA: A 22nm 6.97TFLOPS/W AIB-Enabled Configurable Matrix and Convolution Accelerator Integrated with an Intel Stratix 10 FPGA

Authors :
Sung-Gun Cho
Wei Tang
Chester Liu
Zhengya Zhang
Source :
VLSI Circuits
Publication Year :
2021
Publisher :
IEEE, 2021.

Abstract

PETRA is a configurable FP16 matrix multiplication and convolution accelerator designed to be 2.5D integrated using Advanced Interface Bus (AIB). PETRA is built upon four 16×16 systolic arrays, but it employs a configurable H-tree accumulation to improve both the latency and the utilization by up to 8×. A 22nm 3.04mm2 PETRA prototype provides 1.433TFLOPS in computing matrix-matrix multiplication (MMM) and convolution (conv) at 0.88V, and it achieves a 6.97TFLOPS/W peak efficiency at 0.7V. PETRA is integrated with an Intel Stratix 10 FPGA in a multi-chip package (MCP) to provide the flexibility of FPGA and the performance and efficiency of PETRA.

Details

Database :
OpenAIRE
Journal :
2021 Symposium on VLSI Circuits
Accession number :
edsair.doi...........6644db6ff9fd9f5d70a682541cb9611f
Full Text :
https://doi.org/10.23919/vlsicircuits52068.2021.9492517