1. DGCNN on FPGA: Acceleration of the Point Cloud Classifier Using FPGAs.
- Author
-
Jamali Golzar, Saleh, Karimian, Ghader, Shoaran, Maryam, and Fattahi Sani, Mohammad
- Subjects
- *
DEEP learning , *POINT cloud , *FIELD programmable gate arrays , *CONVOLUTIONAL neural networks - Abstract
Over the last few years, deep learning on irregular 3D data given its wide range of applications has become one of the active topics in the field. While field programmable gate array (FPGA)-based acceleration of deep learning models has been proved to produce power-efficient designs in comparison with other platforms such as CPUs and GPUs, only a few studies have been conducted regarding the models that consume point clouds as their input. Although tailoring the hardware designs to specific networks could lead to better optimization opportunities, it is also important to keep the reusability of the design in mind, especially for a new and evolving topic like learning on point clouds. In this work, we have aimed to achieve reusability by keeping the hardware isolated from the computational graph. Considering the numerous types of layers used in dynamic graph convolutional neural network (DGCNN) and its popularity, our proposed design aims for the thorough acceleration of DGCNN. The challenges including 18 types of tensor operations, achieving burst transfers, dealing with kernel complexities, external memory banks, in-order and out-of-order execution modes, and approaches with multiple processing elements have been explained in details throughout the paper. Our experiments on a single FPGA with a single bitstream, DDR4 memory subsystem, and Float32 data type demonstrated speedups of 2.73 × to 8.4 × compared to a sequential single-threaded implantation on an Intel Core i7 6700HQ. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF