Back to Search Start Over

A deep learning programming framework for FT-Matrix DSP+MatrixZone heterogeneous systems.

Authors :
KANG Yu-han
SHI Yang
CHEN Zhao-yun
WEN Mei
Source :
Computer Engineering & Science / Jisuanji Gongcheng yu Kexue; Jul2023, Vol. 45 Issue 7, p1149-1158, 10p
Publication Year :
2023

Abstract

To meet the fast iteration speed and high computing power requirements of deep learning models, mainstream hardware vendors are increasingly inclined towards heterogeneous systems consisting of general-purpose processors and AI-specific accelerator cores. However, AI-specific accelerator ores only support certain core operators and do not have general programming capabilities. Therefore, how to efficiently deploy deep learning tasks on such heterogeneous architectures is worth further research. Based on the domestically developed FT-Matrix DSP+MatrixZone heterogeneous system platform, this paper designs and implements a deep learning programming framework, called KaiSa. KaiSa analyzes the input parameters of the deep learning model, identifies the operator type, and assigns it to the corresponding computing core. For complex operators, KaiSa automatically completes the optimal search for the block size based on a performance model, improving the performance of dual-core parallel computing. At the same time, KaiSa shields all low-level hardware details to provide users with a friendly programming environment for efficient program development. Experimental results show that KaiSa can achieve performance improvements of up to 39.0%. [ABSTRACT FROM AUTHOR]

Details

Language :
Chinese
ISSN :
1007130X
Volume :
45
Issue :
7
Database :
Complementary Index
Journal :
Computer Engineering & Science / Jisuanji Gongcheng yu Kexue
Publication Type :
Academic Journal
Accession number :
170068158
Full Text :
https://doi.org/10.3969/j.issn.1007-130X.2023.07.002