Back to Search
Start Over
A deep learning programming framework for FT-Matrix DSP+MatrixZone heterogeneous systems.
- Source :
- Computer Engineering & Science / Jisuanji Gongcheng yu Kexue; Jul2023, Vol. 45 Issue 7, p1149-1158, 10p
- Publication Year :
- 2023
-
Abstract
- To meet the fast iteration speed and high computing power requirements of deep learning models, mainstream hardware vendors are increasingly inclined towards heterogeneous systems consisting of general-purpose processors and AI-specific accelerator cores. However, AI-specific accelerator ores only support certain core operators and do not have general programming capabilities. Therefore, how to efficiently deploy deep learning tasks on such heterogeneous architectures is worth further research. Based on the domestically developed FT-Matrix DSP+MatrixZone heterogeneous system platform, this paper designs and implements a deep learning programming framework, called KaiSa. KaiSa analyzes the input parameters of the deep learning model, identifies the operator type, and assigns it to the corresponding computing core. For complex operators, KaiSa automatically completes the optimal search for the block size based on a performance model, improving the performance of dual-core parallel computing. At the same time, KaiSa shields all low-level hardware details to provide users with a friendly programming environment for efficient program development. Experimental results show that KaiSa can achieve performance improvements of up to 39.0%. [ABSTRACT FROM AUTHOR]
Details
- Language :
- Chinese
- ISSN :
- 1007130X
- Volume :
- 45
- Issue :
- 7
- Database :
- Complementary Index
- Journal :
- Computer Engineering & Science / Jisuanji Gongcheng yu Kexue
- Publication Type :
- Academic Journal
- Accession number :
- 170068158
- Full Text :
- https://doi.org/10.3969/j.issn.1007-130X.2023.07.002