Start Over

MARCO: A High-performance Task M apping a nd R outing Co -optimization Framework for Point-to-Point NoC-based Heterogeneous Computing Systems

Authors :: Xiangzhong Luo
Hui Chen
Zihao Zhang
Peng Chen
Shiqing Li
Weichen Liu
Source :: ACM Transactions on Embedded Computing Systems. 20:1-21
Publication Year :: 2021
Publisher :: Association for Computing Machinery (ACM), 2021.
Abstract: Heterogeneous computing systems (HCSs), which consist of various processing elements (PEs) that vary in their processing ability, are usually facilitated by the network-on-chip (NoC) to interconnect its components. The emerging point-to-point NoCs which support single-cycle-multi-hop transmission, reduce or eliminate the latency dependence on distance, addressing the scalability concern raised by high latency for long-distance transmission and enlarging the design space of the routing algorithm to search the non-shortest paths. For such point-to-point NoC-based HCSs, resource management strategies which are managed by compilers, scheduler, or controllers, e.g., mapping and routing, are complicated for the following reasons: (i) Due to the heterogeneity, mapping and routing need to optimize computation and communication concurrently (for homogeneous computing systems, only communication). (ii) Conducting mapping and routing consecutively cannot minimize the schedule length in most cases since the PEs with high processing ability may locate in the crowded area and suffer from high resource contention overhead. (iii) Since changing the mapping selection of one task will reconstruct the whole routing design space, the exploration of mapping and routing design space is challenging. Therefore, in this work, we propose MARCO, the m apping a nd r outing co -optimization framework, to decrease the schedule length of applications on point-to-point NoC-based HCSs. Specifically, we revise the tabu search to explore the design space and evaluate the quality of mapping and routing. The advanced reinforcement learning (RL)algorithm, i.e., advantage actor-critic, is adopted to efficiently compute paths. We perform extensive experiments on various real applications, which demonstrates that the MARCO achieves a remarkable performance improvement in terms of schedule length (+44.94% ∼ +50.18%) when compared with the state-of-the-art mapping and routing co-optimization algorithm for homogeneous computing systems. We also compare MARCO with different combinations of state-of-the-art mapping and routing approaches.

Subjects :: Point-to-point
Schedule
Hardware and Architecture
Computer science
Distributed computing
Scalability
Overhead (computing)
Symmetric multiprocessor system
Routing (electronic design automation)
Performance improvement
Software
Tabu search

Details

ISSN :: 15583465 and 15399087
Volume :: 20
Database :: OpenAIRE
Journal :: ACM Transactions on Embedded Computing Systems
Accession number :: edsair.doi...........72d70a129fc912ad6e05f22d8edbbe2b
Full Text :: https://doi.org/10.1145/3476985

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

MARCO: A High-performance Task <u>M</u> apping <u>a</u> nd <u>R</u> outing <u>Co</u> -optimization Framework for Point-to-Point NoC-based Heterogeneous Computing Systems

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

MARCO: A High-performance Task <u>M</u> apping <u>a</u> nd <u>R</u> outing <u>Co</u> -optimization Framework for Point-to-Point NoC-based Heterogeneous Computing Systems

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources