A Single-Shot Generalized Device Placement for Large Dataflow Graphs.

Authors :
Zhou, Yanqi
Roy, Sudip
Abdolrashidi, Amirali
Wong, Daniel Lin-Kit
Ma, Peter
Xu, Qiumin
Mirhoseini, Azalia
Laudon, James
Source :
IEEE Micro. Sep/Oct 2020, Vol. 40 Issue 5, p26-36. 11p.
Publication Year :
2020

Abstract

With increasingly complex neural network architectures and heterogeneous device characteristics, finding a reasonable graph partitioning and device placement strategy is challenging. There have been prior attempts at learned approaches for solving device placement, but these approaches are computationally expensive, unable to handle large graphs consisting of over 50,000 nodes, and do not generalize well to unseen graphs. To address all these limitations, we propose an efficient single-shot, generalized deep RL method (SGDP) based on a scalable sequential attention mechanism over a graph neural network that is transferable to new graphs. On a diverse set of representative deep learning models, our method achieves on average a 20% improvement over human placement and an 18% improvement over the prior art, with 15× faster convergence. We are the first to demonstrate superhuman performance on an 8-layer recurrent neural network language model and an 8-layer GNMT consisting of over 50,000 nodes, on 8 GPUs. We provide rationales and a sensitivity study on model architecture selections. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
0272-1732
Volume :
40
Issue :
5
Database :
Academic Search Index
Journal :
IEEE Micro
Publication Type :
Academic Journal
Accession number :
145693359
Full Text :
https://doi.org/10.1109/MM.2020.3015188