Static Compilation Analysis for Host-Accelerator Communication Optimization
- Source :
- Lecture Notes in Computer Science, LCPC 2011, LNCS 7146, LCPC'2011 : The 24th International Workshop on Languages and Compilers for Parallel Computing, Sep 2011, Fort Collins, Colorado, United States. pp. 237-251, Languages and Compilers for Parallel Computing ISBN: 9783642360350, LCPC
- Publication Year :
- 2011
- Publisher :
- HAL CCSD, 2011.
Abstract
- We present an automatic, static program transformation that schedules and generates efficient memory transfers between a computer host and its hardware accelerator, addressing a well-known performance bottleneck. Our automatic approach uses two simple heuristics: to perform transfers to the accelerator as early as possible and to delay transfers back from the accelerator as late as possible. We implemented this transformation as a middle-end compilation pass in the PIPS/Par4All compiler. In the generated code, redundant communications due to data reuse between kernel executions are avoided. Instructions that initiate transfers are scheduled effectively at compile-time. We present experimental results obtained with Polybench 2.0, some Rodinia benchmarks, and a real numerical simulation. We obtain an average speedup of 4 to 5 when compared to a naïve parallelization using a modern GPU with Par4All, HMPP, and PGI, and 3.5 when compared to an OpenMP version using a 12-core multiprocessor.
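The effect of avoiding redundant communications can be illustrated with a small counting model. This is only a sketch under stated assumptions, not the PIPS/Par4All pass itself: the `Kernel` class and the functions `naive_transfers` and `lazy_transfers` are hypothetical names, the host is assumed not to touch any array between kernel launches, and the bookkeeping is done at run time for illustration, whereas the abstract describes a static, compile-time analysis. The sketch models data reuse and the late copy-back heuristic, but not the early scheduling of transfers to the accelerator.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Kernel:
    """Hypothetical description of one accelerator kernel launch."""
    name: str
    reads: tuple   # names of arrays the kernel reads
    writes: tuple  # names of arrays the kernel writes


def naive_transfers(kernels):
    """Count copies when every input is copied in before each kernel
    and every output is copied back right after it."""
    return sum(len(k.reads) + len(k.writes) for k in kernels)


def lazy_transfers(kernels, host_reads_at_end):
    """Count copies when data stays on the accelerator between kernels.

    Assumption: the host neither reads nor writes the arrays until the
    last kernel has finished.
    """
    on_device = set()     # arrays that already have a valid device copy
    device_newer = set()  # arrays whose latest value exists only on the device
    transfers = 0
    for k in kernels:
        for array in k.reads:
            if array not in on_device:
                transfers += 1  # host-to-device copy, done once per array
                on_device.add(array)
        for array in k.writes:
            on_device.add(array)
            device_newer.add(array)
    for array in host_reads_at_end:
        if array in device_newer:
            transfers += 1  # device-to-host copy, delayed until needed
    return transfers


# Three chained kernels: b and c are intermediate results reused on the device.
kernels = [
    Kernel("k1", reads=("a",), writes=("b",)),
    Kernel("k2", reads=("b",), writes=("c",)),
    Kernel("k3", reads=("c", "a"), writes=("d",)),
]

naive_count = naive_transfers(kernels)        # 7 copies
lazy_count = lazy_transfers(kernels, ["d"])   # 2 copies: a in, d out
```

In this toy chain the intermediate arrays never leave the accelerator, which is the kind of data reuse between kernel executions that the abstract refers to.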
- Subjects :
- communication optimization
020203 distributed computing
Speedup
Computer science
[INFO.COMP]Computer Science [cs]/domain_info.comp
gpu
Program transformation
Multiprocessing
[INFO.PAR]Computer Science [cs]/domain_info.par
02 engineering and technology
Parallel computing
computer.software_genre
heterogeneous parallel architecture
Bottleneck
Automatic parallelization
0202 electrical engineering, electronic engineering, information engineering
Hardware acceleration
020201 artificial intelligence & image processing
source-to-source compilation
Compiler
computer
Host (network)
Details
- Language :
- English
- ISBN :
- 978-3-642-36035-0
- ISSN :
- 0302-9743
- Database :
- OpenAIRE
- Journal :
- Lecture Notes in Computer Science, LCPC 2011, LNCS 7146, LCPC'2011 : The 24th International Workshop on Languages and Compilers for Parallel Computing, Sep 2011, Fort Collins, Colorado, United States. pp. 237-251, Languages and Compilers for Parallel Computing ISBN: 9783642360350, LCPC
- Accession number :
- edsair.doi.dedup.....1ca5b6c9546d20013b5b1e09b386b471