COUNTDOWN: A Run-Time Library for Performance-Neutral Energy Saving in MPI Applications.

Authors :
Cesarini, Daniele
Bartolini, Andrea
Bonfa, Pietro
Cavazzoni, Carlo
Benini, Luca
Source :
IEEE Transactions on Computers. May 2021, Vol. 70 Issue 5, p682-695. 14p.
Publication Year :
2021

Abstract

Power and energy consumption are becoming key challenges for the supercomputers’ exascale race. HPC systems’ processors waste active power during communication and synchronization among the MPI processes in large-scale HPC applications. However, due to the time scale at which communication happens, transitioning into low-power states while waiting for the completion of each communication may introduce unacceptable overhead. In this article, we present COUNTDOWN, a run-time library for identifying and automatically reducing the power consumption of the CPUs during communication and synchronization. COUNTDOWN saves energy without penalizing the time-to-completion by lowering the CPUs’ power consumption only during idle times for which the power state transition overhead is negligible. This is done transparently to the user, without requiring labor-intensive and error-prone application code modifications or recompilation of the application. We test our methodology on a production Tier-1 system. For the NAS benchmarks, COUNTDOWN saves between 6 and 50 percent energy, with a time-to-solution penalty lower than 5 percent. In a complete production run of Quantum ESPRESSO on 3.5K cores, COUNTDOWN saves 22.36 percent energy, with a performance penalty below 3 percent. Energy saving increases to 37 percent with a performance penalty of 6.38 percent, if the application is executed without communication tuning. [ABSTRACT FROM AUTHOR]
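
Illustrative sketch (not part of the record or the article): the abstract's central idea, lowering CPU power only for idle periods long enough that the power-state transition cost is negligible, can be approximated with a simple timeout policy. The Python model below is a toy simulation under stated assumptions, not the library's implementation. The names `PowerModel` and `simulate`, the wattages, the transition cost, and the timeout value are all hypothetical and chosen only to make the trade-off visible.

```python
from dataclasses import dataclass
from typing import Iterable, Optional, Tuple


@dataclass(frozen=True)
class PowerModel:
    """Hypothetical per-core figures; none of these come from the article."""
    busy_wait_watts: float = 100.0   # power while spinning inside a communication call
    low_power_watts: float = 40.0    # power once the core is in a low-power state
    transition_s: float = 0.0005     # assumed delay to return to full speed


def simulate(waits_s: Iterable[float], timeout_s: float,
             model: Optional[PowerModel] = None) -> Tuple[float, float, float]:
    """Return (baseline_joules, policy_joules, added_delay_seconds).

    Baseline: the core busy-waits at full power for every wait.
    Policy: the core busy-waits for `timeout_s`; only if the wait is still
    pending after that does it drop to the low-power state, paying one
    transition (charged at full power) when the wait completes.
    """
    model = model or PowerModel()
    baseline_j = policy_j = added_delay_s = 0.0
    for wait in waits_s:
        baseline_j += wait * model.busy_wait_watts
        if wait <= timeout_s:
            # Short wait: the timer never expires, so nothing changes.
            policy_j += wait * model.busy_wait_watts
        else:
            policy_j += timeout_s * model.busy_wait_watts
            policy_j += (wait - timeout_s) * model.low_power_watts
            policy_j += model.transition_s * model.busy_wait_watts
            added_delay_s += model.transition_s
    return baseline_j, policy_j, added_delay_s


if __name__ == "__main__":
    # Many short waits plus a few long ones (synthetic values).
    waits = [0.0001] * 1000 + [0.05] * 10
    for timeout in (0.0, 0.0005):
        base, pol, delay = simulate(waits, timeout)
        print(f"timeout={timeout}: baseline={base:.2f} J, "
              f"policy={pol:.2f} J, added delay={delay:.4f} s")
```

In this toy model, a timeout of zero pays the transition cost on every wait, including the many short ones, whereas a non-zero timeout pays it only on the few long waits, which is where most of the recoverable energy sits.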

Details

Language :
English
ISSN :
0018-9340
Volume :
70
Issue :
5
Database :
Academic Search Index
Journal :
IEEE Transactions on Computers
Publication Type :
Academic Journal
Accession number :
149773503
Full Text :
https://doi.org/10.1109/TC.2020.2995269