Back to Search Start Over

Towards elastic in situ analysis for high-performance computing simulations.

Authors :
Dorier, Matthieu
Wang, Zhe
Ramesh, Srinivasan
Ayachit, Utkarsh
Snyder, Shane
Ross, Rob
Parashar, Manish
Source :
Journal of Parallel & Distributed Computing. Jul2023, Vol. 177, p106-116. 11p.
Publication Year :
2023

Abstract

In situ analysis and visualization have grown increasingly popular for enabling direct access to data from high-performance computing (HPC) simulations. As a simulation progresses and interesting physical phenomena emerge, however, the data produced may become increasingly complex, and users may need to dynamically change the type and scale of in situ analysis tasks being carried out and consequently adapt the amount of resources allocated to such tasks. To date, none of the production in situ analysis frameworks offer such an elasticity feature, and for good reason: the assumption that the number of processes could vary during run time would force developers to rethink software and algorithms at every level of the in situ analysis stack. In this paper we present Colza, a data staging service with elastic in situ visualization capabilities. We demonstrate the use of Colza with the Deep Water Impact and the AMR-Wind simulations, coupling them with the ParaView Catalyst and Ascent in situ libraries, and show that Colza enables dynamic rescaling of these widely-used frameworks with no interruption to the simulation or staging service. We highlight the challenges of enabling such elasticity, which requires overcoming these frameworks' reliance on MPI, using distinct engineering approaches, namely dependency injection and dependency overload. To the best of our knowledge, this work is the first to enable elastic in situ visualization capabilities for HPC applications on top of existing production analysis tools. • We introduce Colza, an elastic service for in situ analysis of HPC applications. • We present two methods of integrating Colza with existing in situ frameworks. • We evaluate Colza on ANL's Theta and NERSC's Cori supercomputers. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
07437315
Volume :
177
Database :
Academic Search Index
Journal :
Journal of Parallel & Distributed Computing
Publication Type :
Academic Journal
Accession number :
163048025
Full Text :
https://doi.org/10.1016/j.jpdc.2023.02.014