Back to Search Start Over

Building representative and balanced datasets of OpenMP parallel regions

Authors :
Steven Sleder
Jordi Alcaraz
Ali Jannesari
Joan Sorribes
Anna Sikora
Eduardo César
Ali TehraniJamsaz
Source :
PDP
Publication Year :
2021
Publisher :
IEEE, 2021.

Abstract

Incorporating machine learning into automatic performance analysis and tuning tools is a promising path to tackle the increasing heterogeneity of current HPC applications. However, this introduces the need for generating balanced and representative datasets of parallel applications’ executions. This work proposes a methodology for building datasets of OpenMP parallel code regions patterns. It allows for determining whether a given code region covers a unique part of the pattern input space not covered by the patterns already included in the dataset. The proposed methodology uses hardware performance counters to represent the execution of the region, which is referred to as the region signature for a given number of cores. Then, a complete representation of the region is built by joining the signatures for every different thread configuration in the system. Next, correlation analysis is performed between this representation and the representation of all the patterns already in the training set. Finally, if this correlation is below a given threshold, the region is considered to cover a unique part of the pattern input space and is subsequently added to the dataset. For validating this methodology, an example dataset, obtained from well known benchmarks, has been used to train a carefully designed neural network model to demonstrate that it is able to classify different patterns of OpenMP parallel regions.

Details

Database :
OpenAIRE
Journal :
2021 29th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP)
Accession number :
edsair.doi...........2ec2e7b055e724ab4c44128d4378a677
Full Text :
https://doi.org/10.1109/pdp52278.2021.00019