Back to Search
Start Over
Building representative and balanced datasets of OpenMP parallel regions
- Source :
- PDP
- Publication Year :
- 2021
- Publisher :
- IEEE, 2021.
-
Abstract
- Incorporating machine learning into automatic performance analysis and tuning tools is a promising path to tackle the increasing heterogeneity of current HPC applications. However, this introduces the need for generating balanced and representative datasets of parallel applications’ executions. This work proposes a methodology for building datasets of OpenMP parallel code regions patterns. It allows for determining whether a given code region covers a unique part of the pattern input space not covered by the patterns already included in the dataset. The proposed methodology uses hardware performance counters to represent the execution of the region, which is referred to as the region signature for a given number of cores. Then, a complete representation of the region is built by joining the signatures for every different thread configuration in the system. Next, correlation analysis is performed between this representation and the representation of all the patterns already in the training set. Finally, if this correlation is below a given threshold, the region is considered to cover a unique part of the pattern input space and is subsequently added to the dataset. For validating this methodology, an example dataset, obtained from well known benchmarks, has been used to train a carefully designed neural network model to demonstrate that it is able to classify different patterns of OpenMP parallel regions.
- Subjects :
- Artificial neural network
Cover (telecommunications)
Computer science
05 social sciences
050401 social sciences methods
02 engineering and technology
Thread (computing)
Space (commercial competition)
computer.software_genre
Signature (logic)
0504 sociology
Path (graph theory)
0202 electrical engineering, electronic engineering, information engineering
Code (cryptography)
020201 artificial intelligence & image processing
Data mining
Representation (mathematics)
computer
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2021 29th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP)
- Accession number :
- edsair.doi...........2ec2e7b055e724ab4c44128d4378a677
- Full Text :
- https://doi.org/10.1109/pdp52278.2021.00019