Back to Search Start Over

MLKAPS: Machine Learning and Adaptive Sampling for HPC Kernel Auto-tuning

Authors :
Jam, Mathys
Petit, Eric
Castro, Pablo de Oliveira
Defour, David
Henry, Greg
Jalby, William
Publication Year :
2025

Abstract

Many High-Performance Computing (HPC) libraries rely on decision trees to select the best kernel hyperparameters at runtime,depending on the input and environment. However, finding optimized configurations for each input and environment is challengingand requires significant manual effort and computational resources. This paper presents MLKAPS, a tool that automates this task usingmachine learning and adaptive sampling techniques. MLKAPS generates decision trees that tune HPC kernels' design parameters toachieve efficient performance for any user input. MLKAPS scales to large input and design spaces, outperforming similar state-of-the-artauto-tuning tools in tuning time and mean speedup. We demonstrate the benefits of MLKAPS on the highly optimized Intel MKLdgetrf LU kernel and show that MLKAPS finds blindspots in the manual tuning of HPC experts. It improves over 85% of the inputswith a geomean speedup of x1.30. On the Intel MKL dgeqrf QR kernel, MLKAPS improves performance on 85% of the inputs with ageomean speedup of x1.18.

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2501.05811
Document Type :
Working Paper