1. Wilson matrix kernel for lattice QCD on A64FX architecture
- Author
Kanamori, Issaku; Nitadori, Keigo; and Matsufuru, Hideo
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing; High Energy Physics - Lattice
- Abstract
We study the implementation of the even-odd Wilson fermion matrix for lattice QCD simulations on the A64FX architecture. Efficient coding of the stencil operation is investigated for two-dimensional packing to SIMD vectors. We measure the sustained performance on the supercomputer Fugaku at RIKEN R-CCS and present profiler results for our code, which show the detailed efficiency of each part of the code and may signal an unexpected source of slowdown.
- Comment
10 pages, contribution to the International Workshop on Arm-based HPC: Practice and Experience (IWAHPCE-2023), held in conjunction with The International Conference on High Performance Computing in Asia-Pacific Region (HPC Asia 2023), Singapore, Feb 27 - March 2, 2023
- Published
2023