Back to Search Start Over

Maximizing the Bang Per Bit

Authors :
Clark, M. A.
Howarth, Dean
Tu, Jiqun
Wagner, Mathias
Weinberg, Evan
Source :
Proceedings of The 39th International Symposium on Lattice Field Theory - PoS(LATTICE2022) 338
Publication Year :
2023

Abstract

Reducing memory traffic is critical to accelerate Lattice QCD computations on modern processors, given that such computations are memory-bandwidth bound. A commonly used strategy is mixed-precision solvers, however, these require careful treatment to ensure stable convergence. We give an overview of the strategies employed in QUDA to stabilize mixed-precision variants of Conjugate Gradient (CG), and its multi-shift brethren. Through the use of customized numerical storage formats we can significantly improve upon the precision achievable compared to IEEE numerical formats, increasing both the solver precision and stability achievable at fixed word size. We give examples using BiCGStab(l) and multi-shift CG solvers using the HISQ operator.<br />Comment: 14 pages, 4 figures

Subjects

Subjects :
High Energy Physics - Lattice

Details

Database :
arXiv
Journal :
Proceedings of The 39th International Symposium on Lattice Field Theory - PoS(LATTICE2022) 338
Publication Type :
Report
Accession number :
edsarx.2302.09224
Document Type :
Working Paper
Full Text :
https://doi.org/10.22323/1.430.0338