IMPLEMENTING HIGH-PERFORMANCE COMPLEX MATRIX MULTIPLICATION VIA THE 1M METHOD.

Authors :
VAN ZEE, FIELD G.
Source :
SIAM Journal on Scientific Computing. 2020, Vol. 42 Issue 5, pC221-C244. 24p.
Publication Year :
2020

Abstract

Almost all efforts to optimize high-performance matrix-matrix multiplication have been focused on the case where matrices contain real elements. The community's collective assumption appears to have been that the techniques and methods developed for the real domain carry over directly to the complex domain. As a result, implementors have mostly overlooked a class of methods that compute complex matrix multiplication using only real matrix products. This is the second in a series of articles that investigate these so-called induced methods. In the previous article, we found that algorithms based on the more generally applicable of the two methods--the 4M method--lead to implementations that, for various reasons, often underperform their real domain counterparts. To overcome these limitations, we derive a superior 1M method for expressing complex matrix multiplication, one which addresses virtually all of the shortcomings inherent in 4M. Implementations are developed within the BLIS framework, and testing on microarchitectures by three vendors confirms that the 1M method yields performance that is generally competitive with solutions based on conventionally implemented complex kernels, sometimes even outperforming vendor libraries. [ABSTRACT FROM AUTHOR]
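The idea of computing a complex matrix product "using only real matrix products" can be illustrated with a small sketch. It is not taken from the article or from BLIS. It only demonstrates two standard algebraic identities: a four-product form (in the spirit of 4M) and a single-product form (in the spirit of 1M), in which each complex element of the left operand is expanded into a 2x2 real block and each element of the right operand into a 2x1 real block. The function names (`real_matmul`, `matmul_4m`, `expand_left`, `expand_right`, `matmul_1m`) and the list-of-lists storage are assumptions made for illustration. Real implementations perform this rearrangement while packing data for an optimized real kernel.

```python
def real_matmul(a, b):
    """Plain real matrix product on lists of lists (stand-in for a real gemm)."""
    inner, cols = len(b), len(b[0])
    return [[sum(row[p] * b[p][j] for p in range(inner)) for j in range(cols)]
            for row in a]


def matmul_4m(a, b):
    """Complex product from four real products:
    Cr = Ar*Br - Ai*Bi and Ci = Ar*Bi + Ai*Br."""
    ar = [[z.real for z in row] for row in a]
    ai = [[z.imag for z in row] for row in a]
    br = [[z.real for z in row] for row in b]
    bi = [[z.imag for z in row] for row in b]
    rr, ii = real_matmul(ar, br), real_matmul(ai, bi)
    ri, ir = real_matmul(ar, bi), real_matmul(ai, br)
    return [[complex(rr[i][j] - ii[i][j], ri[i][j] + ir[i][j])
             for j in range(len(b[0]))] for i in range(len(a))]


def expand_left(a):
    """Hypothetical helper: map each complex element x + iy of A to the
    2x2 real block [[x, -y], [y, x]], giving a 2m x 2k real matrix."""
    out = []
    for row in a:
        top, bottom = [], []
        for z in row:
            top += [z.real, -z.imag]
            bottom += [z.imag, z.real]
        out += [top, bottom]
    return out


def expand_right(b):
    """Hypothetical helper: map each complex element x + iy of B to the
    2x1 real block [x; y], giving a 2k x n real matrix."""
    out = []
    for row in b:
        out.append([z.real for z in row])
        out.append([z.imag for z in row])
    return out


def matmul_1m(a, b):
    """Complex product from ONE real product of the expanded operands.
    Rows 2i and 2i+1 of the real result hold the real and imaginary
    parts of row i of C."""
    c = real_matmul(expand_left(a), expand_right(b))
    return [[complex(c[2 * i][j], c[2 * i + 1][j])
             for j in range(len(b[0]))] for i in range(len(a))]
```

In this sketch the single real product operates on a left operand with twice as many rows and columns, so it performs the same number of real multiplications as the four-product form; the difference is that all of the work goes through one call to the real kernel.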

Details

Language :
English
ISSN :
1064-8275
Volume :
42
Issue :
5
Database :
Academic Search Index
Journal :
SIAM Journal on Scientific Computing
Publication Type :
Academic Journal
Accession number :
148431581
Full Text :
https://doi.org/10.1137/19M1282040