Implementing High-Performance Complex Matrix Multiplication via the 1M Method
- Source :
- SIAM Journal on Scientific Computing. 2020, Vol. 42 Issue 5, pC221-C244. 24p.
- Publication Year :
- 2020
Abstract
- Almost all efforts to optimize high-performance matrix-matrix multiplication have been focused on the case where matrices contain real elements. The community's collective assumption appears to have been that the techniques and methods developed for the real domain carry over directly to the complex domain. As a result, implementors have mostly overlooked a class of methods that compute complex matrix multiplication using only real matrix products. This is the second in a series of articles that investigate these so-called induced methods. In the previous article, we found that algorithms based on the more generally applicable of the two methods, the 4M method, lead to implementations that, for various reasons, often underperform their real-domain counterparts. To overcome these limitations, we derive a superior 1M method for expressing complex matrix multiplication, one which addresses virtually all of the shortcomings inherent in 4M. Implementations are developed within the BLIS framework, and testing on microarchitectures by three vendors confirms that the 1M method yields performance that is generally competitive with solutions based on conventionally implemented complex kernels, sometimes even outperforming vendor libraries. [ABSTRACT FROM AUTHOR]
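The abstract's central idea, obtaining a complex matrix product from real matrix products alone, can be illustrated with a small sketch. The version below is a plain-Python illustration under stated assumptions, not the BLIS implementation. It assumes the product can be formed by rearranging the operands and then calling a single real matrix multiplication: each complex entry x+iy of the left operand is expanded into the 2x2 real block [[x, -y], [y, x]], and each entry of the right operand is split into a real row followed by an imaginary row. The function names (`real_matmul`, `expand_left`, `split_right`, `complex_matmul_one_real_product`) are hypothetical and chosen for this example. The packing formats and kernels the article actually uses are not reproduced here.

```python
def real_matmul(a, b):
    """Plain real matrix product on row-major lists of lists."""
    inner, cols = len(b), len(b[0])
    return [[sum(row[p] * b[p][j] for p in range(inner)) for j in range(cols)]
            for row in a]


def expand_left(a):
    """Expand each complex entry x+iy of A into the real block [[x, -y], [y, x]].

    An m x k complex matrix becomes a 2m x 2k real matrix.
    """
    rows = []
    for row in a:
        rows.append([v for z in row for v in (z.real, -z.imag)])
        rows.append([v for z in row for v in (z.imag, z.real)])
    return rows


def split_right(b):
    """Split each complex entry x+iy of B into a real row and an imaginary row.

    A k x n complex matrix becomes a 2k x n real matrix.
    """
    rows = []
    for row in b:
        rows.append([z.real for z in row])
        rows.append([z.imag for z in row])
    return rows


def complex_matmul_one_real_product(a, b):
    """Compute the complex product A @ B using exactly one real matrix product."""
    c_real = real_matmul(expand_left(a), split_right(b))
    # Rows 2i and 2i+1 of the real result hold the real and imaginary
    # parts of row i of the complex result.
    return [[complex(re, im) for re, im in zip(c_real[2 * i], c_real[2 * i + 1])]
            for i in range(len(a))]
```

The rearrangement works because row 2i of the real product accumulates x_a*x_b - y_a*y_b (the real part) and row 2i+1 accumulates y_a*x_b + x_a*y_b (the imaginary part), so no complex arithmetic is needed inside the multiplication itself.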
Details
- Language :
- English
- ISSN :
- 1064-8275
- Volume :
- 42
- Issue :
- 5
- Database :
- Academic Search Index
- Journal :
- SIAM Journal on Scientific Computing
- Publication Type :
- Academic Journal
- Accession number :
- 148431581
- Full Text :
- https://doi.org/10.1137/19M1282040