A Hybrid CPU-GPU Multifrontal Optimizing Method in Sparse Cholesky Factorization.

Authors :: Chen, Yong
Jin, Hai
Zheng, Ran
Liu, Yuandong
Wang, Wei
Source :: Journal of Signal Processing Systems for Signal, Image & Video Technology; Jan2018, Vol. 90 Issue 1, p53-67, 15p
Publication Year :: 2018
Abstract: In many scientific computing applications, sparse Cholesky factorization is used to solve large sparse linear systems in distributed environments. GPU computing offers a new way to approach the problem. However, sparse Cholesky factorization on a GPU rarely achieves good performance because of the irregular structure of the matrix and low GPU resource utilization. A hybrid CPU-GPU implementation of sparse Cholesky factorization based on the multifrontal method is proposed. The method decomposes a large sparse coefficient matrix into a series of small dense matrices (frontal matrices) and then performs multiple GEMM (General Matrix-matrix Multiplication) operations on them. GEMMs are the main operations in sparse Cholesky factorization, but they are difficult to run efficiently in parallel on a GPU. To improve performance, a scheme of multiple task queues is adopted to run multiple GEMMs in parallel within the multifrontal method; all GEMM tasks are scheduled dynamically on the GPU and CPU according to their computation scale, to balance load and reduce computing time. Experimental results show that the approach can outperform the cuBLAS implementations, achieving up to 1.98× speedup on a GTX460 (Fermi micro-architecture) and up to 3.06× speedup on a K20m (Kepler micro-architecture). [ABSTRACT FROM AUTHOR]
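
Illustrative note: the abstract describes placing GEMM tasks on the CPU or the GPU according to their computation scale. The sketch below is not the authors' scheduler; it is a minimal, static greedy stand-in for that idea. The names `GemmTask` and `schedule`, the throughput figures `cpu_rate` and `gpu_rate`, and the fixed `gpu_overhead` cost are all hypothetical values chosen for illustration. Each task goes to the device that would finish it earliest, so very small GEMMs tend to stay on the CPU (the GPU overhead dominates) while large ones go to the GPU.

```python
from collections import deque
from dataclasses import dataclass


@dataclass(frozen=True)
class GemmTask:
    """One dense update C += A @ B, with A of shape (m, k) and B of shape (k, n)."""
    m: int
    n: int
    k: int

    @property
    def flops(self) -> int:
        # One multiply and one add per (i, j, l) triple: 2 * m * n * k operations.
        return 2 * self.m * self.n * self.k


def schedule(tasks, cpu_rate=1e10, gpu_rate=1e12, gpu_overhead=1e-4):
    """Greedily split GEMM tasks into a CPU queue and a GPU queue.

    cpu_rate and gpu_rate are assumed throughputs in flop/s, and gpu_overhead
    is an assumed fixed per-task cost in seconds (launch plus transfer).
    These numbers are illustrative, not measurements from the paper.
    """
    cpu_queue, gpu_queue = deque(), deque()
    cpu_busy = gpu_busy = 0.0  # estimated time at which each device becomes free

    # Place the largest tasks first so the small ones can fill the gaps.
    for task in sorted(tasks, key=lambda t: t.flops, reverse=True):
        cpu_finish = cpu_busy + task.flops / cpu_rate
        gpu_finish = gpu_busy + gpu_overhead + task.flops / gpu_rate
        if gpu_finish <= cpu_finish:
            gpu_queue.append(task)
            gpu_busy = gpu_finish
        else:
            cpu_queue.append(task)
            cpu_busy = cpu_finish
    return cpu_queue, gpu_queue
```

With the default (assumed) rates, calling `schedule([GemmTask(8, 8, 8), GemmTask(2000, 2000, 2000)])` places the large task in the GPU queue and the small one in the CPU queue. A scheduler that is dynamic in the sense of the abstract would presumably make these decisions at run time as tasks become ready, rather than from a fixed list.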

Details

Language :: English
ISSN :: 19398018
Volume :: 90
Issue :: 1
Database :: Complementary Index
Journal :: Journal of Signal Processing Systems for Signal, Image & Video Technology
Publication Type :: Academic Journal
Accession number :: 127247245
Full Text :: https://doi.org/10.1007/s11265-017-1227-9
