Back to Search
Start Over
A Fast Scalable Implicit Solver with Concentrated Computation for Nonlinear Time-Evolution Problems on Low-Order Unstructured Finite Elements
- Source :
- IPDPS
- Publication Year :
- 2018
- Publisher :
- IEEE, 2018.
-
Abstract
- Many supercomputers are shifting to architectures with low B (byte/s; memory transfer capability) per F (FLOPS capability) ratios. However, utilizing increased F is difficult for applications that inherently require large B. Targeting an implicit unstructured low-order finite-element analysis solver, which typically requires large B, we have developed a concentrated computation algorithm that yields significant performance improvements on low B/F supercomputers. 35.7% peak performance was achieved for a sparse matrix-vector multiplication kernel, and 15.6% peak performance was achieved for the whole solver on the second generation Xeon Phi-based Oakforest-PACS. This is 5.02 times faster than (and 6.90 times the peak performance of) the state-of-the-art solver (the SC14 Gordon Bell finalist solver). On Oakforest-PACS, the proposed solver was approximately 2.42 times faster than the state-of-the-art solver running on the K computer. The proposed approach has implications for systems and applications and is expected to have significant impact on various fields that use finite-element methods for nonlinear time evolution problems.
- Subjects :
- Computer science
Computation
Byte
02 engineering and technology
Solver
FLOPS
01 natural sciences
Finite element method
010305 fluids & plasmas
Computational science
0103 physical sciences
Scalability
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
Multiplication
Xeon Phi
Sparse matrix
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS)
- Accession number :
- edsair.doi...........ddd264660c65dc994419e2091b3e5a4e
- Full Text :
- https://doi.org/10.1109/ipdps.2018.00071