Back to Search
Start Over
Fine‐grain task‐parallel algorithms for matrix factorizations and inversion on many‐threaded CPUs
- Source :
- E-Prints Complutense. Archivo Institucional de la UCM, instname, UPCommons. Portal del coneixement obert de la UPC, Universitat Politècnica de Catalunya (UPC)
- Publication Year :
- 2022
- Publisher :
- Wiley, 2022.
-
Abstract
- We extend a two-level task partitioning previously applied to the inversion of dense matrices via Gauss–Jordan elimination to the more challenging QR factorization as well as the initial orthogonal reduction to band form found in the singular value decomposition. Our new task-parallel algorithms leverage the tasking mechanism currently available in OpenMP to exploit “nested” task parallelism, with a first outer level that operates on matrix panels and a second inner level that processes the matrix either by µ -panels or by tiles, in order to expose a large number of independent tasks. We present a detailed performance analysis, including execution traces, which shows that the two-level refinement into fine grain tasks allows for an improved load balancing and delivers high performance on current general-purpose many-core processors (CPUs) from Intel and AMD. This research was sponsored by projects RTI2018-093684-B-I00, PID2019-107255GB andTIN2017-82972-R of Ministerio de Ciencia, Innovación y Universidades; project S2018/TCS-4423 of Comunidad de Madrid; project 2017-SGR-1414 of the Generalitat de Catalunya and the Madrid Government under the Multiannual Agreement with UCM in the line Program to Stimulate Research for Young Doctors in the context of the V PRICIT, project PR65/19-22445.
- Subjects :
- Informática
Sistemas expertos
Matrix factorizations
Algebras, Linear
Parallel processing (Electronic computers)
Computer Networks and Communications
CPUs
Matrix inversion
Processament en paral·lel (Ordinadors)
Task parallelism
OpenMP
Computer Science Applications
Theoretical Computer Science
High performance
Computational Theory and Mathematics
High performance computing
Àlgebra lineal
Informàtica::Arquitectura de computadors::Arquitectures paral·leles [Àrees temàtiques de la UPC]
Càlcul intensiu (Informàtica)
Software
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- E-Prints Complutense. Archivo Institucional de la UCM, instname, UPCommons. Portal del coneixement obert de la UPC, Universitat Politècnica de Catalunya (UPC)
- Accession number :
- edsair.doi.dedup.....ad89e55394bd5187f0ce53d23414d897