OpenCL Implementation of Basic Operations for a High-order Finite-volume Polynomial Scheme on Unstructured Hybrid Meshes
- Source :
- Procedia Engineering. 61:76-80
- Publication Year :
- 2013
- Publisher :
- Elsevier BV, 2013.
Abstract
- A parallel finite-volume algorithm based on a cell-centered high-order polynomial scheme for unstructured hybrid meshes is considered. The work focuses on adapting and optimizing the basic operations of the algorithm for different architectures of massively parallel accelerators, including AMD and NVIDIA GPUs. Such an algorithm is especially problematic for GPU architectures because it has a very low FLOP-per-byte ratio, meaning that performance is dominated by the memory bandwidth rather than by the computing performance of the device. At the same time, it has an irregular memory access pattern because unstructured meshes are used. The calculation of polynomial coefficients and the calculation of convective fluxes through cell faces are the most interesting and time-consuming operations of the algorithm. Implementations of these operations for accelerators using OpenCL are considered in detail. Ways to improve the computational efficiency are proposed, and performance measurements reaching up to 160 GFLOPS on a single GPU device are demonstrated.
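The abstract's point that a low FLOP-per-byte ratio makes performance bandwidth-dominated can be illustrated with a simple roofline-style estimate. The sketch below is not taken from the paper: the function names and the device figures (`PEAK_GFLOPS`, `BANDWIDTH_GBS`) are hypothetical values chosen only to show the reasoning.

```python
def attainable_gflops(peak_gflops, bandwidth_gbs, flop_per_byte):
    """Roofline-style upper bound on sustained performance.

    A kernel cannot run faster than the device's peak compute rate, nor
    faster than memory can feed it (bandwidth * arithmetic intensity).
    """
    return min(peak_gflops, bandwidth_gbs * flop_per_byte)


def is_memory_bound(peak_gflops, bandwidth_gbs, flop_per_byte):
    """True when arithmetic intensity is below the ridge point peak/bandwidth."""
    return flop_per_byte < peak_gflops / bandwidth_gbs


# Hypothetical device figures, for illustration only (not from the paper).
PEAK_GFLOPS = 1000.0    # assumed peak compute rate, GFLOPS
BANDWIDTH_GBS = 200.0   # assumed memory bandwidth, GB/s

if __name__ == "__main__":
    for intensity in (0.5, 2.0, 10.0):  # assumed FLOP-per-byte ratios
        bound = attainable_gflops(PEAK_GFLOPS, BANDWIDTH_GBS, intensity)
        kind = "memory" if is_memory_bound(PEAK_GFLOPS, BANDWIDTH_GBS, intensity) else "compute"
        print(f"{intensity:5.1f} FLOP/byte: at most {bound:7.1f} GFLOPS ({kind}-bound)")
```

Under these assumed figures, any kernel below 5 FLOP per byte is limited by memory traffic, so reducing the bytes moved per operation would matter more than reducing arithmetic.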
- Subjects :
- Scheme (programming language)
Polynomial
Finite volume method
OpenCL
Computer science
GPU
Byte
OpenMP
Memory bandwidth
Parallel CFD
General Medicine
Parallel computing
FLOPS
Computational science
unstructured mesh
Computer Science::Mathematical Software
MPI
Polygon mesh
Implementation
computer
Engineering(all)
finite-volume
Details
- ISSN :
- 18777058
- Volume :
- 61
- Database :
- OpenAIRE
- Journal :
- Procedia Engineering
- Accession number :
- edsair.doi.dedup.....68426726a1526e51256438d608e4c416
- Full Text :
- https://doi.org/10.1016/j.proeng.2013.07.096