Search

Your search keyword '"Xeon Phi"' showing total 142 results

Search Constraints

Start Over You searched for: Descriptor "Xeon Phi" Remove constraint Descriptor: "Xeon Phi" Topic cuda Remove constraint Topic: cuda
142 results on '"Xeon Phi"'

Search Results

1. Performance and Portability of a Linear Solver Across Emerging Architectures

2. A Case Study for Performance Portability Using OpenMP 4.5

3. Toward a BLAS library truly portable across different accelerator types.

4. Vessel Segmentation for Noisy CT Data with Quality Measure Based on Single-Point Contrast-to-Noise Ratio

5. SIMD Monte-Carlo Numerical Simulations Accelerated on GPU and Xeon Phi.

6. Speeding-up Bioinformatics Algorithms with Heterogeneous Architectures: Highly Heterogeneous Smith-Waterman (HHeterSW).

7. An Optimizing Multi-platform Source-to-source Compiler Framework for the NEURON MODeling Language

8. VIENNACL--LINEAR ALGEBRA LIBRARY FOR MULTI- AND MANY-CORE ARCHITECTURES.

9. Manycore Algorithms for Batch Scalar and Block Tridiagonal Solvers.

10. Bit-parallel approximate pattern matching: Kepler GPU versus Xeon Phi.

11. Vectorizing unstructured mesh computations for many-core architectures.

12. Exact diagonalization of quantum lattice models on coprocessors.

13. Multigrid for Matrix-Free High-Order Finite Element Computations on Graphics Processors

14. A high-order cross-platform incompressible Navier–Stokes solver via artificial compressibility with application to a turbulent jet

15. Performance and Portability of a Linear Solver Across Emerging Architectures

16. Optimizing Streaming Parallelism on Heterogeneous Many-Core Architectures

17. A benchmark set of highly-efficient CUDA and OpenCL kernels and its dynamic autotuning with Kernel Tuning Toolkit

18. Performance optimizations for scalable CFD applications on hybrid CPU+MIC heterogeneous computing system with millions of cores

19. Language Constructs and Semantics for Runtime-independent Parallelism Expression on Heterogeneous Systems

20. Accelerating supply chains with Ant Colony Optimization across range of hardware solutions

21. Parallelization and Performance of the NIM Weather Model on CPU, GPU, and MIC Processors

22. Out-of-core implementation for accelerator kernels on heterogeneous clouds

23. Modern gyrokinetic particle-in-cell simulation of fusion plasmas on top supercomputers

24. Accelerating gravitational microlensing simulations using the Xeon Phi coprocessor

25. A lightweight approach to performance portability with targetDP

26. A Cross-Platform SpMV Framework on Many-Core Architectures

27. GHOST: Building Blocks for High Performance Sparse Linear Algebra on Heterogeneous Systems

28. Performance portable C++ programming with RAJA

29. On the Portability of CPU-Accelerated Applications via Automated Source-to-Source Translation

30. A Case Study for Performance Portability Using OpenMP 4.5

31. Performance Impact of Memory Channels on Sparse and Irregular Algorithms

32. Compiling SIMT Programs on Multi- and Many-Core Processors with Wide Vector Units: A Case Study with CUDA

33. CosmoFlow: Using Deep Learning to Learn the Universe at Scale

34. Delivering Performance-Portable Stencil Computations on CPUs and GPUs Using Bricks

35. A Technique for Large-Scale 2D Seismic Field Simulations on Supercomputers

36. Bit-parallel approximate pattern matching: Kepler GPU versus Xeon Phi

37. HIPAcc: A Domain-Specific Language and Compiler for Image Processing

38. ViennaCL---Linear Algebra Library for Multi- and Many-Core Architectures

39. HSTREAM: A directive-based language extension for heterogeneous stream computing

40. SIMD Monte-Carlo Numerical Simulations Accelerated on GPU and Xeon Phi

41. Chebyshev Filter Diagonalization on Modern Manycore Processors and GPGPUs

42. Abelian: A Compiler for Graph Analytics on Distributed, Heterogeneous Platforms

43. libtropicon: A Scalable Library for Computing Intersection Points of Generic Tropical Hyper-surfaces

44. Highly Heterogeneous Smith-Waterman (HHeterSW): Exploiting heterogeneous architectures to speed-up bioinformatics algorithms

45. Optimization of Hierarchical Matrix Computation on GPU

46. Evaluation of Asynchronous Offloading Capabilities of Accelerator Programming Models for Multiple Devices

47. Numerical simulation of compressible flows on heterogeneous computational architecture

48. Monte Carlo Methods for Massively Parallel Computers

49. Revisiting Online Autotuning for Sparse-Matrix Vector Multiplication Kernels on Next-Generation Architectures

50. MILC Code Performance on High End CPU and GPU Supercomputer Clusters

Catalog

Books, media, physical & digital resources