14 results on '"Enrique S. Quintana-Ortí"'
Search Results
2. Anatomy of the BLIS Family of Algorithms for Matrix Multiplication.
3. Towards Portable Realizations of Winograd-based Convolution with Vector Intrinsics and OpenMP.
4. High Performance and Energy Efficient Integer Matrix Multiplication for Deep Learning.
5. Evaluation of MPI Allreduce for Distributed Training of Convolutional Neural Networks.
6. Performance Modeling for Distributed Training of Convolutional Neural Networks.
7. Fast Blocking of Householder Reflectors on Graphics Processors.
8. Analysis of Strategies to Save Energy for Message-Passing Dense Linear Algebra Kernels.
9. Saving Energy in the LU Factorization with Partial Pivoting on Multi-core Processors.
10. Applying OOC Techniques in the Reduction to Condensed Form for Very Large Symmetric Eigenproblems on GPUs.
11. High Performance Matrix Inversion on a Multi-core Platform with Several GPUs.
12. Scheduling of QR Factorization Algorithms on SMP and Multi-Core Architectures.
13. Parallelization of GSL: The Web Service Interface.
14. A tool-kit for the design and simulation of systolic algorithms.
Catalog
Books, media, physical & digital resources
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.