152 results for "Oreste Villa"
Search Results
2. FinePack: Transparently Improving the Efficiency of Fine-Grained Transfers in Multi-GPU Systems.
3. NVBitFI: Dynamic Fault Injection for GPUs.
4. Need for Speed: Experiences Building a Trustworthy System-Level GPU Simulator.
5. HMG: Extending Cache Coherence Protocols Across Modern Hierarchical Multi-GPU Systems.
6. NVBit: A Dynamic Binary Instrumentation Framework for NVIDIA GPUs.
7. Combining HW/SW Mechanisms to Improve NUMA Performance of Multi-GPU Systems.
8. Beyond the socket: NUMA-aware GPUs.
9. MCM-GPU: Multi-Chip-Module GPUs for Continued Performance Scalability.
10. Exploring Efficient Hardware Support for Applications with Irregular Memory Patterns on Multinode Manycore Architectures.
11. High-Performance, Distributed Dictionary Encoding of RDF Datasets.
12. Scaling Irregular Applications through Data Aggregation and Software Multithreading.
13. Scaling the Power Wall: A Path to Exascale.
14. Special Issue on: Systems for Learning, Inferencing, and Discovering (SLID).
15. In-Memory Graph Databases for Web-Scale Data.
16. Designing Efficient Heterogeneous Memory Architectures.
17. Accelerating subsurface transport simulation on heterogeneous clusters.
18. Power/Performance Trade-Offs of Small Batched LU Based Solvers on GPUs.
19. Exploring hardware support for scaling irregular applications on multi-node multi-core architectures.
20. Composing Data Parallel Code for a SPARQL Graph Engine.
21. YAPPA: A compiler-based parallelization framework for irregular applications on MPSoCs.
22. Toward a data scalable solution for facilitating discovery of scientific data resources.
23. Accelerating semantic graph databases on commodity clusters.
24. Prototyping hardware support for irregular applications.
25. Exploiting points-to maps for de-/serialization code generation.
26. A Bandwidth-Optimized Multi-core Architecture for Irregular Applications.
27. Efficient Sorting on the Tilera Manycore Architecture.
28. A High Performance Computing Network and System Simulator for the Power Grid: NGNS^2.
29. Exploring Fine-Grained Task-Based Execution on Multi-GPU Systems.
30. Contention Modeling for Multithreaded Distributed Shared Memory Machines: The Cray XMT.
31. Experiences with String Matching on the Fermi Architecture.
32. Special Issue on Theory and Practice of Irregular Applications (TaPIA).
33. Acceleration of Streamed Tensor Contraction Expressions on GPGPU-Based Clusters.
34. Dynamic load balancing on single- and multi-GPU systems.
35. Accelerating DNA analysis applications on GPU clusters.
36. High performance Molecular Dynamic simulation on single and multi-GPU systems.
37. Input-independent, scalable and fast string matching on the Cray XMT.
38. Scalable transparent checkpoint-restart of global address space applications on virtual machines over infiniband.
39. Scaling Semantic Graph Databases in Size and Performance.
40. Toward a data scalable solution for facilitating discovery of science resources.
41. High-speed string searching against large dictionaries on the Cell/B.E. Processor.
42. Efficiency and scalability of barrier synchronization on NoC based many-core architectures.
43. A Modular Approach to Model Heterogeneous MPSoC at Cycle Level.
44. Exact multi-pattern string matching on the cell/b.e. processor.
45. Transparent system-level migration of PGAS applications using Xen on InfiniBand.
46. Peak-Performance DFA-based String Matching on the Cell Processor.
47. Challenges in Mapping Graph Exploration Algorithms on Advanced Multi-core Processors.
48. Exploration of Distributed Shared Memory Architectures for NoC-based Multiprocessors.
49. Power/performance hardware optimization for synchronization intensive applications in MPSoCs.
50. Optimizing tensor contraction expressions for hybrid CPU-GPU execution.
Discovery Service for Jio Institute Digital Library