98 results on '"Paul H. J. Kelly"'
Search Results
2. PCQ: Parallel Compact Quantum Circuit Simulation.
3. A shared compilation stack for distributed-memory parallelism in stencil DSLs.
4. Temporal blocking of finite-difference stencil operators with sparse 'off-the-grid' sources.
5. Simodense: a RISC-V softcore optimised for exploring custom SIMD instructions.
6. PathBench: A Benchmarking Platform for Classical and Learned Path Planning Algorithms.
7. BIT-VO: Visual Odometry at 300 FPS using Binary Features from the Focal Plane.
8. Scalable Uncertainty for Computer Vision With Functional Variational Inference.
9. Cain: Automatic Code Generation for Simultaneous Convolutional Kernels on Focal-plane Sensor-processors.
10. SLAMBench 3.0: Systematic Automated Reproducible Evaluation of SLAM Systems for Robot Vision Challenges and Scene Understanding.
11. Characterizing Visual Localization and Mapping Datasets.
12. Adaptive-Resolution Octree-Based Volumetric SLAM.
13. Investigating automatic vectorization for real-time 3D scene understanding.
14. SLAMBench2: Multi-Objective Head-to-Head Benchmarking for Visual SLAM.
15. Algorithmic Performance-Accuracy Trade-off in 3D Vision Applications Using HyperMapper.
16. Application-oriented design space exploration for SLAM algorithms.
17. Demonstrating custom SIMD instruction development for a RISC-V softcore.
18. Integrating Algorithmic Parameters into Benchmarking and Design Space Exploration in 3D Scene Understanding.
19. Diplomat: Mapping of Multi-kernel Applications Using a Static Dataflow Abstraction.
20. Comparative design space exploration of dense and semi-dense SLAM.
21. Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL.
22. A Fast and Scalable Graph Coloring Algorithm for Multi-core and Many-core Architectures.
23. Introducing SLAMBench, a performance and accuracy benchmarking methodology for SLAM.
24. Generalizing Run-Time Tiling with the Loop Chain Abstraction.
25. Dense planar SLAM.
26. Towards In-Situ Vortex Identification for Peta-Scale CFD Using Contour Trees.
27. Algorithmic Performance-Accuracy Trade-off in 3D Vision Applications.
28. Loop Chaining: A Programming Abstraction for Balancing Locality and Parallelism.
29. Split tiling for GPUs: automatic parallelization using trapezoidal tiles.
30. SLAM++: Simultaneous Localisation and Mapping at the Level of Objects.
31. Barrier invariants: a shared state abstraction for the analysis of data-dependent GPU kernels.
32. Parametric GPU Code Generation for Affine Loop Programs.
33. Computationally unifying urban masterplanning.
34. Performance-Portable Finite Element Assembly Using PyOP2 and FEniCS.
35. An Analytical Study of Loop Tiling for a Large-Scale Unstructured Mesh Application.
36. PyOP2: A High-Level Framework for Performance-Portable Simulations on Unstructured Meshes.
37. Compiler Optimizations for Industrial Unstructured Mesh CFD Applications on GPUs.
38. Mesh independent loop fusion for unstructured mesh applications.
39. Symbolic crosschecking of floating-point and SIMD code.
40. Design and Performance of the OP2 Library for Unstructured Mesh Applications.
41. Accelerating Anisotropic Mesh Adaptivity on nVIDIA's CUDA Using Texture Interpolation.
42. Symbolic Testing of OpenCL Code.
43. A batch algorithm for maintaining a topological order.
44. Deriving Efficient Data Movement from Decoupled Access/Execute Specifications.
45. Towards Metaprogramming for Parallel Systems on a Chip.
46. High-performance SIMT code generation in an active visual effects library.
47. Explicit Dependence Metadata in an Active Visual Effects Library.
48. A Declarative Framework for Analysis and Optimization.
49. A Domain-Specific Interpreter for Parallelizing a Large Mixed-Language Visualisation Application.
50. Minimizing Associativity Conflicts in Morton Layout.
Catalog
Books, media, physical & digital resources
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.