473 results on '"Mitsuhisa Sato"'
Search Results
2. Extremely Scalable Spiking Neuronal Network Simulation Code: From Laptops to Exascale Computers
3. An Overview on Mixing MPI and OpenMP Dependent Tasking on A64FX.
4. Enhancing the Parallel UC2B Framework: Approach Validation and Scalability Study.
5. Advancements in Traffic Simulations with multiMATSim's Distributed Framework.
6. Performance Evaluation of a Hybrid Programming Model for RSDFT on T2K Open Supercomputer
7. OpenACC Execution Models for Manycore Processor with ARM SVE.
8. OpenACC Unified Programming Environment for Multi-hybrid Acceleration with GPU and FPGA.
9. Performance analysis of a state vector quantum circuit simulation on A64FX processor.
10. Performance tuning of the Helmholtz matrix-vector product kernel in the computational fluid dynamics solver Nek5000/RS for the A64FX processor.
11. Design and Performance Evaluation of UCX for Tofu-D Interconnect with OpenSHMEM-UCX on Fugaku.
12. Pushing the Frontier in the Design of Laser-Based Electron Accelerators with Groundbreaking Mesh-Refined Particle-In-Cell Simulations on Exascale-Class Supercomputers.
13. Scaling the PageRank Algorithm for Very Large Graphs on the Fugaku Supercomputer.
14. Performance improvement by enhancing spatial parallelism on FPGA for HPC applications.
15. Sequences of Sparse Matrix-Vector Multiplication on Fugaku's A64FX processors.
16. Performance Evaluation and Analysis of A64FX many-core Processor for the Fiber Miniapp Suite.
17. Evaluation of SPEC CPU and SPEC OMP on the A64FX.
18. Power/Performance/Area Evaluations for Next-Generation HPC Processors using the A64FX Chip.
19. Performance of the Supercomputer Fugaku for Breadth-First Search in Graph500 Benchmark.
20. Evaluation of Power Management Control on the Supercomputer Fugaku.
21. Preliminary Performance Evaluation of the Fujitsu A64FX Using HPC Applications.
22. Accuracy Improvement of Memory System Simulation for Modern Shared Memory Processor.
23. Parallelization of All-Pairs-Shortest-Path Algorithms in Unweighted Graph.
24. The Supercomputer 'Fugaku' and Arm-SVE enabled A64FX processor for energy-efficiency and sustained application performance.
25. OpenMP Task Generation for Batched Kernel APIs.
26. A Method for Order/Degree Problem Based on Graph Symmetry and Simulated Annealing with MPI/OpenMP Parallelization.
27. Distributed and Parallel Programming Paradigms on the K computer and a Cluster.
28. Multi-accelerator extension in OpenMP based on PGAS model.
29. Scalable communication performance prediction using auto-generated pseudo MPI event trace.
30. The Supercomputer 'Fugaku'.
31. MACC: An OpenACC Transpiler for Automatic Multi-GPU Use.
32. The Impact of Taskyield on the Design of Tasks Communicating Through MPI.
33. Trade-Off of Offloading to FPGA in OpenMP Task-Based Programming.
34. Multi-tasking Execution in PGAS Language XcalableMP and Communication Optimization on Many-core Clusters.
35. Performance evaluation for omni XcalableMP compiler on many-core cluster system based on knights landing.
36. Performance evaluation for a hydrodynamics application in XcalableACC PGAS language for accelerated clusters.
37. A Source-to-Source Translation of Coarray Fortran with MPI for High Performance.
38. Linkage of XcalableMP and Python languages for high productivity on HPC cluster system: application to graph order/degree problem.
39. InKS, a Programming Model to Decouple Performance from Algorithm in HPC Codes.
40. Design of Data Management for Multi SPMD Workflow Programming Model.
41. High-productivity Programming and Optimization Framework for Stream Processing on FPGA.
42. Metaprogramming Framework for Existing HPC Languages Based on the Omni Compiler Infrastructure.
43. Evaluation of Performance and Power Consumption on Supercomputer Fugaku Using SPEC HPC Benchmarks
44. A Performance Projection of Mini-Applications onto Benchmarks Toward the Performance Projection of Real-Applications.
45. Preliminary Performance Evaluation of Application Kernels Using ARM SVE with Multiple Vector Lengths.
46. Implementing Lattice QCD Application with XcalableACC Language on Accelerated Cluster.
47. Extending OpenMP SIMD Support for Target Specific Code and Application to ARM SVE.
48. Implementation and Evaluation of One-sided PGAS Communication in XcalableACC for Accelerated Clusters.
49. Preliminary Performance Evaluation of Coarray-based Implementation of Fiber Miniapp Suite using XcalableMP PGAS Language.
50. Performance Evaluation of Supercomputer Fugaku using Breadth-First Search Benchmark in Graph500.
Catalog
Books, media, physical & digital resources
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.