Your search for author "Chu, Ching-Hsiang" returned 44 results in total.

Search Results

1. The Llama 3 Herd of Models

2. Accelerating Communication in Deep Learning Recommendation Model Training with Dual-Level Adaptive Lossy Compression

3. Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale Recommendation

5. Software-Hardware Co-design for Fast and Scalable Training of Deep Learning Recommendation Models

6. Scalable Distributed DNN Training using TensorFlow and CUDA-Aware MPI: Characterization, Designs, and Performance Evaluation

7. Optimized Broadcast for Deep Learning Workloads on Dense-GPU InfiniBand Clusters: MPI or NCCL?

8. Designing a ROCm-Aware MPI Library for AMD GPUs: Early Experiences

10. Performance Evaluation of MPI Libraries on GPU-Enabled OpenPOWER Architectures: Early Experiences

11. Designing High-Performance In-Memory Key-Value Operations with Persistent GPU Kernels and OpenSHMEM

16. Software-hardware co-design for fast and scalable training of deep learning recommendation models

23. NV-group

44. Accelerator-enabled Communication Middleware for Large-scale Heterogeneous HPC Systems with Modern Interconnects
