Search

Your search keyword '"Bhatele, Abhinav"' showing total 331 results

Search Constraints

Start Over You searched for: Author "Bhatele, Abhinav" Remove constraint Author: "Bhatele, Abhinav"
331 results on '"Bhatele, Abhinav"'

Search Results

1. From Pixels to Prose: A Large Dataset of Dense Image Captions

2. Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs

3. Loki: Low-rank Keys for Efficient Sparse Attention

4. Transformers Can Do Arithmetic with the Right Embeddings

5. Performance-Aligned LLMs for Generating Fast Code

6. Taking GPU Programming Models to Task for Performance Portability

7. Automated Programmatic Performance Analysis of Parallel Programs

8. Can Large Language Models Write Parallel Code?

9. ML-based Modeling to Predict I/O Performance on Different Storage Sub-systems

10. Jorge: Approximate Preconditioning for GPU-efficient Second-order Optimization

11. HPC-Coder: Modeling Parallel Programs using Large Language Models

12. Pipit: Scripting the analysis of parallel execution traces

13. A 4D Hybrid Algorithm to Scale Parallel Training to Thousands of GPUs

14. A Hybrid Tensor-Expert-Data Parallelism Approach to Optimize Mixture-of-Experts Training

15. Exploiting Sparsity in Pruned Neural Networks to Optimize Large Model Training

16. Design Concerns for Integrated Scripting and Interactive Visualization in Notebook Environments

17. A Survey and Empirical Evaluation of Parallel Deep Learning Frameworks

18. AxoNN: An asynchronous, message-driven parallel framework for extreme-scale deep learning

19. Analytics of Longitudinal System Monitoring Data for Performance Prediction

20. Scalable Comparative Visualization of Ensembles of Call Graphs

21. Comparative Evaluation of Call Graph Generation by Profiling Tools

22. Visual Analytics Challenges in Analyzing Calling Context Trees

25. Design Concerns for Integrated Scripting and Interactive Visualization in Notebook Environments

26. A Large-Scale Epidemic Simulation Framework for Realistic Social Contact Networks

30. Pipit: Enabling programmatic analysis of parallel execution traces

31. Preliminary Evaluation of a Parallel Trace Replay Tool for HPC Network Simulations

36. A Hybrid Tensor-Expert-Data Parallelism Approach to Optimize Mixture-of-Experts Training

37. Modeling Parallel Programs using Large Language Models

38. Communication-minimizing Asynchronous Tensor Parallelism

42. A Case Study of Communication Optimizations on 3D Mesh Interconnects

44. Designing an Interactive, Notebook-Embedded, Tree Visualization to Support Exploratory Performance Analysis

45. AxoNN: An asynchronous, message-driven parallel framework for extreme-scale deep learning

50. Understanding and Mitigating Network Interference on High-Performance Computing Systems

Catalog

Books, media, physical & digital resources