Search

Your search keyword '"Mohan, Jayashree"' showing total 15 results

Search Constraints

Start Over You searched for: Author "Mohan, Jayashree" Remove constraint Author: "Mohan, Jayashree" Publication Type Reports Remove constraint Publication Type: Reports
15 results on '"Mohan, Jayashree"'

Search Results

1. Towards Efficient Large Multimodal Model Serving

2. POD-Attention: Unlocking Full Prefill-Decode Overlap for Faster LLM Inference

3. ASTRA: Accurate and Scalable ANNS-based Training of Extreme Classifiers

4. Etalon: Holistic Performance Evaluation Framework for LLM Inference Systems

5. Vidur: A Large-Scale Simulation Framework For LLM Inference

6. vAttention: Dynamic Memory Management for Serving LLMs without PagedAttention

7. Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve

8. SARATHI: Efficient LLM Inference by Piggybacking Decodes with Chunked Prefills

9. Synergy: Resource Sensitive DNN Scheduling in Multi-Tenant Clusters

10. Memory Optimization for Deep Networks

11. Analyzing and Mitigating Data Stalls in DNN Training

12. RECIPE : Converting Concurrent DRAM Indexes to Persistent-Memory Indexes

13. Analyzing GDPR Compliance Through the Lens of Privacy Policy

14. Finding Crash-Consistency Bugs with Bounded Black-Box Crash Testing

15. Analyzing IO Amplification in Linux File Systems

Catalog

Books, media, physical & digital resources