Search

Your search keyword '"Oliaro, Gabriele"' showing total 12 results

Search Constraints

Start Over You searched for: Author "Oliaro, Gabriele" Remove constraint Author: "Oliaro, Gabriele"
12 results on '"Oliaro, Gabriele"'

Search Results

1. SuffixDecoding: A Model-Free Approach to Speeding Up Large Language Model Inference

2. Optimal Kernel Orchestration for Tensor Programs with Korch

3. FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning

4. Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems

5. SpecInfer: Accelerating Generative Large Language Model Serving with Tree-based Speculative Inference and Verification

6. Direct Telemetry Access

7. Zero-CPU Collection with Direct Telemetry Access

8. Optimal Kernel Orchestration for Tensor Programs with Korch

9. Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models

10. Direct Telemetry Access

11. SpecInfer: Accelerating Generative LLM Serving with Speculative Inference and Token Tree Verification

Catalog

Books, media, physical & digital resources