Search

Your search for author "Stoica, A." returned 32,659 results.

Search Results

1. BlendServe: Optimizing Offline Inference for Auto-regressive Large Models with Resource-aware Batching

2. MoE-Lightning: High-Throughput MoE Inference on Memory-constrained GPUs

3. Pie: Pooling CPU Memory for LLM Inference

4. Min-Max Framework for Majorization-Minimization Algorithms in Signal Processing Applications: An Overview

5. Terahertz-permittivity of Carbon Nitrides: Revealing humidity-enhanced dielectric properties on the picosecond timescales relevant for charge carrier photogeneration

6. SkyServe: Serving AI Models across Regions and Clouds with Spot Instances

7. NEO: Saving GPU Memory Crisis with CPU Offloading for Online LLM Inference

8. Model merging with SVD to tie the Knots

9. Managing Bandwidth: The Key to Cloud-Assisted Autonomous Driving

10. How to Evaluate Reward Models for RLHF

11. JudgeBench: A Benchmark for Evaluating LLM-based Judges

12. Efficient LLM Scheduling by Learning to Rank

13. The Fairness-Quality Trade-off in Clustering

14. Post-Training Sparse Attention with Double Sparsity

15. MPC-Minimized Secure LLM Inference

16. Classical Many-Worlds Interpretation

17. Networks of Networks: Complexity Class Principles Applied to Compound AI Systems Design

18. Sentient observers and the ontology of spacetime

19. RouteLLM: Learning to Route LLMs with Preference Data

20. Fairness in Social Influence Maximization via Optimal Transport

21. Optical Control of Adaptive Nanoscale Domain Networks

22. Optimizing Speculative Decoding for Serving Large Language Models Using Goodput

23. From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline

24. High-precision measurements of the atomic mass and electron-capture decay $Q$ value of $^{95}$Tc

25. Causal Inference from Competing Treatments

26. OR-Bench: An Over-Refusal Benchmark for Large Language Models

27. Crafting Interpretable Embeddings by Asking LLMs Questions

28. Certified Inventory Control of Critical Resources

29. Stylus: Automatic Adapter Selection for Diffusion Models

30. Stability of the regular $n$-gon rotating equilibria with logarithm interaction

31. Mélange: Cost Efficient Large Language Model Serving by Exploiting GPU Heterogeneity

32. GoEX: Perspectives and Designs Towards a Runtime for Autonomous LLM Applications

33. Can we accurately read or write quantum data?

34. Trustless Audits without Revealing Data or Models

35. MuxServe: Flexible Spatial-Temporal Multiplexing for Multiple LLM Serving

38. Non-equilibrium pathways to emergent polar supertextures

39. RAFT: Adapting Language Model to Domain Specific RAG

40. depyf: Open the Opaque Box of PyTorch Compiler for Machine Learning Researchers

41. LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

42. Optimizing LLM Queries in Relational Workloads

43. Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

44. Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems

45. Fairness Rising from the Ranks: HITS and PageRank on Homophilic Networks

46. Theoretical analysis and predictions for the double electron capture of $^{124}$Xe

47. Hidden domain boundary dynamics towards crystalline perfection

48. Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

49. Does the Hamiltonian determine the tensor product structure and the 3d space?
