Search

Your search keyword '"KUMAR, SANJIV"' showing total 2,059 results

Search Constraints

Start Over You searched for: Author "KUMAR, SANJIV" Remove constraint Author: "KUMAR, SANJIV"
2,059 results on '"KUMAR, SANJIV"'

Search Results

7. Promises and Pitfalls of Generative Masked Language Modeling: Theoretical Framework and Practical Guidelines

8. Efficient Document Ranking with Learnable Late Interactions

9. Landscape-Aware Growing: The Power of a Little LAG

10. Faster Cascades via Speculative Decoding

11. Language Model Cascades: Token-level uncertainty and beyond

12. Exploring and Improving Drafts in Blockwise Parallel Decoding

13. SOAR: Improved Indexing for Approximate Nearest Neighbor Search

14. Metric-aware LLM inference for regression and scoring

15. HiRE: High Recall Approximate Top-$k$ Estimation for Efficient LLM Inference

16. Tandem Transformers for Inference Efficient LLMs

17. Efficient Stagewise Pretraining via Progressive Subnetworks

18. SpacTor-T5: Pre-training T5 Models with Span Corruption and Replaced Token Detection

23. A Weighted K-Center Algorithm for Data Subset Selection

24. ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent

25. Rethinking FID: Towards a Better Evaluation Metric for Image Generation

26. It's an Alignment, Not a Trade-off: Revisiting Bias and Variance in Deep Models

27. DistillSpec: Improving Speculative Decoding via Knowledge Distillation

28. What do larger image classifiers memorise?

29. Functional Interpolation for Relative Positions Improves Long Context Transformers

30. Think before you speak: Training Language Models With Pause Tokens

40. MarkovGen: Structured Prediction for Efficient Text-to-Image Generation

41. When Does Confidence-Based Cascade Deferral Suffice?

43. Depth Dependence of $\mu$P Learning Rates in ReLU MLPs

46. ResMem: Learn what you can and memorize the rest

47. On student-teacher deviations in distillation: does it pay to disobey?

48. Plugin estimators for selective classification with out-of-distribution detection

49. Supervision Complexity and its Role in Knowledge Distillation

50. Leveraging Importance Weights in Subset Selection

Catalog

Books, media, physical & digital resources