Search

Your search keyword '"Kakade, Sham"' showing total 558 results

Search Constraints

Start Over You searched for: Author "Kakade, Sham" Remove constraint Author: "Kakade, Sham"
558 results on '"Kakade, Sham"'

Search Results

1. LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks

2. Flash Inference: Near Linear Time Inference for Long Convolution Sequence Models and Beyond

3. Neural Coordination and Capacity Control for Inventory Management

4. SOAP: Improving and Stabilizing Shampoo using Adam

5. Multi-Agent Reinforcement Learning from Human Feedback: Data Coverage and Algorithmic Techniques

6. Deconstructing What Makes a Good Optimizer for Language Models

7. Universal Length Generalization with Turing Programs

8. Eliminating Position Bias of Language Models: A Mechanistic Approach

9. A New Perspective on Shampoo's Preconditioner

10. DataComp-LM: In search of the next generation of training sets for language models

11. Transcendence: Generative Models Can Outperform The Experts That Train Them

12. CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training

13. Scaling Laws in Linear Regression: Compute, Parameters, and Data

14. Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass

15. Matching the Statistical Query Lower Bound for k-sparse Parity Problems with Stochastic Gradient Descent

16. Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems

17. Q-Probe: A Lightweight Approach to Reward Maximization for Language Models

18. Repeat After Me: Transformers are Better than State Space Models at Copying

19. A Study on the Calibration of In-context Learning

20. Feature emergence via margin maximization: case studies in algebraic tasks

21. Learning an Inventory Control Policy with General Inventory Arrival Dynamics

22. MatFormer: Nested Transformer for Elastic Inference

23. Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck

24. Scaling Laws for Imitation Learning in Single-Agent Games

25. Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning

26. AdANNS: A Framework for Adaptive Semantic Search

27. Modified Gauss-Newton Algorithms under Noise

28. Hardness of Independent Learning and Sparse Equilibrium Computation in Markov Games

29. Finite-Sample Analysis of Learning High-Dimensional Single ReLU Neuron

30. Learning Hidden Markov Models Using Conditional Samples

31. On Provable Copyright Protection for Generative Models

32. Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity

33. The Role of Coverage in Online Reinforcement Learning

34. Deep Inventory Management

35. Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms

36. The Power and Limitation of Pretraining-Finetuning for Linear Regression under Covariate Shift

37. Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit

38. Matryoshka Representation Learning

39. A Complete Characterization of Linear Estimators for Offline Policy Evaluation

40. Risk Bounds of Multi-Pass SGD for Least Squares in the Interpolation Regime

41. Understanding Contrastive Learning Requires Incorporating Inductive Biases

42. Multi-Stage Episodic Control for Strategic Exploration in Text Games

43. The Statistical Complexity of Interactive Decision Making

44. Anti-Concentrated Confidence Bonuses for Scalable Exploration

45. Inductive Biases and Variable Creation in Self-Attention Mechanisms

46. Last Iterate Risk Bounds of SGD with Decaying Stepsize for Overparameterized Linear Regression

47. Sparsity in Partially Controllable Linear Systems

48. The Benefits of Implicit Regularization from SGD in Least Squares Problems

49. Going Beyond Linear RL: Sample Efficient Neural Function Approximation

50. Optimal Gradient-based Algorithms for Non-concave Bandit Optimization

Catalog

Books, media, physical & digital resources