Search

Your search keyword '"Khalman, Misha"' showing total 10 results

Search Constraints

Start Over You searched for: Author "Khalman, Misha" Remove constraint Author: "Khalman, Misha"
10 results on '"Khalman, Misha"'

Search Results

1. Building Math Agents with Multi-Turn Iterative Preference Learning

2. Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

3. Direct Language Model Alignment from Online AI Feedback

4. LiPO: Listwise Preference Optimization through Learning-to-Rank

5. Gemini: A Family of Highly Capable Multimodal Models

6. Calibrating Likelihoods towards Consistency in Summarization Models

7. Statistical Rejection Sampling Improves Preference Optimization

8. SLiC-HF: Sequence Likelihood Calibration with Human Feedback

9. Calibrating Sequence likelihood Improves Conditional Language Generation

Catalog

Books, media, physical & digital resources