Search

Search keyword "Xu, Kelvin": 41 results in total

Search Constraints

Author: "Xu, Kelvin"; Publication Year Range: Last 10 years

Search Results

1. Michelangelo: Long Context Evaluations Beyond Haystacks via Latent Structure Queries

2. Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability

3. Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

4. Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

5. Gemini: A Family of Highly Capable Multimodal Models

6. Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

7. Frontier Language Models are not Robust to Adversarial Arithmetic, or 'What do I need to say so you agree 2+2=5?'

8. Small-scale proxies for large-scale Transformer training instabilities

9. PaLM 2 Technical Report

10. Multi-objective Optimization Model for Momentum Change Based on Genetic Algorithm

11. Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance

12. Autonomous Reinforcement Learning: Formalism and Benchmarking

13. Reset-Free Reinforcement Learning via Multi-Task Learning: Learning Dexterous Manipulation Behaviors without Human Intervention

14. Continual Learning of Control Primitives: Skill Discovery via Reset-Games

16. Meta-Dataset: A Dataset of Datasets for Learning to Learn from Few Examples

17. Probabilistic Model-Agnostic Meta-Learning

18. Learning a Prior over Intent via Meta-Inverse Reinforcement Learning

19. Trust-PCL: An Off-Policy Trust Region Method for Continuous Control

20. Bridging the Gap Between Value and Policy Based Reinforcement Learning

21. Unsupervised Perceptual Rewards for Imitation Learning

22. An Actor-Critic Algorithm for Sequence Prediction

23. Theano: A Python framework for fast computation of mathematical expressions

24. A Controller-Recognizer Framework: How necessary is recognition for control?

25. On Using Monolingual Corpora in Neural Machine Translation

26. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

28. Towards Adaptive, Continual Embodied Agents

30. LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models

35. Exploring Attention Based Model for Captioning Images

41. Structural basis for corepressor assembly by the orphan nuclear receptor TLX.
