Search

Your search keyword '"Hilton, Jacob"' showing total 119 results

Search Constraints

Start Over You searched for: Author "Hilton, Jacob" Remove constraint Author: "Hilton, Jacob"
119 results on '"Hilton, Jacob"'

Search Results

1. Estimating the Probabilities of Rare Outputs in Language Models

2. Towards a Law of Iterated Expectations for Heuristic Estimators

3. Backdoor defense, learnability and obfuscation

4. Scaling laws for single-agent reinforcement learning

5. Scaling Laws for Reward Model Overoptimization

6. Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

7. Teaching Models to Express Their Uncertainty in Words

8. Training language models to follow instructions with human feedback

9. WebGPT: Browser-assisted question-answering with human feedback

10. Training Verifiers to Solve Math Word Problems

11. Batch size-invariance for policy optimization

12. TruthfulQA: Measuring How Models Mimic Human Falsehoods

13. Measuring Sample Efficiency and Generalization in Reinforcement Learning Benchmarks: NeurIPS 2020 Procgen Benchmark

14. Phasic Policy Gradient

15. Leveraging Procedural Generation to Benchmark Reinforcement Learning

16. Topological Ramsey numbers and countable ordinals

17. Combinatorics of countable ordinal topologies

19. The topological pigeonhole principle for ordinals

31. Understanding RL vision

37. Structure-Based Design of Novel Biphenyl Amide Antagonists of Human Transient Receptor Potential Cation Channel Subfamily M Member 8 Channels with Potential Implications in the Treatment of Sensory Neuropathies

50. Barely touch the blues: A novel

Catalog

Books, media, physical & digital resources