Search

Your search keyword '"Swersky A"' showing total 352 results

Search Constraints

Start Over You searched for: Author "Swersky A" Remove constraint Author: "Swersky A"
352 results on '"Swersky A"'

Search Results

1. Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability

2. Exploring and Benchmarking the Planning Capabilities of Large Language Models

3. Long-Span Question-Answering: Automatic Question Generation and QA-System Ranking via Side-by-Side Evaluation

4. Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models

5. Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

6. Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

7. Frontier Language Models are not Robust to Adversarial Arithmetic, or 'What do I need to say so you agree 2+2=5?

8. Directly Fine-Tuning Diffusion Models on Differentiable Rewards

9. Low-Variance Gradient Estimation in Unrolled Computation Graphs with ES-Single

10. Towards Better Out-of-Distribution Generalization of Neural Algorithmic Reasoning Tasks

11. CUF: Continuous Upsampling Filters

13. Learning to Improve Code Efficiency

14. Pre-training helps Bayesian optimization too

16. Data-Driven Offline Optimization For Architecting Hardware Accelerators

17. Pre-trained Gaussian Processes for Bayesian Optimization

19. Two Sides of the Same Coin: Heterophily and Oversmoothing in Graph Convolutional Neural Networks

20. Oops I Took A Gradient: Scalable Sampling for Discrete Distributions

21. Apollo: Transferable Architecture Exploration

22. Human 3D keypoints via spatial uncertainty modeling

24. No MCMC for me: Amortized sampling for fast and stable training of energy-based models

25. Learned Hardware/Software Co-Design of Neural Accelerators

26. Optimizing Long-term Social Welfare in Recommender Systems: A Constrained Matching Approach

27. An Imitation Learning Approach for Cache Replacement

28. Big Self-Supervised Models are Strong Semi-Supervised Learners

29. Neural Execution Engines: Learning to Execute Subroutines

30. SentenceMIM: A Latent Variable Language Model

31. Your Classifier is Secretly an Energy Based Model and You Should Treat it Like One

33. MIM: Mutual Information Machine

34. High Mutual Information in Representation Learning with Symmetric Variational Inference

35. Learning Execution through Neural Code Fusion

36. Flexibly Fair Representation Learning by Disentanglement

37. Learning Sparse Networks Using Targeted Dropout

38. Graph Normalizing Flows

39. Neural Networks for Modeling Source Code Edits

40. Meta-Dataset: A Dataset of Datasets for Learning to Learn from Few Examples

42. Learning Memory Access Patterns

43. Meta-Learning for Semi-Supervised Few-Shot Classification

44. An online sequence-to-sequence model for noisy speech recognition

45. Learning Hard Alignments with Variational Inference

46. Prototypical Networks for Few-shot Learning

Catalog

Books, media, physical & digital resources