Search

Your search keyword '"Schulman, A."' showing total 333 results

Search Constraints

Start Over You searched for: Author "Schulman, A." Remove constraint Author: "Schulman, A." Publication Type Reports Remove constraint Publication Type: Reports
333 results on '"Schulman, A."'

Search Results

1. Measuring short-form factuality in large language models

2. Rule Based Rewards for Language Model Safety

3. GPT-4o System Card

4. Diversity in Evolutionary Dynamics

5. An Interactive Agent Foundation Model

6. Causal Discovery under Latent Class Confounding

7. Identifiability of Product of Experts Models

8. Nuclear Pleomorphism in Canine Cutaneous Mast Cell Tumors: Comparison of Reproducibility and Prognostic Relevance between Estimates, Manual Morphometry and Algorithmic Morphometry

9. Identification of Mixtures of Discrete Product Distributions in Near-Optimal Sample and Time Complexity

10. Let's Verify Step by Step

11. The impact of local pinning sites in magnetic tunnel junctions with non-homogeneous free layers

12. Enhancing Spin Transfer Torque in Magnetic Tunnel Junction Devices: Exploring the Influence of Capping Layer Materials and Thickness on Device Characteristics

13. GPT-4 Technical Report

14. Scaling laws for single-agent reinforcement learning

15. Need for 'special' states in a deterministic theory of quantum mechanics

16. Multilayer spintronic neural networks with radio-frequency connections

17. Classification of multi-frequency RF signals by extreme learning, using magnetic tunnel junctions as neurons and synapses

18. Scaling Laws for Reward Model Overoptimization

19. Efficient Training of Language Models to Fill in the Middle

20. A simple method to reprogram the binding specificity of DNA-coated colloids that crystallize

21. Trackers Bounce Back: Measuring Evasion of Partitioned Storage in the Wild

22. Training language models to follow instructions with human feedback

23. Rosebud: Making FPGA-Accelerated Middlebox Development More Pleasant

24. Causal Inference Despite Limited Global Confounding via Mixture Models

25. WebGPT: Browser-assisted question-answering with human feedback

26. Training Verifiers to Solve Math Word Problems

27. Batch size-invariance for policy optimization

28. Unsolved Problems in ML Safety

29. ZLeaks: Passive Inference Attacks on Zigbee based Smart Homes

30. A Refined Approximation for Euclidean k-Means

31. Shapes as Product Differentiation: Neural Network Embedding in the Analysis of Markets for Fonts

32. Measuring Sample Efficiency and Generalization in Reinforcement Learning Benchmarks: NeurIPS 2020 Procgen Benchmark

33. TerraWatt: Sustaining Sustainable Computing of Containers in Containers

34. Hadamard Extensions and the Identification of Mixtures of Product Distributions

35. The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors

36. Growth and site-specific organization of micron-scale biomolecular devices on living mammalian cells

37. Source Identification for Mixtures of Product Distributions

38. Scaling Laws for Autoregressive Generative Modeling

39. Phasic Policy Gradient

40. Droplet Migration on Conical Fibers

41. The Sparse Hausdorff Moment Problem, with Application to Topic Models

42. Leveraging Procedural Generation to Benchmark Reinforcement Learning

43. Using Social Stories[Superscript TM] to Implement Cognitive Behavioral Therapy via Zoom in Parents and Their Children with Autism and Anxiety

44. Teacher Mobility in the School District of Philadelphia, 2009-10 through 2015-16

45. Edge Expansion and Spectral Gap of Nonnegative Matrices

46. Policy Gradient Search: Online Planning and Expert Iteration without Search Trees

47. Semi-Supervised Learning by Label Gradient Alignment

48. Droplets capped with an elastic film can be round, elliptical, or nearly square

49. Quantifying Generalization in Reinforcement Learning

50. LA100 Equity Strategies

Catalog

Books, media, physical & digital resources