Search

Your search keyword '"Kosson, Atli"' showing total 11 results

Search Constraints

Start Over You searched for: Author "Kosson, Atli" Remove constraint Author: "Kosson, Atli"
11 results on '"Kosson, Atli"'

Search Results

1. Analyzing & Reducing the Need for Learning Rate Warmup in GPT Training

2. Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations

3. Memory Efficient Mixed-Precision Optimizers

4. Rotational Equilibrium: How Weight Decay Balances Learning Across Neural Networks

5. Ghost Noise for Regularizing Deep Neural Networks

6. Multiplication-Free Transformer Training via Piecewise Affine Operations

7. Adaptive Braking for Mitigating Gradient Delay

8. Pipelined Backpropagation at Scale: Training Large Models without Batches

9. Online Normalization for Training Neural Networks

10. Hardware-Efficient Transformer Training via Piecewise Affine Operations

11. Rotational Optimizers: Simple & Robust DNN Training

Catalog

Books, media, physical & digital resources