Adam Optimizer
Adaptive learning rates with momentum for deep learning
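A minimal sketch of one Adam update in plain Python (no framework; the hyperparameter defaults follow the original paper). It keeps moving averages of the gradient (`m`, momentum) and squared gradient (`v`, per-parameter scale), bias-corrects both, and steps each parameter by an adaptively scaled amount:

```python
def adam_step(params, grads, m, v, t, lr=1e-3, b1=0.9, b2=0.999, eps=1e-8):
    """One Adam update over parallel lists of scalars.

    m: first-moment (momentum) estimates; v: second-moment estimates;
    t: 1-based step count, used for bias correction."""
    new_params, new_m, new_v = [], [], []
    for p, g, mi, vi in zip(params, grads, m, v):
        mi = b1 * mi + (1 - b1) * g        # momentum: moving average of g
        vi = b2 * vi + (1 - b2) * g * g    # moving average of g^2
        m_hat = mi / (1 - b1 ** t)         # bias correction (both averages
        v_hat = vi / (1 - b2 ** t)         #  start at zero)
        new_params.append(p - lr * m_hat / (v_hat ** 0.5 + eps))
        new_m.append(mi)
        new_v.append(vi)
    return new_params, new_m, new_v
```

Minimizing f(x) = x² with this step drives x toward 0; the adaptive scaling makes early steps roughly `lr` in size regardless of the gradient's magnitude.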
Backpropagation
The algorithm that enables neural networks to learn by computing gradients efficiently
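A tiny worked example of the chain rule behind backpropagation, on a 1-1-1 network h = tanh(w1·x), y = w2·h. The gradient pushed back through each layer can be checked against finite differences:

```python
import math

def forward(x, w1, w2):
    """Tiny network: h = tanh(w1 * x), y = w2 * h."""
    h = math.tanh(w1 * x)
    return h, w2 * h

def backward(x, w1, w2, h, dy):
    """Backprop: propagate dL/dy back through both layers via the chain rule."""
    dw2 = dy * h                    # y = w2 * h  =>  dy/dw2 = h
    dh = dy * w2                    # dy/dh = w2
    dw1 = dh * (1 - h * h) * x      # tanh'(z) = 1 - tanh(z)^2, z = w1 * x
    return dw1, dw2
```

For a squared-error loss L = ½(y − target)², the upstream gradient is simply `dy = y - target`, and the analytic gradients agree with numerical ones to high precision.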
Batch Normalization
Normalizing layer inputs to accelerate deep network training
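The core normalization, sketched for a single feature over a batch (the trainable scale `gamma` and shift `beta`, and the running statistics used at inference time, are simplified away here):

```python
def batch_norm(xs, gamma=1.0, beta=0.0, eps=1e-5):
    """Normalize a batch of activations to zero mean / unit variance,
    then apply a learnable scale (gamma) and shift (beta)."""
    n = len(xs)
    mean = sum(xs) / n
    var = sum((x - mean) ** 2 for x in xs) / n
    return [gamma * (x - mean) / (var + eps) ** 0.5 + beta for x in xs]
```

With `gamma=1, beta=0` the output batch has mean ≈ 0 and variance ≈ 1, which keeps layer inputs in a well-conditioned range as training proceeds.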
Dropout: Regularization for Neural Networks
Randomly dropping units during training to prevent overfitting
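A sketch of the standard "inverted dropout" formulation: units are zeroed with probability p during training, and the survivors are scaled by 1/(1−p) so the expected activation is unchanged and no rescaling is needed at test time:

```python
import random

def dropout(xs, p=0.5, training=True, rng=random):
    """Inverted dropout: zero each unit with probability p during training
    and scale survivors by 1/(1-p); identity at evaluation time."""
    if not training or p == 0.0:
        return list(xs)
    return [0.0 if rng.random() < p else x / (1 - p) for x in xs]
```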
Maximum Likelihood Reinforcement Learning (MaxRL)
A recent idea for training models on pass/fail tasks in which the sampling distribution matters
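The source gives no algorithmic detail, so the following is only a loose illustration of one reading of "maximum likelihood" on pass/fail tasks: sample from the model, and take a maximum-likelihood (cross-entropy) step only on samples that pass, ignoring failures rather than pushing them down as a vanilla policy gradient would. The toy softmax "policy" below is an assumption for illustration, not the published method:

```python
import math, random

def softmax(logits):
    m = max(logits)
    exps = [math.exp(l - m) for l in logits]
    z = sum(exps)
    return [e / z for e in exps]

def maxrl_step(logits, passes, lr=0.5, rng=random):
    """Hedged MaxRL-style update (an assumption, not the published method):
    sample an action from the softmax policy; if it passes, take a
    maximum-likelihood gradient step toward it; if it fails, do nothing."""
    probs = softmax(logits)
    a = rng.choices(range(len(logits)), weights=probs)[0]
    if passes(a):
        # grad of log p(a) wrt logits is one-hot(a) - probs
        return [l + lr * ((1.0 if i == a else 0.0) - p)
                for i, (l, p) in enumerate(zip(logits, probs))], a
    return logits, a
```

Under this reading, probability mass concentrates on actions that pass while failed samples leave the model untouched.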
Policy Gradient Methods
Directly optimizing policies through gradient ascent on expected returns
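A minimal REINFORCE sketch on a multi-armed bandit, assuming a softmax policy over discrete actions: sample an action, observe its reward, and move the logits along reward × ∇ log π(a):

```python
import math, random

def softmax(logits):
    m = max(logits)
    exps = [math.exp(l - m) for l in logits]
    z = sum(exps)
    return [e / z for e in exps]

def reinforce_step(logits, reward_fn, lr=0.1, rng=random):
    """One REINFORCE update: sample from the softmax policy, then step
    the logits along reward * grad log pi(a)."""
    probs = softmax(logits)
    a = rng.choices(range(len(logits)), weights=probs)[0]
    r = reward_fn(a)
    # grad of log pi(a) wrt logits is one-hot(a) - probs
    return [l + lr * r * ((1.0 if i == a else 0.0) - p)
            for i, (l, p) in enumerate(zip(logits, probs))]
```

Repeated updates shift probability mass toward the higher-reward arm, which is gradient ascent on the expected return in this one-step setting.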
Proximal Policy Optimization (PPO)
A stable, sample-efficient policy gradient algorithm for reinforcement learning
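The heart of PPO's stability is its clipped surrogate objective, shown here for a single action (the policy ratio is π_new(a)/π_old(a)): taking the minimum of the unclipped and clipped terms means updates that push the ratio outside [1−ε, 1+ε] earn no extra objective, which discourages destructively large policy steps:

```python
def ppo_clip_objective(ratio, advantage, eps=0.2):
    """PPO clipped surrogate for one sample: min of the unclipped and
    clipped policy-ratio terms, so the ratio is not rewarded for moving
    outside [1 - eps, 1 + eps]."""
    clipped = max(min(ratio, 1 + eps), 1 - eps)
    return min(ratio * advantage, clipped * advantage)
```

With ε = 0.2, a ratio of 1.5 on a positive advantage is credited as if it were 1.2, and a ratio of 0.5 on a negative advantage is penalized as if it were 0.8.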
Stable Marriage Problem
Finding a stable matching with the Gale-Shapley deferred acceptance algorithm
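A compact sketch of Gale-Shapley deferred acceptance, with preferences given as rank-ordered lists (indices 0..n−1 on both sides): free proposers work down their lists, and each reviewer holds the best offer seen so far, releasing the previous holder when a better one arrives:

```python
def gale_shapley(proposer_prefs, reviewer_prefs):
    """Deferred acceptance: returns a proposer -> reviewer matching that is
    stable (no proposer-reviewer pair prefer each other to their partners)."""
    n = len(proposer_prefs)
    next_choice = [0] * n          # next index to try in each proposer's list
    match = {}                     # reviewer -> current proposer
    # Precompute each reviewer's rank of every proposer for O(1) comparisons.
    rank = [{p: i for i, p in enumerate(prefs)} for prefs in reviewer_prefs]
    free = list(range(n))
    while free:
        p = free.pop()
        r = proposer_prefs[p][next_choice[p]]
        next_choice[p] += 1
        if r not in match:
            match[r] = p                       # reviewer holds first offer
        elif rank[r][p] < rank[r][match[r]]:
            free.append(match[r])              # reviewer trades up
            match[r] = p
        else:
            free.append(p)                     # offer rejected; try next
    return {p: r for r, p in match.items()}
```

The proposing side gets its best achievable stable partner; the algorithm always terminates after at most n² proposals.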