The Marginal Value of Adaptive Gradient Methods in Machine Learning
Ashia C. Wilson, Rebecca Roelofs, Mitchell Stern, Nathan Srebro, Benjamin Recht
https://arxiv.org/abs/1705.08292
May 23, 2017


Optimizers and activations