Non-convex learning via Stochastic Gradient Langevin Dynamics: a nonasymptotic analysis
Maxim Raginsky, Alexander Rakhlin, Matus Telgarsky
https://arxiv.org/abs/1702.03849
Feb. 13, 2017


Optimizers and activations