On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
From MaRDI portal
Publication:6277554
arXiv1609.04836MaRDI QIDQ6277554
Jorge Nocedal, Dheevatsa Mudigere, Mikhail Smelyanskiy, Nitish Shirish Keskar, Ping Tak Peter Tang
Publication date: 15 September 2016
Has companion code repository: https://github.com/keskarnitish/large-batch-training
This page was built for publication: On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima