On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima (Q6277554)

From MaRDI portal





Language: English
Label: On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
Description: preprint article from arXiv

    Statements

    15 September 2016
    cs.LG
    math.OC
    Nitish Shirish Keskar
    Dheevatsa Mudigere
    Jorge Nocedal
    Mikhail Smelyanskiy
    Ping Tak Peter Tang

    Identifiers
