A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale (Q6450892)

From MaRDI portal





preprint article from arXiv
Language Label Description Also known as
English
A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale
preprint article from arXiv

    Statements

    12 September 2023
    0 references
    cs.LG
    0 references
    cs.DC
    0 references
    cs.MS
    0 references
    math.OC
    0 references
    Hao-Jun Michael Shi
    0 references
    Tsung-Hsien Lee
    0 references
    Shintaro Iwasaki
    0 references
    Jose Gallego-Posada
    0 references
    Zhijing Li
    0 references
    Kaushik Rangadurai
    0 references
    Dheevatsa Mudigere
    0 references
    Michael Rabbat
    0 references

    Identifiers

    0 references