A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale (Q6450892)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale |
preprint article from arXiv
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale |
preprint article from arXiv |
Statements
12 September 2023
0 references
cs.LG
0 references
cs.DC
0 references
cs.MS
0 references
math.OC
0 references
Hao-Jun Michael Shi
0 references
Tsung-Hsien Lee
0 references
Shintaro Iwasaki
0 references
Jose Gallego-Posada
0 references
Zhijing Li
0 references
Kaushik Rangadurai
0 references
Dheevatsa Mudigere
0 references
Michael Rabbat
0 references