Analysis of a two-layer neural network via displacement convexity
DOI: 10.1214/20-AOS1945 · zbMATH: 1464.62401 · arXiv: 1901.01375 · OpenAlex: W3111778284 · MaRDI QID: Q1996787
Adel Javanmard, Marco Mondelli, Andrea Montanari
Publication date: 26 February 2021
Published in: The Annals of Statistics
Full work available at URL: https://arxiv.org/abs/1901.01375
Keywords: neural networks; convergence rate; stochastic gradient descent; Wasserstein gradient flow; displacement convexity; function regression
MSC classifications: Estimation in multivariate analysis (62H12); Point estimation (62F10); General nonlinear regression (62J02); Neural nets and related approaches to inference from stochastic processes (62M45)
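For orientation, a minimal sketch of the setting the keywords point at: a two-layer (one-hidden-layer) network with mean-field 1/N scaling, fit to a regression target by online stochastic gradient descent. Everything below (target function, activation, dimensions, step size) is an illustrative assumption, not the paper's construction or experiments.

```python
# Minimal sketch (illustrative assumptions throughout): a one-hidden-layer
# network f(x) = (1/N) * sum_i a_i * tanh(<w_i, x>), trained by online SGD
# on a synthetic regression target.
import numpy as np

rng = np.random.default_rng(0)
d, N = 5, 200            # input dimension, number of hidden units (hypothetical)
steps, lr = 20_000, 0.05

def target(X):
    # Hypothetical smooth regression function of one linear feature.
    return np.tanh(X @ np.ones(d) / np.sqrt(d))

W = rng.normal(size=(N, d)) / np.sqrt(d)  # first-layer weights
a = rng.normal(size=N)                    # second-layer weights

for _ in range(steps):
    x = rng.normal(size=d)                # fresh sample each step (online SGD)
    y = float(target(x[None, :])[0])
    h = np.tanh(W @ x)                    # hidden activations, shape (N,)
    err = a @ h / N - y                   # prediction uses mean-field 1/N scaling
    # Gradients of (1/2) err^2 carry a 1/N factor; the step size is scaled by N
    # (the usual mean-field time scaling), so the two factors cancel below.
    grad_a = err * h
    grad_W = err * np.outer(a * (1 - h**2), x)  # tanh'(z) = 1 - tanh(z)^2
    a -= lr * grad_a
    W -= lr * grad_W

X_test = rng.normal(size=(1000, d))
mse = np.mean((np.tanh(X_test @ W.T) @ a / N - target(X_test)) ** 2)
print(f"test MSE after SGD: {mse:.5f}")
```

Under the mean-field view referenced in the keywords, the empirical distribution of the weight pairs (a_i, w_i) evolves, as N grows, along a Wasserstein gradient flow of the population risk; displacement convexity of that risk functional is the property the paper exploits to obtain convergence rates.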
Related Items
- The Continuous Formulation of Shallow Neural Networks as Wasserstein-Type Gradient Flows
- Birth–death dynamics for sampling: global convergence, approximations and their asymptotics
- A rigorous framework for the mean field limit of multilayer neural networks
- A blob method for inhomogeneous diffusion with applications to multi-agent control and sampling
- Convergence rates for shallow neural networks learned by gradient descent
- Stochastic gradient descent with noise of machine learning type. II: Continuous time analysis
- A selective overview of deep learning
- Mean-field Langevin dynamics and energy landscape of neural networks
Cites Work
- Greedy function approximation: A gradient boosting machine.
- Contractions in the 2-Wasserstein length space and thermalization of granular media
- Interacting diffusions approximating the porous medium equation and propagation of chaos
- Vlasov equations
- Stochastic differential equations with reflecting boundary condition in convex regions
- A convexity principle for interacting gases
- Kinetic equilibration rates for granular media and related equations: entropy dissipation and mass transportation estimates
- The landscape of empirical risk for nonconvex losses
- Mean field analysis of neural networks: a central limit theorem
- Optimal transport for applied mathematicians. Calculus of variations, PDEs, and modeling
- Stochastic differential equations with reflecting boundary conditions
- Superresolution via Sparsity Constraints
- Universal approximation bounds for superpositions of a sigmoidal function
- The sample complexity of pattern classification with neural networks: the size of the weights is more important than the size of the network
- Boosting With the L2 Loss
- Theoretical Insights Into the Optimization Landscape of Over-Parameterized Shallow Neural Networks
- Simulation of the Solution of a Viscous Porous Medium Equation by a Particle Method
- Neural Network Learning
- A mean field view of the landscape of two-layer neural networks
- Breaking the Curse of Dimensionality with Convex Neural Networks
- Generalized Additive and Index Models with Shape Constraints
- Towards a Mathematical Theory of Super‐resolution
- Optimal Transport
- Approximation by superpositions of a sigmoidal function
- Entropy dissipation methods for degenerate parabolic problems and generalized Sobolev inequalities