scientific article; zbMATH DE number 6987098
From MaRDI portal
Publication:4558791
zbMath1451.68227arXiv2006.03976MaRDI QIDQ4558791
Sridhar Mahadevan, Mohammad Ghavamzadeh, Ji Liu, Marek Petrik, Bo Liu, Ian Gemp
Publication date: 30 November 2018
Full work available at URL: https://arxiv.org/abs/2006.03976
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
convergence ratesaddle-point error analysisstochastic gradient temporal difference learning algorithms
Related Items (2)
A Two-Timescale Stochastic Algorithm Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic ⋮ Unnamed Item
This page was built for publication: