Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Convergence rate of linear two-time-scale stochastic approximation. - MaRDI portal

Convergence rate of linear two-time-scale stochastic approximation.

From MaRDI portal
Publication:1879892

DOI10.1214/105051604000000116zbMath1094.62103arXivmath/0405287OpenAlexW1985291828MaRDI QIDQ1879892

John N. Tsitsiklis, Vijay R. Konda

Publication date: 15 September 2004

Published in: The Annals of Applied Probability (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/math/0405287




Related Items (29)

Recursive regression estimation based on the two-time-scale stochastic approximation method and Bernstein polynomialsA Two-Timescale Stochastic Algorithm Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-CriticSequential online subsampling for thinning experimental designsOnline calibrated forecasts: memory efficiency versus universality for learning in gamesConvergence rate and averaging of nonlinear two-time-scale stochastic approximation algo\-rithmsChange-point monitoring for online stochastic approximationsRisk-Sensitive Reinforcement Learning via Policy Gradient SearchVariance-constrained actor-critic algorithms for discounted and average reward MDPsDIMIX: Diminishing Mixing for Sloppy AgentsGeometrical Insights for Implicit Generative ModelingTwo-time-scale nonparametric recursive regression estimator for independent functional dataAsymptotic behavior of multiscale stochastic partial differential equations with Hölder coefficientsA Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement LearningTowards multi‐agent reinforcement learning‐driven over‐the‐counter market simulationsTwo-timescale stochastic gradient descent in continuous time with applications to joint online parameter estimation and optimal sensor placementWeak convergence of dynamical systems in two timescalesGradient-Based Adaptive Stochastic Search for Simulation Optimization Over Continuous SpaceNon asymptotic controls on a recursive superquantile approximationGenerative adversarial networks are special cases of artificial curiosity (1990) and also closely related to predictability minimization (1991)Stochastic compositional gradient descent: algorithms for minimizing compositions of expected-value functionsAveraging principle and normal deviations for multiscale stochastic systemsStochastic approximation algorithms for superquantiles estimationEmpirical Dynamic ProgrammingGADE: a generative adversarial approach to density estimation and its applicationsComputing VaR and CVaR using stochastic approximation and adaptive unconstrained importance samplingNetworks of reinforced stochastic processes: asymptotics for the empirical meansFundamental design principles for reinforcement learning algorithmsFinite-Time Analysis and Restarting Scheme for Linear Two-Time-Scale Stochastic ApproximationActor-Critic Algorithms with Online Feature Adaptation



Cites Work




This page was built for publication: Convergence rate of linear two-time-scale stochastic approximation.