Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Linear stochastic approximation driven by slowly varying Markov chains - MaRDI portal

Linear stochastic approximation driven by slowly varying Markov chains

From MaRDI portal

Publication:2503529

Jump to:navigation, search

DOI10.1016/S0167-6911(03)00132-4zbMath1157.93533OpenAlexW2078618768MaRDI QIDQ2503529

Vijay R. Konda, John N. Tsitsiklis

Publication date: 21 September 2006

Published in: Systems \& Control Letters (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1016/s0167-6911(03)00132-4

zbMATH Keywords

Stochastic approximation Adaptive algorithms Reinforcement learning

Mathematics Subject Classification ID

Identification in stochastic control theory (93E12) Stochastic approximation (62L20)

Related Items

Simulation-based optimal sensor scheduling with application to observer trajectory planning, Two-timescale stochastic gradient descent in continuous time with applications to joint online parameter estimation and optimal sensor placement, Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning

Cites Work

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:2503529&oldid=15213197"