Linear stochastic approximation driven by slowly varying Markov chains
From MaRDI portal
Publication:2503529
DOI10.1016/S0167-6911(03)00132-4zbMath1157.93533OpenAlexW2078618768MaRDI QIDQ2503529
Vijay R. Konda, John N. Tsitsiklis
Publication date: 21 September 2006
Published in: Systems \& Control Letters (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/s0167-6911(03)00132-4
Related Items
Simulation-based optimal sensor scheduling with application to observer trajectory planning, Two-timescale stochastic gradient descent in continuous time with applications to joint online parameter estimation and optimal sensor placement, Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Markov chains and stochastic stability
- Stochastic approximation with two time scales
- Asymptotic behavior of stochastic approximation and large deviations
- Convergence of an adaptive linear estimation algorithm
- OnActor-Critic Algorithms
- The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning