Asynchronous Stochastic Approximations

From MaRDI portal
Publication:4388937

DOI10.1137/S0363012995282784zbMath0922.62081OpenAlexW2080631849MaRDI QIDQ4388937

Vivek S. Borkar

Publication date: 10 May 1998

Published in: SIAM Journal on Control and Optimization (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1137/s0363012995282784



Lua error in Module:PublicationMSCList at line 37: attempt to index local 'msc_result' (a nil value).


Related Items (21)

Value iteration and adaptive dynamic programming for data-driven adaptive optimal control designAsynchronous stochastic approximation with differential inclusionsReinforcement learning based algorithms for average cost Markov decision processesConvergence analysis of contrastive divergence algorithm based on gradient method with errorsAsymptotics of Reinforcement Learning with Neural NetworksStochastic fictitious play with continuous action setsAn online actor-critic algorithm with function approximation for constrained Markov decision processesApproachability in Stackelberg stochastic games with vector costsIterative learning control for large scale nonlinear systems with observation noiseReinforcement learning for long-run average cost.Fully asynchronous stochastic coordinate descent: a tight lower bound on the parallelism achieving linear speedupQ-learning for Markov decision processes with a satisfiability criterionCharge-based control of DiffServ-like queuesThe Borkar-Meyn theorem for asynchronous stochastic approximationsStochastic approximation algorithms: overview and recent trends.A sensitivity formula for risk-sensitive cost and the actor-critic algorithmDistributed time synchronization for networks with random delays and measurement noiseEvent-driven stochastic approximationNonlinear GossipA stochastic gradient type algorithm for closed-loop problemsWhittle index based Q-learning for restless bandits with average reward




This page was built for publication: Asynchronous Stochastic Approximations