Asynchronous Stochastic Approximations

DOI: 10.1137/S0363012995282784
zbMath: 0922.62081
OpenAlex: W2080631849
MaRDI QID: Q4388937

Publication date: 10 May 1998

Published in: SIAM Journal on Control and Optimization

Full work available at URL: https://doi.org/10.1137/s0363012995282784

zbMATH Keywords

communication delays; distributed algorithms; asynchronous algorithms; approximation; ODE limit

Related Items (21)

Value iteration and adaptive dynamic programming for data-driven adaptive optimal control design
Asynchronous stochastic approximation with differential inclusions
Reinforcement learning based algorithms for average cost Markov decision processes
Convergence analysis of contrastive divergence algorithm based on gradient method with errors
Asymptotics of Reinforcement Learning with Neural Networks
Stochastic fictitious play with continuous action sets
An online actor-critic algorithm with function approximation for constrained Markov decision processes
Approachability in Stackelberg stochastic games with vector costs
Iterative learning control for large scale nonlinear systems with observation noise
Reinforcement learning for long-run average cost.
Fully asynchronous stochastic coordinate descent: a tight lower bound on the parallelism achieving linear speedup
Q-learning for Markov decision processes with a satisfiability criterion
Charge-based control of DiffServ-like queues
The Borkar-Meyn theorem for asynchronous stochastic approximations
Stochastic approximation algorithms: overview and recent trends.
A sensitivity formula for risk-sensitive cost and the actor-critic algorithm
Distributed time synchronization for networks with random delays and measurement noise
Event-driven stochastic approximation
Nonlinear Gossip
A stochastic gradient type algorithm for closed-loop problems
Whittle index based Q-learning for restless bandits with average reward
