A constrained optimization perspective on actor-critic algorithms and application to network routing
From MaRDI portal
Publication:286519
DOI10.1016/j.sysconle.2016.02.020zbMath1338.93403arXiv1507.07984OpenAlexW2962840509MaRDI QIDQ286519
H. L. Prasad, Bhatnagar Shalabh, Chandra Prakash, L. A. Prashanth
Publication date: 20 May 2016
Published in: Systems \& Control Letters (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/1507.07984
Nonlinear systems in control theory (93C10) Optimal stochastic control (93E20) Markov and semi-Markov decision processes (90C40)
Cites Work
- Unnamed Item
- Unnamed Item
- Natural actor-critic algorithms
- Stochastic approximation methods for constrained and unconstrained systems
- New algorithms of the Q-learning type
- Reinforcement learning based algorithms for average cost Markov decision processes
- OnActor-Critic Algorithms
- Actor-Critic--Type Learning Algorithms for Markov Decision Processes