10.1162/jmlr.2003.3.4-5.803
From MaRDI portal
Publication:4656011
DOI10.1162/jmlr.2003.3.4-5.803zbMath1112.68446OpenAlexW4230735704MaRDI QIDQ4656011
Theodore J. Perkins, Andrew G. Barto
Publication date: 8 March 2005
Published in: CrossRef Listing of Deleted DOIs (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1162/jmlr.2003.3.4-5.803
Related Items (5)
A Lyapunov approach for stable reinforcement learning ⋮ Deep reinforcement learning control approach to mitigating actuator attacks ⋮ Accelerating Primal-Dual Methods for Regularized Markov Decision Processes ⋮ Stability analysis of reservoir computers dynamics via Lyapunov functions ⋮ Risk-averse autonomous systems: a brief history and recent developments from the perspective of optimal control
This page was built for publication: 10.1162/jmlr.2003.3.4-5.803