Convergence of discretization procedure in \(Q\)-learning
From MaRDI portal
Publication:2725088
zbMATH Open0984.93023MaRDI QIDQ2725088
Cangpu Wu, Guofei Jiang, Huiqi Gao
Publication date: 21 April 2002
Published in: Control Theory \& Applications (Search for Journal in Brave)
Dynamic programming in optimal control and differential games (49L20) Design techniques (robust design, computer-aided design, etc.) (93B51) Numerical methods of relaxation type (49M20)
Recommendations
- The convergence of value iteration in discounted Markov decision processes π π
- Convergence results for single-step on-policy reinforcement-learning algorithms π π
- On the convergence of reinforcement learning π π
- Boundedness of iterates in \(Q\)-learning π π
- Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming π π
- Reinforcement learning via approximation of the Q-function π π
- Convergence of a Q-learning Variant for Continuous States and Actions π π
- Machine Learning: ECML 2004 π π
This page was built for publication: Convergence of discretization procedure in \(Q\)-learning
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2725088)