scientific article; zbMATH DE number 7255083
From MaRDI portal
Publication:4969098
zbMath1498.68229arXiv1801.03326MaRDI QIDQ4969098
Publication date: 5 October 2020
Full work available at URL: https://arxiv.org/abs/1801.03326
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Markov processes: estimation; hidden Markov models (62M05) Learning and adaptive systems in artificial intelligence (68T05) Markov and semi-Markov decision processes (90C40)
Related Items (4)
Smoothing policies and safe policy gradients ⋮ Importance sampling in reinforcement learning with an estimated behavior policy ⋮ Unnamed Item ⋮ Multi-agent reinforcement learning: a selective overview of theories and algorithms
Uses Software
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Policy gradient in Lipschitz Markov decision processes
- Natural actor-critic algorithms
- Multi-objective Reinforcement Learning through Continuous Pareto Manifold Approximation
- Some Relations Between Extended and Unscented Kalman Filters
- Optimal Estimation of Dynamic Systems
- 10.1162/1532443041827907
This page was built for publication: