scientific article; zbMATH DE number 7370552
From MaRDI portal
Publication:4998915
Publication date: 9 July 2021
Full work available at URL: https://arxiv.org/abs/1911.12976
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
maximum likelihood estimationMarkov decision processesonline learningchanging environmentuncertainty quantification
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Model-free \(Q\)-learning designs for linear discrete-time zero-sum games with application to \(H^\infty\) control
- An analysis of model-based interval estimation for Markov decision processes
- Stabilization and sensitivity for eventually time-invariant systems
- Module-based reinforcement learning: Experiments with a real robot
- A short proof of the Gittins index theorem
- Near-optimal reinforcement learning in polynomial time
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Introduction to Nonlinear Optimization
- 10.1162/153244303765208377
- Adaptive motion control of wheeled mobile robot with unknown slippage
This page was built for publication: