Safety-constrained reinforcement learning with a distributional safety critic
From MaRDI portal
Publication:6106435
DOI10.1007/s10994-022-06187-8OpenAlexW4283396661MaRDI QIDQ6106435
Thiago D. Simão, Matthijs T. J. Spaan, Simon H. Tindemans, Qisong Yang
Publication date: 27 June 2023
Published in: Machine Learning (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1007/s10994-022-06187-8
Cites Work
- The distance between two random vectors wigh given dispersion matrices
- Challenges of real-world reinforcement learning: definitions, benchmarks and analysis
- An actor-critic algorithm for constrained Markov decision processes
- Risk-Constrained Reinforcement Learning with Percentile Risk Criteria
- The variance of discounted Markov decision processes
- Robust Estimation of a Location Parameter
- On Information and Sufficiency
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
This page was built for publication: Safety-constrained reinforcement learning with a distributional safety critic