scientific article; zbMATH DE number 7559459
From MaRDI portal
Publication:5089265
DOI10.4230/LIPIcs.CONCUR.2020.3MaRDI QIDQ5089265
Roderick Bloem, Sebastian Junges, Bettina Könighofer, Alex Serban, Nils Jansen
Publication date: 18 July 2022
Full work available at URL: https://arxiv.org/abs/1807.06096
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Related Items (5)
Runtime monitors for Markov decision processes ⋮ Enforcing almost-sure reachability in POMDPs ⋮ Risk-aware shielding of partially observable Monte Carlo planning policies ⋮ Dynamic shielding for reinforcement learning in black-box environments ⋮ Lifted model checking for relational MDPs
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Shield synthesis
- Deep reinforcement learning with temporal logics
- A Survey of Multi-Objective Sequential Decision-Making
- Permissive Controller Synthesis for Probabilistic Systems
- Estimator-based reactive synthesis under incomplete information
- Graph Games and Reactive Synthesis
- Verification of Markov Decision Processes Using Learning Algorithms
- The Probabilistic Model Checking Landscape
- Learning-Based Mean-Payoff Optimization in an Unknown MDP under Omega-Regular Constraints
- dtControl
- Shield Synthesis:
- Safety-aware apprenticeship learning
This page was built for publication: