Dynamic shielding for reinforcement learning in black-box environments
DOI10.1007/978-3-031-19992-9_2zbMath1522.68351arXiv2207.13446OpenAlexW4312345073MaRDI QIDQ6103158
Stefan Klikovits, Ichiro Hasuo, Toru Takisaka, Ezequiel Castellano, Sasinee Pruekprasert, Masaki Waga
Publication date: 2 June 2023
Published in: Automated Technology for Verification and Analysis (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/2207.13446
Computational learning theory (68Q32) Learning and adaptive systems in artificial intelligence (68T05) Formal languages and automata (68Q45) Markov and semi-Markov decision processes (90C40) Specification and verification (program logics, model checking, etc.) (68Q60) Control/observation systems governed by functional relations other than differential equations (such as hybrid and switching systems) (93C30)
Cites Work
- Deep reinforcement learning with temporal logics
- On the Inference of Finite State Automata from Positive and Negative Data
- On the Construction of Fine Automata for Safety Properties
- Shield Synthesis:
- Run-time optimization for learned controllers through quantitative games
- Unnamed Item
- Unnamed Item
- Unnamed Item
This page was built for publication: Dynamic shielding for reinforcement learning in black-box environments