Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Dynamic shielding for reinforcement learning in black-box environments - MaRDI portal

Dynamic shielding for reinforcement learning in black-box environments

From MaRDI portal

Publication:6103158

Jump to:navigation, search

DOI10.1007/978-3-031-19992-9_2zbMath1522.68351arXiv2207.13446OpenAlexW4312345073MaRDI QIDQ6103158

Stefan Klikovits, Ichiro Hasuo, Toru Takisaka, Ezequiel Castellano, Sasinee Pruekprasert, Masaki Waga

Publication date: 2 June 2023

Published in: Automated Technology for Verification and Analysis (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/2207.13446

zbMATH Keywords

reinforcement learning shielding automata learning

Mathematics Subject Classification ID

Computational learning theory (68Q32) Learning and adaptive systems in artificial intelligence (68T05) Formal languages and automata (68Q45) Markov and semi-Markov decision processes (90C40) Specification and verification (program logics, model checking, etc.) (68Q60) Control/observation systems governed by functional relations other than differential equations (such as hybrid and switching systems) (93C30)

Cites Work

This page was built for publication: Dynamic shielding for reinforcement learning in black-box environments

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:6103158&oldid=35552697"