Optimistic value iteration
From MaRDI portal
Publication:2226755
DOI10.1007/978-3-030-53291-8_26zbMath1478.68161arXiv1910.01100OpenAlexW3046677588MaRDI QIDQ2226755
Benjamin Lucien Kaminski, Arnd Hartmanns
Publication date: 9 February 2021
Full work available at URL: https://arxiv.org/abs/1910.01100
Markov and semi-Markov decision processes (90C40) Specification and verification (program logics, model checking, etc.) (68Q60) Probability in computer science (algorithm analysis, random structures, phase transitions, etc.) (68Q87)
Related Items (8)
Latticed \(k\)-induction with an application to probabilistic programs ⋮ Runtime monitors for Markov decision processes ⋮ Optimistic and topological value iteration for simple stochastic games ⋮ Multi-cost bounded tradeoff analysis in MDP ⋮ Unnamed Item ⋮ Multi-objective optimization of long-run average and total rewards ⋮ Comparison of algorithms for simple stochastic games ⋮ Verification of multiplayer stochastic games via abstract dependency graphs
This page was built for publication: Optimistic value iteration