Parallel rollout for online solution of partially observable Markov decision processes
From MaRDI portal
Publication:702170
DOI10.1023/B:DISC.0000028199.78776.c4zbMath1057.90051OpenAlexW2034713309MaRDI QIDQ702170
Hyeong Soo Chang, Edwin K. P. Chong, Robert L. Givan
Publication date: 17 January 2005
Published in: Discrete Event Dynamic Systems (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1023/b:disc.0000028199.78776.c4
Related Items (6)
Simulation-based rolling horizon scheduling for operating theatres ⋮ A policy improvement method for constrained average Markov decision processes ⋮ Converging marriage in honey-bees optimization and application to stochastic dynamic programming ⋮ Multi-policy improvement in stochastic optimization with forward recursive function criteria ⋮ Partially observable Markov decision process approximations for adaptive sensing ⋮ Dynamic programming and suboptimal control: a survey from ADP to MPC
This page was built for publication: Parallel rollout for online solution of partially observable Markov decision processes