Multi-policy improvement in stochastic optimization with forward recursive function criteria
From MaRDI portal
Publication:1771387
DOI10.1016/j.jmaa.2004.10.062zbMath1059.90142OpenAlexW2072967268MaRDI QIDQ1771387
Publication date: 21 April 2005
Published in: Journal of Mathematical Analysis and Applications (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.jmaa.2004.10.062
Invariant imbeddingAssociative dynamic programsForward recursive objective functionParallel rolloutPolicy switching
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Parallel rollout for online solution of partially observable Markov decision processes
- Convergence properties of ordinal comparison in the simulation of discrete event dynamic systems
- Approximate receding horizon approach for Markov decision processes: average reward case
- Stochastic optimization of forward recursive functions
- Associative dynamic programs
- On the convergence rate of ordinal comparisons of random variables
- A survey of computational complexity results in systems and control
This page was built for publication: Multi-policy improvement in stochastic optimization with forward recursive function criteria