Suboptimal policy determination for large-scale Markov decision processes. I: Description and bounds
Publication:799497
DOI: 10.1007/BF00939287 · zbMath: 0548.90084 · MaRDI QID: Q799497
J. L. Popyack, Chelsea C. White III
Publication date: 1985
Published in: Journal of Optimization Theory and Applications
Keywords: finite state and action spaces; suboptimal policies; infinite-horizon expected total discounted cost; large-scale Markov decision processes
Related Items (4)
Reward revision and the average reward Markov decision process ⋮ Suboptimal policy determination for large-scale Markov decision processes. II: Implementation and numerical evaluation ⋮ Markov decision processes ⋮ Suboptimal policy determination for large-scale Markov decision processes. II: Implementation and numerical evaluation
Cites Work
- Unnamed Item
- Unnamed Item
- Convex composite multi-objective nonsmooth programming
- Dynamic programming and stochastic control
- Suboptimal Design for Large Scale, Multimodule Systems
- Applications of dynamic programming and other optimization methods in pest management
- An Iterative Aggregation Procedure for Markov Decision Processes
- Optimal Integrated Control of Univoltine Pest Populations with Age Structure
- A survey of maintenance models: The control and surveillance of deteriorating systems
- Multilayer control of large Markov chains
- Approximations of Dynamic Programs, I
- Approximations of Dynamic Programs, II
- Quality Control under Markovian Deterioration