A survey of solution techniques for the partially observed Markov decision process
From MaRDI portal
Publication:804478
DOI10.1007/BF02204836zbMath0727.90089MaRDI QIDQ804478
Publication date: 1991
Published in: Annals of Operations Research (Search for Journal in Brave)
surveyheuristic searchsuboptimal designcomputational procedurespartially observed Markov decision process
Markov and semi-Markov decision processes (90C40) Research exposition (monographs, survey articles) pertaining to operations research and mathematical programming (90-02) Computational methods for problems pertaining to operations research and mathematical programming (90-08)
Related Items
A two-state partially observable Markov decision process with three actions ⋮ Admission Control Policies in a Finite Capacity Geo/Geo/1 Queue Under Partial State Observations ⋮ An efficient heuristic for a partially observable Markov decision process of machine replacement ⋮ Control limits for two-state partially observable Markov decision processes ⋮ Asymptotically optimal Bayesian sequential change detection and identification rules ⋮ The skyline algorithm for POMDP value function pruning ⋮ Dynamic Learning and Decision Making via Basis Weight Vectors ⋮ A nonlinear programming model for partially observable Markov decision processes: Finite horizon case ⋮ Exploiting symmetries for single- and multi-agent partially observable stochastic domains ⋮ State observation accuracy and finite-memory policy performance ⋮ An integrated approach to solving influence diagrams and finite-horizon partially observable decision processes ⋮ Unnamed Item ⋮ Markov-Entscheidungs-Prozesse mit abhängigen Aktionen für optimale Reparaturmaßnahmen bei unvollständiger Information. (Markov decision processes with dependent actions for optimal repair policies under incomplete information) ⋮ Planning and acting in partially observable stochastic domains ⋮ Optimal condition based maintenance with imperfect information and the proportional hazards model ⋮ Undiscounted Markov decision chains with partial information; an algorithm for computing a locally optimal periodic policy ⋮ A superharmonic approach to solving infinite horizon partially observable Markov decision problems ⋮ A simple suboptimal algorithm for system maintance under partial observability ⋮ On replacement policies for additive systems with several working levels ⋮ Selecting a quality control attribute sample: An information-economics method ⋮ A tutorial on partially observable Markov decision processes ⋮ Multiaction maintenance subject to action-dependent risk and stochastic failure ⋮ Optimizing active surveillance for prostate cancer using partially observable Markov decision processes ⋮ A leader-follower partially observed, multiobjective Markov game ⋮ Value of information for a leader-follower partially observed Markov game
Cites Work
- Application of Jensen's inequality to adaptive suboptimal design
- Nonparametric adaptive control of discrete-time partially observable stochastic systems
- Optimal control of Markov processes with incomplete state information
- Optimal control of Markov processes with incomplete state-information. II: The convexity of the loss-function
- Reward Revision for Discounted Markov Decision Problems
- Optimal Infinite-Horizon Undiscounted Control of Finite Probabilistic Systems
- Action Elimination Procedures for Modified Policy Iteration Algorithms
- State of the Art—A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms
- Computationally Feasible Bounds for Partially Observed Markov Decision Processes
- The Optimal Control of Partially Observable Markov Processes over the Infinite Horizon: Discounted Costs
- Modified Policy Iteration Algorithms for Discounted Markov Decision Problems
- Finite-Memory Suboptimal Design for Partially Observed Markov Decision Processes
- The Optimal Control of Partially Observable Markov Processes over a Finite Horizon
- Solution Procedures for Partially Observed Markov Decision Processes
- Markov decision processes
- Unnamed Item
- Unnamed Item