Markov decision processes
DOI: 10.1016/0377-2217(89)90348-2
zbMath: 0677.90086
OpenAlex: W2096630263
MaRDI QID: Q5904001
Authors: Chelsea C. White III, Douglas J. White
Publication date: 1989
Published in: European Journal of Operational Research
Full work available at URL: https://doi.org/10.1016/0377-2217(89)90348-2
Keywords: introduction; adaptive; multiobjective; discrete event dynamic systems; constrained models; semi-Markov; partially observed
Related Items
- An Heuristic for Multi-Dimensional Markov Decision Processes
- On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes
- The value of observing the condition of a deteriorating machine
- Intelligent agent supporting human-multi-robot team collaboration
- Optimal recovery strategies for manufacturing systems
- Approximate solutions to constrained risk-sensitive Markov decision processes
- Continuous time shock Markov decision processes with discounted criterion
- Sequential process control under capacity constraints
- Computation of weighted sums of rewards for concurrent MDPs
- A multi-period TSP with stochastic regular and urgent demands
- Relevant states and memory in Markov chain bootstrapping and simulation
- Determining the optimal strategies for discrete control problems on stochastic networks with discounted costs
- Was Angelina Jolie Right? Optimizing Cancer Prevention Strategies Among BRCA Mutation Carriers
- Multiaction maintenance subject to action-dependent risk and stochastic failure
- A survey of solution techniques for the partially observed Markov decision process
- Optimal cost and policy for a Markovian replacement problem
Cites Work
- Unnamed Item (18 entries)
- Suboptimal policy determination for large-scale Markov decision processes. I: Description and bounds
- Reward revision and the average reward Markov decision process
- Optimality and efficiency. I
- Stochastic optimal control. The discrete time case
- Multi-objective infinite-horizon discounted Markov decision processes
- Infinite horizon Markov decision processes with unknown or variable discount factors
- Mean, variance and probabilistic criteria in finite Markov decision processes: A review
- Dynamic programming, Markov chains, and the method of successive approximations
- Sufficient statistics in the optimum control of stochastic systems
- A modified dynamic programming method for Markovian decision problems
- Finite state Markovian decision processes
- Vector-Valued Dynamic Programming
- Iterative Aggregation-Disaggregation Procedures for Discounted Semi-Markov Reward Processes
- Reward Revision for Discounted Markov Decision Problems
- Parameter Imprecision in Finite State, Finite Action Dynamic Programs
- Performance evaluation and perturbation analysis of discrete event dynamic systems
- Suboptimal Design for Large Scale, Multimodule Systems
- Action Elimination Procedures for Modified Policy Iteration Algorithms
- An Iterative Aggregation Procedure for Markov Decision Processes
- State of the Art—A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms
- Convergence of Dynamic Programming Models
- On the Optimality of Myopic Policies in Sequential Decision Problems
- Bounds and Transformations for Discounted Finite Markov Decision Chains
- Minimizing a Submodular Function on a Lattice
- Sequential Decision Problems with Expected Utility Criteria. III: Upper and Lower Transience
- Modified Policy Iteration Algorithms for Discounted Markov Decision Problems
- Approximations of Dynamic Programs, I
- The Asymptotic Behavior of Undiscounted Value Iteration in Markov Decision Problems
- Approximations of Dynamic Programs, II
- A Survey of Applications of Markov Decision Processes
- Markov Decision Processes with Imprecise Transition Probabilities
- Discounted Dynamic Programming
- Contraction Mappings in the Theory Underlying Dynamic Programming
- Letter to the Editor—A Test for Suboptimal Actions in Markovian Decision Problems
- On Finding the Maximal Gain for Markov Decision Processes
- Some Bounds for Discounted Sequential Decision Processes
- Suboptimal policy determination for large-scale Markov decision processes. II: Implementation and numerical evaluation