Stationary policies and Markov policies in Borel dynamic programming
From MaRDI portal
Publication:1071658
DOI10.1007/BF01845641zbMath0585.90088MaRDI QIDQ1071658
William D. Sudderth, Manfred Schäl
Publication date: 1986
Published in: Probability Theory and Related Fields (Search for Journal in Brave)
Dynamic programming (90C39) Stopping times; optimal stopping problems; gambling theory (60G40) Markov and semi-Markov decision processes (90C40)
Related Items (6)
The transformation method for continuous-time Markov decision processes ⋮ Markov-achievable payoffs for finite-horizon decision models. ⋮ On Generalized Bellman Equations and Temporal-Difference Learning ⋮ Finite-stage reward functions having the Markov adequacy property ⋮ MDPs with setwise continuous transition probabilities ⋮ On Convergence of Value Iteration for a Class of Total Cost Markov Decision Processes
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Denumerable state semi-Markov decision processes with unbounded costs, average cost criterion
- Stochastic optimal control. The discrete time case
- The optimal reward operator in dynamic programming
- The stochastic processes of Borel gambling and dynamic programming
- On stationary strategies for absolutely continuous houses
- Measurable selections of extrema
- Stationary Policies in Dynamic Programming Models Under Compactness Assumptions
- On Stationary Strategies in Countable State Total Reward Markov Decision Processes
- Renewal Plans and Persistent Optimality in Countably Additive Gambling
- Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal
- Countably additive gambling and optimal stopping
- Optimal stopping and almost sure convergence of random sequences
- Persistently ϵ-Optimal Strategies
- Negative Dynamic Programming
- On the Existence of Good Stationary Strategies
- On the Existence of Stationary Optimal Strategies
- A "Fatou Equation" for Randomly Stopped Variables
- On a Problem of D. Lackwell from the Theory of Dynamic Programming
This page was built for publication: Stationary policies and Markov policies in Borel dynamic programming