Average Cost Optimality Inequality for Markov Decision Processes with Borel Spaces and Universally Measurable Policies
From MaRDI portal
Publication:5130921
DOI10.1137/19M1239507zbMath1452.90317arXiv2001.01357MaRDI QIDQ5130921
Publication date: 30 October 2020
Published in: SIAM Journal on Control and Optimization (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/2001.01357
Markov decision processesaverage costoptimality inequalityBorel spacesuniversally measurable policiesmajorization conditions
Related Items (3)
On Linear Programming for Constrained and Unconstrained Average-Cost Markov Decision Processes with Countable Action Spaces and Strictly Unbounded Costs ⋮ On structural properties of optimal average cost functions in Markov decision processes with Borel spaces and universally measurable policies ⋮ Convex analytic method revisited: further optimality results and performance of deterministic policies in average cost stochastic control
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Average control of Markov decision processes with Feller transition probabilities and general action spaces
- On the optimality equation for average cost Markov control processes with Feller transition probabilities
- Invariant problems in dynamic programming - average reward criterion
- Stochastic optimal control. The discrete time case
- A counterexample on the optimality equation in Markov decision chains with the average cost criterion
- Average optimality in dynamic programming on Borel spaces -- unbounded costs and controls
- The optimal reward operator in dynamic programming
- The average cost optimality equation: a fixed point approach
- Fatou's lemma and Lebesgue's convergence theorem for measures
- Solutions of the average cost optimality equation for Markov decision processes with weakly continuous kernel: the fixed-point approach revisited
- Envelopes of sets of measures, tightness, and Markov control processes
- Markov chains and invariant probabilities
- On the optimality equation for average cost Markov decision processes and its validity for inventory control
- On the vanishing discount factor approach for Markov decision processes with weakly continuous transition probabilities
- Limiting Discounted-Cost Control of Partially Observable Stochastic Systems
- Average Cost Markov Decision Processes with Weakly Continuous Transition Probabilities
- On the convergence of optimal actions for Markov decision processes and the optimality of (s,S) inventory policies
- General Irreducible Markov Chains and Non-Negative Operators
- A Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable Policies
- Semi-Markov control processes with non-compact action spaces and discontinuous costs
- Markov Chains and Stochastic Stability
- Markov Decision Processes with a Borel Measurable Cost Function—The Average Case
- Inequalities in Theorems of Ergodicity and Stability for Markov Chains with Common Phase Space. II
- Average Cost Optimal Stationary Policies in Infinite State Markov Decision Processes with Unbounded Costs
- The Optimal Reward Operator in Negative Dynamic Programming
- Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal
- Alternative Theoretical Frameworks for Finite Horizon Discrete-Time Stochastic Optimal Control
- Universally Measurable Policies in Dynamic Programming
- Variational Analysis
- Infinite-horizon Markov control processes with undiscounted cost criteria: from average to overtaking optimality
- The policy iteration algorithm for average reward Markov decision processes with general state space
- Discrete-Time Controlled Markov Processes with Average Cost Criterion: A Survey
- Average Optimality in Dynamic Programming with General State Space
- Real Analysis and Probability
- Average cost Markov control processes with weighted norms: existence of canonical policies
- Fatou's Lemma in Its Classical Form and Lebesgue's Convergence Theorems for Varying Measures with Applications to Markov Decision Processes
- On the Minimum Pair Approach for Average Cost Markov Decision Processes with Countable Discrete Action Spaces and Strictly Unbounded Costs
- On Convergence of Value Iteration for a Class of Total Cost Markov Decision Processes
- Negative Dynamic Programming
- A Borel Set Not Containing a Graph
- Arbitrary State Markovian Decision Processes
- Discrete Dynamic Programming with Sensitive Discount Optimality Criteria
- Coordinating Inventory Control and Pricing Strategies with Random Demand and Fixed Ordering Cost: The Infinite Horizon Case
This page was built for publication: Average Cost Optimality Inequality for Markov Decision Processes with Borel Spaces and Universally Measurable Policies