Markov decision processes with state-dependent discount factors and unbounded rewards/costs
From MaRDI portal
Publication:408405
DOI10.1016/j.orl.2011.06.014zbMath1235.90178OpenAlexW2055647224MaRDI QIDQ408405
Publication date: 5 April 2012
Published in: Operations Research Letters (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.orl.2011.06.014
Markov decision processesoptimal value functionoptimal stationary policystate-dependent discount factorsunbounded costs/rewards
Related Items (16)
Constrained Markov control processes with randomized discounted cost criteria: infinite linear programming approach ⋮ First passage problems for nonstationary discrete-time stochastic control systems ⋮ First Passage Optimality and Variance Minimisation of Markov Decision Processes with Varying Discount Factors ⋮ Certified reinforcement learning with logic guidance ⋮ Some advances on constrained Markov decision processes in Borel spaces with random state-dependent discount factors ⋮ Delay-Minimizing Capacity Allocation in an Infinite Server-Queueing System ⋮ An average-value-at-risk criterion for Markov decision processes with unbounded costs ⋮ Discrete-time control with non-constant discount factor ⋮ Zero-sum Markov games with random state-actions-dependent discount factors: existence of optimal strategies ⋮ Finite approximation of the first passage models for discrete-time Markov decision processes with varying discount factors ⋮ Deep reinforcement learning with temporal logics ⋮ Convergence of Markov decision processes with constraints and state-action dependent discount factors ⋮ A mean field absorbing control model for interacting objects systems ⋮ Semi-Markov decision processes with variance minimization criterion ⋮ Zero-sum semi-Markov games with state-action-dependent discount factors ⋮ First passage Markov decision processes with constraints and varying discount factors
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Markov decision processes with exponentially representable discounting
- A convex analytic approach to Markov decision processes
- Discounted cost Markov decision processes on Borel spaces: The linear programming formulation
- Discounting the distant future: How much do uncertain rates increase valuations?
- Blackwell optimality in the class of all policies in Markov decision chains with a Borel state space and unbounded rewards
- Markov control processes with randomized discounted cost
- Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal
- Markov Decision Models with Weighted Discounted Criteria
This page was built for publication: Markov decision processes with state-dependent discount factors and unbounded rewards/costs