Time-varying Markov decision processes with state-action-dependent discount factors and unbounded costs
From MaRDI portal
Publication:5227206
DOI10.14736/kyb-2019-1-0166zbMath1449.93262OpenAlexW2921268247MaRDI QIDQ5227206
Beatris Adriana Escobedo-Trujillo, Carmen G. Higuera-Chan
Publication date: 5 August 2019
Published in: Kybernetika (Search for Journal in Brave)
Full work available at URL: http://hdl.handle.net/10338.dmlcz/147711
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Semi-Markov control models with partially known holding times distribution: discounted and average criteria
- Markov control models with unknown random state-action-dependent discount factors
- Measurable selection theorems for optimization problems
- Adaptive policies for time-varying stochastic systems under discounted criterion
- Limiting optimal discounted-cost control of a class of time-varying stochastic systems
- Semi-Markov control processes with unknown holding times distribution under an average cost criterion
- Adaptive control of stochastic systems with unknown disturbance distribution: discounted criteria
- Semi-Markov control processes with unknown holding times distribution under a discounted criterion
- Approximate policy iteration: a survey and some new methods
- Estimate and approximate policy iteration algorithm for discounted Markov decision models with bounded costs and Borel spaces
- Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal
- Uniform Turnpike Theorems for Finite Markov Decision Processes
- Approximate Dynamic Programming
- Turnpike Planning Horizons for a Markovian Decision Model
This page was built for publication: Time-varying Markov decision processes with state-action-dependent discount factors and unbounded costs