Discounted Markov decision processes with utility constraints
From MaRDI portal
Publication:2494787
DOI10.1016/j.camwa.2005.11.013zbMath1120.90066OpenAlexW2054226461MaRDI QIDQ2494787
Masami Kurano, Masami Yasuda, Yoshinobu Kadota
Publication date: 30 June 2006
Published in: Computers \& Mathematics with Applications (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.camwa.2005.11.013
Markov decision processesLagrange techniqueSaddle-pointConstrained optimal policyConstrained optimal policy Markov decision processesDiscount criterionUtility constraints
Related Items (4)
A consumption and investment problem via a Markov decision processes approach with random horizon ⋮ Smoothing policies and safe policy gradients ⋮ An exact iterative search algorithm for constrained Markov decision processes ⋮ Discounted continuous-time constrained Markov decision processes in Polish spaces
Cites Work
- Target-level criterion in Markov decision processes
- Minimising a threshold probability in discounted Markov decision processes
- Markov decision processes with a new optimality criterion: Discrete time
- Constrained Discounted Markov Decision Chains
- On Fan's minimax theorem
- Discounted MDP’s: Distribution Functions and Exponential Utility Maximization
- On the General Utility of Discounted Markov Decision Processes
- Constrained markov decision processes with compact state and action spaces: the average case
- Risk-Sensitive Markov Decision Processes
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
This page was built for publication: Discounted Markov decision processes with utility constraints