Risk-Sensitive Optimal Control for Markov Decision Processes with Monotone Cost
From MaRDI portal
Publication:5704070
DOI10.1287/moor.27.1.192.334zbMath1082.90577OpenAlexW2086304253MaRDI QIDQ5704070
Publication date: 11 November 2005
Published in: Mathematics of Operations Research (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1287/moor.27.1.192.334
Dynamic programming in optimal control and differential games (49L20) Queues and service in operations research (90B22) Optimal stochastic control (93E20) Markov and semi-Markov decision processes (90C40)
Related Items
Risk-Sensitive Average Optimality for Discrete-Time Markov Decision Processes ⋮ Optimality equations and inequalities in a class of risk-sensitive average cost Markov decision chains ⋮ Analyzing operational risk-reward trade-offs for start-ups ⋮ Risk-Sensitive Ergodic Control of Continuous Time Markov Processes With Denumerable State Space ⋮ Zero-Sum Risk-Sensitive Stochastic Differential Games ⋮ Oja's algorithm for graph clustering, Markov spectral decomposition, and risk sensitive control ⋮ Dynamic admission and service rate control of a queue ⋮ Zero-sum semi-Markov games with a probability criterion ⋮ Discounted approximations to the risk-sensitive average cost in finite Markov chains ⋮ Risk-sensitive semi-Markov decision problems with discounted cost and general utilities ⋮ Contractive approximations in average Markov decision chains driven by a risk-seeking controller ⋮ Risk-Sensitive Reinforcement Learning via Policy Gradient Search ⋮ On the global convergence of relative value iteration for infinite-horizon risk-sensitive control of diffusions ⋮ Discounted approximations in risk-sensitive average Markov cost chains with finite state space ⋮ Average criteria in denumerable semi-Markov decision chains under risk-aversion ⋮ Markov decision processes under risk sensitivity: a discount vanishing approach ⋮ Discrete-time zero-sum games for Markov chains with risk-sensitive average cost criterion ⋮ Characterization of the optimal average cost in Markov decision chains driven by a risk-seeking controller ⋮ Discrete time homogeneous Markov processes for the study of the basic risk processes ⋮ Zero-sum risk-sensitive stochastic games on a countable state space ⋮ The Vanishing Discount Approach in a class of Zero-Sum Finite Games with Risk-Sensitive Average Criterion ⋮ Risk-sensitive control for a class of nonlinear systems with multiplicative noise ⋮ Risk-sensitive control of pure jump process on countable space with near monotone cost ⋮ Average optimality for risk-sensitive control with general state space ⋮ Convergence of value functions for finite horizon Markov decision processes with constraints ⋮ On tight bounds for function approximation error in risk-sensitive reinforcement learning ⋮ Infinite horizon risk-sensitive control of diffusions without any blanket stability assumptions ⋮ Exit time risk-sensitive control for systems of cooperative agents ⋮ A discounted approach in communicating average Markov decision chains under risk-aversion ⋮ The study of basic risk processes by discrete-time non-homogeneous Markov processes ⋮ A sensitivity formula for risk-sensitive cost and the actor-critic algorithm ⋮ Zero-sum risk-sensitive stochastic games ⋮ Risk-sensitive control of continuous time Markov chains ⋮ Vanishing discount approximations in controlled Markov chains with risk-sensitive average criterion ⋮ Necessary and sufficient conditions for a solution to the risk-sensitive Poisson equation on a finite state space ⋮ Nonzero-Sum Risk-Sensitive Stochastic Games on a Countable State Space ⋮ Characterization of the Optimal Risk-Sensitive Average Cost in Denumerable Markov Decision Chains ⋮ Risk-Sensitive Discounted Continuous-Time Markov Decision Processes with Unbounded Rates ⋮ Nonstationary value iteration in controlled Markov chains with risk-sensitive average criterion ⋮ Risk-sensitive zero-sum stochastic differential game for jump-diffusions ⋮ Nonzero-sum risk-sensitive average stochastic games: The case of unbounded costs ⋮ On the relative value iteration with a risk-sensitive criterion ⋮ Unnamed Item ⋮ Risk-averse policy optimization via risk-neutral policy optimization ⋮ On the policy improvement algorithm for ergodic risk-sensitive control ⋮ Ergodic risk-sensitive control of Markov processes on countable state space revisited ⋮ Risk-sensitive control for a class of diffusions with jumps ⋮ Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning