Solution to the risk-sensitive average optimality equation in communicating Markov decision chains with finite state space: An alternative approach (Q1812296)

scientific article; zbMATH DE number 1932774

Language	Label	Description	Also known as
English	Solution to the risk-sensitive average optimality equation in communicating Markov decision chains with finite state space: An alternative approach	scientific article; zbMATH DE number 1932774

Statements

instance of

scholarly article

0 references

title

Solution to the risk-sensitive average optimality equation in communicating Markov decision chains with finite state space: An alternative approach (English)

0 references

author

Rolando Cavazos-Cadena

0 references

Daniel Hernández-Hernández

0 references

published in

Mathematical Methods of Operations Research

0 references

publication date

23 June 2003

0 references

review text

This paper studies Markov decision chains with finite state and action-sets. The decision maker is assumed to be risk averse with constant risk sensitive coefficient \(\lambda\) and the performance of a control policy is measured by the risk-sensitive average cost criterion. Using a contractive operator and the vanishing discount approach, the authors present an alternative proof for the existence result [\textit{R. A. Howard} and \textit{J. E. Matheson}, Manage. Sci., Theory 18, 356--369 (1972; Zbl 0238.90007)], which says that the optimality equation has a solution for every \(\lambda> 0\), when the whole state space is a communication class under the action of each stationary policy.

0 references

zbMATH Keywords

contractive operator

0 references

vanishing discount approach

0 references

risk sensitive control

0 references

Markov decision chains

0 references

existence result

0 references