Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Solution to the risk-sensitive average optimality equation in communicating Markov decision chains with finite state space: An alternative approach - MaRDI portal

Solution to the risk-sensitive average optimality equation in communicating Markov decision chains with finite state space: An alternative approach (Q1812296)

From MaRDI portal





scientific article; zbMATH DE number 1932774
Language Label Description Also known as
English
Solution to the risk-sensitive average optimality equation in communicating Markov decision chains with finite state space: An alternative approach
scientific article; zbMATH DE number 1932774

    Statements

    Solution to the risk-sensitive average optimality equation in communicating Markov decision chains with finite state space: An alternative approach (English)
    0 references
    23 June 2003
    0 references
    This paper studies Markov decision chains with finite state and action-sets. The decision maker is assumed to be risk averse with constant risk sensitive coefficient \(\lambda\) and the performance of a control policy is measured by the risk-sensitive average cost criterion. Using a contractive operator and the vanishing discount approach, the authors present an alternative proof for the existence result [\textit{R. A. Howard} and \textit{J. E. Matheson}, Manage. Sci., Theory 18, 356--369 (1972; Zbl 0238.90007)], which says that the optimality equation has a solution for every \(\lambda> 0\), when the whole state space is a communication class under the action of each stationary policy.
    0 references
    contractive operator
    0 references
    vanishing discount approach
    0 references
    risk sensitive control
    0 references
    Markov decision chains
    0 references
    existence result
    0 references
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references