Solution to the risk-sensitive average optimality equation in communicating Markov decision chains with finite state space: An alternative approach (Q1812296)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Solution to the risk-sensitive average optimality equation in communicating Markov decision chains with finite state space: An alternative approach |
scientific article; zbMATH DE number 1932774
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Solution to the risk-sensitive average optimality equation in communicating Markov decision chains with finite state space: An alternative approach |
scientific article; zbMATH DE number 1932774 |
Statements
Solution to the risk-sensitive average optimality equation in communicating Markov decision chains with finite state space: An alternative approach (English)
0 references
23 June 2003
0 references
This paper studies Markov decision chains with finite state and action-sets. The decision maker is assumed to be risk averse with constant risk sensitive coefficient \(\lambda\) and the performance of a control policy is measured by the risk-sensitive average cost criterion. Using a contractive operator and the vanishing discount approach, the authors present an alternative proof for the existence result [\textit{R. A. Howard} and \textit{J. E. Matheson}, Manage. Sci., Theory 18, 356--369 (1972; Zbl 0238.90007)], which says that the optimality equation has a solution for every \(\lambda> 0\), when the whole state space is a communication class under the action of each stationary policy.
0 references
contractive operator
0 references
vanishing discount approach
0 references
risk sensitive control
0 references
Markov decision chains
0 references
existence result
0 references