A semimartingale characterization of average optimal stationary policies for Markov decision processes (Q871336)
From MaRDI portal
scientific article; zbMATH DE number 5134583
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | A semimartingale characterization of average optimal stationary policies for Markov decision processes | scientific article; zbMATH DE number 5134583 | |
Statements
A semimartingale characterization of average optimal stationary policies for Markov decision processes (English)
19 March 2007
Summary: This paper deals with discrete-time Markov decision processes with Borel state and action spaces. The criterion to be minimized is the average expected cost, and the costs may have neither upper nor lower bounds. In our former paper [J. Appl. Probab. 43, No. 2, 318-334 (2006; Zbl 1121.90122)], weaker conditions were proposed to ensure the existence of average optimal stationary policies. In this paper, we further study some properties of optimal policies. Under these weaker conditions, we not only obtain two necessary and sufficient conditions for optimal policies, but also give a "semimartingale characterization" of an average optimal stationary policy.
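For orientation, the average expected cost criterion named in the summary is commonly written as below. This is the standard textbook form, not a formula quoted from the paper; the symbols $c$ (one-stage cost), $\pi$ (policy), $x$ (initial state), $x_t, a_t$ (state and action at time $t$) and $J$ are notation assumed here for illustration.

```latex
J(\pi, x) \;=\; \limsup_{n \to \infty} \frac{1}{n}\,
  \mathbb{E}_x^{\pi}\!\left[ \sum_{t=0}^{n-1} c(x_t, a_t) \right],
\qquad
J^{*}(x) \;=\; \inf_{\pi} J(\pi, x).
```

Under this notation, a stationary policy $f$ would be called average optimal if $J(f, x) = J^{*}(x)$ for every initial state $x$.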