Monotone optimal policies in discounted Markov decision processes with transition probabilities independent of the current state: existence and approximation (Q2868780)
scientific article; zbMATH DE number 6239462
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Monotone optimal policies in discounted Markov decision processes with transition probabilities independent of the current state: existence and approximation | scientific article; zbMATH DE number 6239462 | |
Statements
19 December 2013
Markov decision process
total discounted cost
total discounted reward
increasing optimal policy
decreasing optimal policy
policy iteration algorithm
Monotone optimal policies in discounted Markov decision processes with transition probabilities independent of the current state: existence and approximation (English)