A pause control approach to the value iteration scheme in average Markov decision processes (Q1128694)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: A pause control approach to the value iteration scheme in average Markov decision processes |
scientific article; zbMATH DE number 1189955
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | A pause control approach to the value iteration scheme in average Markov decision processes |
scientific article; zbMATH DE number 1189955 |
Statements
A pause control approach to the value iteration scheme in average Markov decision processes (English)
0 references
13 August 1998
0 references
controlled Markov chains
0 references
long-run average cost criterion
0 references
Lyapunov function condition
0 references
convergent approximations to the solution of the optimality equation
0 references
artificial action
0 references
value iteration scheme
0 references
0 references
0 references
0 references