Estimation and control in multichain processes (Q1176867)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Estimation and control in multichain processes |
scientific article; zbMATH DE number 12606
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Estimation and control in multichain processes |
scientific article; zbMATH DE number 12606 |
Statements
Estimation and control in multichain processes (English)
0 references
25 June 1992
0 references
The paper considers Markovian decision processes in discrete time with transition probabilities depending on an unknown parameter which may change step by step. In the case of convergence of such a parameter sequence a policy maximizing the average expected reward over an infinite horizon is looked for. Under continuity conditions, the uniform optimality of a policy based on ``estimation and control'' for some multichain models is shown.
0 references
adaptive controls
0 references
discrete time
0 references
average expected reward
0 references
0 references
0 references