Control of a Markov process in a problem with constraints (Q922966)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Control of a Markov process in a problem with constraints |
scientific article; zbMATH DE number 4170672
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Control of a Markov process in a problem with constraints |
scientific article; zbMATH DE number 4170672 |
Statements
Control of a Markov process in a problem with constraints (English)
0 references
1989
0 references
Given is a discrete-time Markov decision process with finite state and action space and \(\beta\)-discounted total reward F. Let \(\pi\) be a (randomized) strategy. In contrast to the usual approach it is searched for \(v=\sup_{\pi}E^{\pi}F\) under the constraints \(E^{\pi}F_ i\leq K_ i\), \(i=1,...,m\), for some functions \(F_ i\) and real constants \(K_ i\). Then there exists an optimal strategy which is a mixture of at most \(m+1\) stationary strategies.
0 references
discrete-time Markov decision process
0 references
finite state and action space
0 references
\(\beta \) -discounted total reward
0 references
optimal strategy
0 references