Constrained denumerable state non-stationary MDPs with expected total reward criterion (Q1568256)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Constrained denumerable state non-stationary MDPs with expected total reward criterion |
scientific article; zbMATH DE number 1462511
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Constrained denumerable state non-stationary MDPs with expected total reward criterion |
scientific article; zbMATH DE number 1462511 |
Statements
Constrained denumerable state non-stationary MDPs with expected total reward criterion (English)
0 references
21 June 2000
0 references
non-stationary Markov decision processes
0 references
expected total reward criterion
0 references
constrained optimal policies
0 references
Markov policy
0 references