Average optimality in dynamic programming on Borel spaces -- unbounded costs and controls (Q1190402)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Average optimality in dynamic programming on Borel spaces -- unbounded costs and controls |
scientific article; zbMATH DE number 55364
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Average optimality in dynamic programming on Borel spaces -- unbounded costs and controls |
scientific article; zbMATH DE number 55364 |
Statements
Average optimality in dynamic programming on Borel spaces -- unbounded costs and controls (English)
0 references
26 September 1992
0 references
The author considers a discrete-time Markovian decision process with Borel state space, Borel action space, and unbounded rewards. He proves the existence of an average-optimal policy within a setting allowing the sets of admissible actions to be noncompact.
0 references
discrete-time Markovian decision process
0 references
Borel state space
0 references
Borel action space
0 references
unbounded rewards
0 references
average-optimal policy
0 references
0 references
0 references
0 references
0.9331282
0 references
0.92361546
0 references
0.91894656
0 references
0.9146333
0 references
0.91019017
0 references
0.90179753
0 references