Linear programming and constrained average optimality for general continuous-time Markov decision processes in history-dependent policies (Q2884585)

scientific article; zbMATH DE number 6039275

Statements

published in

SIAM Journal on Control and Optimization

publication date

30 May 2012

zbMATH Keywords

continuous-time Markov process

unbounded transition rate

average criterion

linear program

dual program

constrained optimal policy

MaRDI profile type

Publication

full work available at URL

https://doi.org/10.1137/100805169

title

Linear programming and constrained average optimality for general continuous-time Markov decision processes in history-dependent policies (English)

review text

The authors study constrained average optimality for continuous-time Markov decision processes in the class of randomized history-dependent policies. The state and action spaces are general Polish spaces, and the transition rates are allowed to be unbounded. The optimality criterion is the expected average cost, multiple constraints are imposed on similar expected average costs, and all costs may be unbounded both from above and from below. Based on an improved concept of a stable policy, and using an analogue of the forward Kolmogorov equation, the authors show the existence of a constrained optimal policy. They then develop a linear program (LP) that is equivalent to the constrained optimality problem and can be used to obtain a constrained optimal policy. Further, they establish the dual program (DP) of the LP and show that both the LP and the DP are solvable. Finally, the authors use a cash flow model and a controlled birth and death system to illustrate applications of the results. The list of cited references contains 39 items.
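As a rough illustration of the kind of linear program the review describes, the following sketch writes out the standard occupation-measure LP for a simplified finite model. It is not the formulation from the paper, which works on Polish spaces with unbounded rates and costs. All symbols here are assumptions introduced for illustration: a finite state space S, finite action sets A(x), conservative transition rates q(y | x, a), a cost c_0 to be minimized, constraint costs c_1, ..., c_N with bounds d_1, ..., d_N, and a decision variable \eta(x, a) interpreted as the long-run fraction of time spent in state x while using action a.

```latex
\begin{aligned}
\text{minimize}\quad
  & \sum_{x \in S} \sum_{a \in A(x)} c_0(x,a)\,\eta(x,a) \\
\text{subject to}\quad
  & \sum_{x \in S} \sum_{a \in A(x)} q(y \mid x,a)\,\eta(x,a) = 0
    && \text{for all } y \in S, \\
  & \sum_{x \in S} \sum_{a \in A(x)} c_n(x,a)\,\eta(x,a) \le d_n
    && n = 1, \dots, N, \\
  & \sum_{x \in S} \sum_{a \in A(x)} \eta(x,a) = 1, \\
  & \eta(x,a) \ge 0
    && \text{for all } x \in S,\ a \in A(x).
\end{aligned}
```

In this finite sketch, a feasible \eta induces a randomized stationary policy through \pi(a \mid x) = \eta(x,a) / \sum_{b \in A(x)} \eta(x,b) at every state x where the denominator is positive. The first constraint is the stationarity (balance) condition for the controlled process, which plays the role that the analogue of the forward Kolmogorov equation plays in the general setting.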

reviewed by

Wiesław Kotarski
