Mean, variance and probabilistic criteria in finite Markov decision processes: A review (Q1821706)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Mean, variance and probabilistic criteria in finite Markov decision processes: A review |
scientific article; zbMATH DE number 3999706
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Mean, variance and probabilistic criteria in finite Markov decision processes: A review |
scientific article; zbMATH DE number 3999706 |
Statements
Mean, variance and probabilistic criteria in finite Markov decision processes: A review (English)
0 references
1988
0 references
This paper is a survey of papers which make use of nonstandard Markov decision process criteria (i.e., those which do not seek simply to optimize expected returns per unit time or expected discounted return). It covers infinite-horizon nondiscounted formulations, infinite-horizon discounted formulations, and finite-horizon formulations. For problem formulations in terms solely of the probabilities of being in each state and taking each action, policy equivalence results are given which allow policies to be restricted to the class of Markov policies or to the randomizations of deterministic Markov policies. For problems which cannot be stated in such terms, in terms of the primitive state set I, formulations involving a redefinition of the states are examined.
0 references
survey
0 references
nonstandard Markov decision process criteria
0 references
infinite-horizon nondiscounted formulations
0 references
discounted formulations
0 references
finite-horizon
0 references
mean
0 references
variance
0 references
probabilistic criteria
0 references
0 references
0 references