Monotone value iteration for discounted finite Markov decision processes (Q1076618)
From MaRDI portal
scientific article; zbMATH DE number 3954679
Statements
Monotone value iteration for discounted finite Markov decision processes (English)
1985
This paper studies two modifications of the value iteration scheme for a finite-state, finite-action Markov decision process under the total expected discounted reward criterion. The first variant replaces the discount factor appearing in the value iteration scheme by a quantity that depends on the results of the previous iterations. The second variant adds a perturbation to the value vector at each iteration. The author shows that, for suitable choices of the parameters, both methods converge monotonically to the optimal solution, and presents examples in which these modifications perform better than the usual value iteration scheme.
Keywords: value iteration; finite state, finite actions Markovian decision process; total expected discounted reward criterion