The method of value oriented successive approximations for the average reward Markov decision process (Q1144501)

scientific article; zbMATH DE number 3693049

Language	Label	Description	Also known as
English	The method of value oriented successive approximations for the average reward Markov decision process	scientific article; zbMATH DE number 3693049

Statements

instance of

scholarly article

0 references

title

The method of value oriented successive approximations for the average reward Markov decision process (English)

0 references

0 references

0 references

1980

0 references

zbMATH Keywords

value oriented successive approximations

0 references

average reward

0 references

finite state space

0 references

finite action space

0 references

almost optimal solutions

0 references

convergence

0 references

MaRDI profile type

Publication

0 references

cites work

Optimal decision procedures for finite Markov chains. Part II: Communicating systems

0 references

Q3245701

0 references

Q3251743

0 references

Technical Note—Bounds on the Gain of a Markov Decision Process

0 references

Technical Note—The Method of Successive Approximations and Markovian Decision Problems

0 references

Q3266141

0 references

Technical Note—Undiscounted Markov Renewal Programming Via Modified Successive Approximations

0 references

Discounting, Ergodicity and Convergence for Markov Decision Processes

0 references

A set of successive approximation methods for discounted Markovian decision problems

0 references

Q4190426

0 references

On Finding the Maximal Gain for Markov Decision Processes

0 references

Technical Note—Improved Conditions for Convergence in Undiscounted Markov Renewal Programming

0 references

Some Bounds for Discounted Sequential Decision Processes

0 references

Iterative solution of the functional equations of undiscounted Markov renewal programming

0 references

The Asymptotic Behavior of Undiscounted Value Iteration in Markov Decision Problems

0 references

Geometric convergence of value-iteration in multichain Markov decision problems

0 references

A successive approximation algorithm for an undiscounted Markov decision process

0 references

Dynamic programming, Markov chains, and the method of successive approximations

0 references

full work available at URL

https://doi.org/10.1007/bf01719500

0 references

Identifiers

zbMATH Open document ID

0443.90109

0 references

DOI

10.1007/BF01719500

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:1144501