Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Value iteration and rolling plans for Markov control processes with unbounded rewards - MaRDI portal

Value iteration and rolling plans for Markov control processes with unbounded rewards (Q1260895)

From MaRDI portal





scientific article; zbMATH DE number 399094
Language Label Description Also known as
English
Value iteration and rolling plans for Markov control processes with unbounded rewards
scientific article; zbMATH DE number 399094

    Statements

    Value iteration and rolling plans for Markov control processes with unbounded rewards (English)
    0 references
    5 September 1993
    0 references
    The purpose is to extend known results for discounted Markov decision processes to the convergence of the value-iteration and the existence of error bounds for rolling horizon procedures to the case of a general state space and unbounded rewards. Now the error bounds are pointwise [w.r.t. the initial states] in contrast to the known uniform bounds. Uniformness is then obtained by using weighted norms. Further, under a strong ergodicity condition the bounds can be improved. The condition assumes a positive measure as lower bound for the distributions of states.
    0 references
    discounted Markov decision processes
    0 references
    convergence of the value-iteration
    0 references
    strong ergodicity condition
    0 references

    Identifiers