Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Adaptive control of discounted Markov decision chains - MaRDI portal

Adaptive control of discounted Markov decision chains (Q796461)

From MaRDI portal





scientific article; zbMATH DE number 3865009
Language Label Description Also known as
English
Adaptive control of discounted Markov decision chains
scientific article; zbMATH DE number 3865009

    Statements

    Adaptive control of discounted Markov decision chains (English)
    0 references
    1985
    0 references
    We consider discounted-reward finite-state Markov decision processes which depend on unknown parameters. An adaptive policy inspired by the nonstationary value iteration scheme of \textit{A. Federgruen} and \textit{P. J. Schweitzer} [ibid. 34, 207-241 (1981; Zbl 0426.90091)] is proposed. This policy is briefly compared with the principle of estimation and control recently obtained by \textit{M. Schäl} [Lect. Notes Pure Appl. Math. 86, 239-253 (1983; Zbl 0525.93071)].
    0 references
    discounted-reward finite-state Markov decision processes
    0 references
    adaptive policy
    0 references
    nonstationary value iteration
    0 references

    Identifiers