Nonstationary Markov decision problems with converging parameters

From MaRDI portal

Publication:1136706

DOI: 10.1007/BF00935474
zbMath: 0426.90091
MaRDI QID: Q1136706

Awi Federgruen, Paul J. Schweitzer

Publication date: 1981

Published in: Journal of Optimization Theory and Applications

zbMATH Keywords

convergence rates; asymptotic behavior; optimal policies; approximations for parameters; discounted version; modified value-iteration method; non-stationary Markov decision problems; undiscounted version

Mathematics Subject Classification ID

Minimax problems in mathematical programming (90C47)
Rate of convergence, degree of approximation (41A25)

Related Items

Monotone value iteration for discounted finite Markov decision processes
Finite-state approximations for denumerable multidimensional state discounted Markov decision processes
A unified approach to adaptive control of average reward Markov decision processes
Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes
Adaptive policies for discrete-time stochastic control systems with unknown disturbance distribution
Central limit theorem for the estimator of the value of an optimal stopping problem
On confidence intervals from simulation of finite Markov chains
Nonparametric adaptive control of discrete-time partially observable stochastic systems
Discretization procedures for adaptive Markov control processes
A value-iteration scheme for undiscounted multichain Markov renewal programs
Adaptive discounted control for piecewise deterministic Markov processes
The rate of convergence for backwards products of a convergent sequence of finite Markov matrices
Nonparametric estimation and adaptive control in a class of finite Markov decision chains
Computationally efficient algorithms for on-line optimization of Markov decision processes
Unnamed Item
Recursive adaptive control of Markov decision processes with the average reward criterion
Computing Optimal Policies for Markovian Decision Processes Using Simulation
Nonstationary value iteration in controlled Markov chains with risk-sensitive average criterion
Unnamed Item
Adaptive control of discounted Markov decision chains
Nonstationary value-iteration and adaptive control of discounted semi-Markov processes
Existence of optimal policy for time non-homogeneous discounted Markovian decision programming
