Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Geometric convergence of value-iteration in multichain Markov decision problems - MaRDI portal

Geometric convergence of value-iteration in multichain Markov decision problems

From MaRDI portal
Publication:4187616

DOI10.2307/1426774zbMath0402.90097OpenAlexW2005357215MaRDI QIDQ4187616

Awi Federgruen, Paul J. Schweitzer

Publication date: 1979

Published in: Advances in Applied Probability (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.2307/1426774




Related Items

Variational characterizations in Markov decision processesAsymptotic expansions for dynamic programming recursions with general nonnegative matricesDual bounds on the equilibrium distribution of a finite Markov chainContraction mappings underlying undiscounted Markov decision problems. IIComputing transience bounds of emergency call centers: a hierarchical timed Petri net approachA Brouwer fixed-point mapping approach to communicating Markov decision processesA value-iteration scheme for undiscounted multichain Markov renewal programsNonstationary Markov decision problems with converging parametersThe method of value oriented successive approximations for the average reward Markov decision processConvergence of iterates in nonlinear Perron-Frobenius theoryA value iteration method for undiscounted multichain Markov decision processesCriteria for selecting the relaxation factor of the value iteration algorithm for undiscounted Markov and semi-Markov decision processesIllustrated review of convergence conditions of the value iteration algorithm and the rolling horizon procedure for average-cost MDPsContraction mappings underlying undiscounted Markov decision problemsImproved iterative computation of the expected discounted return in Markov and semi-Markov chainsMARKOV DECISION PROCESSESPiecewise Affine Dynamical Models of Petri Nets – Application to Emergency Call Centers*Guaranteed approximation of Markov chains with applications to multiplexer engineering in ATM networks