Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
The Asymptotic Behavior of Undiscounted Value Iteration in Markov Decision Problems - MaRDI portal

The Asymptotic Behavior of Undiscounted Value Iteration in Markov Decision Problems

From MaRDI portal
Publication:4187617

DOI10.1287/moor.2.4.360zbMath0402.90098OpenAlexW2128857696MaRDI QIDQ4187617

Awi Federgruen, Paul J. Schweitzer

Publication date: 1977

Published in: Mathematics of Operations Research (Search for Journal in Brave)

Full work available at URL: https://semanticscholar.org/paper/e8f7193b21b6805928c07cbebc8fd09eb6bfff42




Related Items (18)

Dual bounds on the equilibrium distribution of a finite Markov chainA methodology for computation reduction for specially structured large scale Markov decision problemsContraction mappings underlying undiscounted Markov decision problems. IIComputing transience bounds of emergency call centers: a hierarchical timed Petri net approachA Brouwer fixed-point mapping approach to communicating Markov decision processesConnectedness conditions used in finite state Markov decision processesNonstationary Markov decision problems with converging parametersThe method of value oriented successive approximations for the average reward Markov decision processUnnamed ItemSeparable value functions for infinite horizon average reward Markov decision processesA value iteration method for undiscounted multichain Markov decision processesMarkov decision processesIllustrated review of convergence conditions of the value iteration algorithm and the rolling horizon procedure for average-cost MDPsContraction mappings underlying undiscounted Markov decision problemsValue iteration in countable state average cost Markov decision processes with unbounded costsSpectral theorem for convex monotone homogeneous maps, and ergodic controlPiecewise Affine Dynamical Models of Petri Nets – Application to Emergency Call Centers*Optimal pricing for a \(\mathrm{GI}/\mathrm{M}/k/N\) queue with several customer types and holding costs






This page was built for publication: The Asymptotic Behavior of Undiscounted Value Iteration in Markov Decision Problems