Markov decision processes with state-dependent discount factors and unbounded rewards/costs

From MaRDI portal

Publication:408405

DOI: 10.1016/j.orl.2011.06.014; zbMath: 1235.90178; OpenAlex: W2055647224; MaRDI QID: Q408405

Xianping Guo, Qingda Wei

Publication date: 5 April 2012

Published in: Operations Research Letters

Full work available at URL: https://doi.org/10.1016/j.orl.2011.06.014

zbMATH Keywords

Markov decision processes; optimal value function; optimal stationary policy; state-dependent discount factors; unbounded costs/rewards

Mathematics Subject Classification ID

Markov and semi-Markov decision processes (90C40)

Related Items (16)

Constrained Markov control processes with randomized discounted cost criteria: infinite linear programming approach
First passage problems for nonstationary discrete-time stochastic control systems
First Passage Optimality and Variance Minimisation of Markov Decision Processes with Varying Discount Factors
Certified reinforcement learning with logic guidance
Some advances on constrained Markov decision processes in Borel spaces with random state-dependent discount factors
Delay-Minimizing Capacity Allocation in an Infinite Server-Queueing System
An average-value-at-risk criterion for Markov decision processes with unbounded costs
Discrete-time control with non-constant discount factor
Zero-sum Markov games with random state-actions-dependent discount factors: existence of optimal strategies
Finite approximation of the first passage models for discrete-time Markov decision processes with varying discount factors
Deep reinforcement learning with temporal logics
Convergence of Markov decision processes with constraints and state-action dependent discount factors
A mean field absorbing control model for interacting objects systems
Semi-Markov decision processes with variance minimization criterion
Zero-sum semi-Markov games with state-action-dependent discount factors
First passage Markov decision processes with constraints and varying discount factors
