
Approximate receding horizon approach for Markov decision processes: average reward case (Q1414220)

From MaRDI portal
scientific article; zbMATH DE number 2006348

    Statements

    Approximate receding horizon approach for Markov decision processes: average reward case (English)
    20 November 2003
    The authors consider an approximation scheme for solving Markov decision processes (MDPs) with countable state space, finite action space, and bounded rewards. The scheme uses an approximate solution of a fixed finite-horizon sub-MDP of a given infinite-horizon MDP to construct a stationary policy, which they call "approximate receding horizon control". They analyze the performance of this approximate receding horizon control under certain conditions, study two examples, provide a simple proof of policy improvement for countable state spaces, and discuss practical implementation of these schemes via simulation.
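    The construction described above — solving a fixed finite-horizon sub-MDP from the current state and playing the first action of that solution as a stationary policy — can be sketched as follows. This is a minimal illustration, not the paper's method: the two-state, two-action MDP, its transition matrices, and the horizon value are all hypothetical (the paper treats countable state spaces and average-reward performance bounds, which this toy example does not reproduce).

```python
import numpy as np

# Hypothetical toy MDP: 2 states, 2 actions, bounded rewards.
# P[a][s, s'] : probability of moving s -> s' under action a
P = [np.array([[0.9, 0.1], [0.2, 0.8]]),   # action 0
     np.array([[0.5, 0.5], [0.6, 0.4]])]   # action 1
# R[s, a] : one-step reward for taking action a in state s
R = np.array([[1.0, 0.0],
              [0.0, 2.0]])

def horizon_value(H):
    """Optimal H-step total-reward value function, computed by
    backward induction on the finite-horizon sub-MDP."""
    V = np.zeros(2)
    for _ in range(H):
        Q = np.array([[R[s, a] + P[a][s] @ V for a in range(2)]
                      for s in range(2)])
        V = Q.max(axis=1)
    return V

def receding_horizon_policy(H):
    """Stationary policy: in each state, play the first action of the
    optimal H-horizon solution, i.e. act greedily with respect to the
    (H-1)-step value of the successor state."""
    V = horizon_value(H - 1)
    return np.array([int(np.argmax([R[s, a] + P[a][s] @ V
                                    for a in range(2)]))
                     for s in range(2)])

print(receding_horizon_policy(5))
```

    As the horizon H grows, the finite-horizon greedy policy is expected to approach the infinite-horizon optimal behavior; quantifying that gap under ergodicity conditions is the subject of the paper's analysis.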
    Markov decision process
    receding horizon control
    infinite-horizon average reward
    policy improvement
    ergodicity

    Identifiers