Approximation of Markov decision processes with general state space
From MaRDI portal
Publication:663675
DOI10.1016/j.jmaa.2011.11.015zbMath1232.90342OpenAlexW2017419329MaRDI QIDQ663675
Tomás Prieto-Rumeau, François Dufour
Publication date: 27 February 2012
Published in: Journal of Mathematical Analysis and Applications (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.jmaa.2011.11.015
Numerical mathematical programming methods (65K05) Continuous-time Markov processes on general state spaces (60J25) Case-oriented studies in operations research (90B90) Markov and semi-Markov decision processes (90C40)
Related Items
A convex optimization approach to dynamic programming in continuous state and action spaces, Conditions for the solvability of the linear programming formulation for constrained discounted Markov decision processes, Nonasymptotic Analysis of Monte Carlo Tree Search, From Infinite to Finite Programs: Explicit Error Bounds with Applications to Approximate Dynamic Programming, Near optimality of quantized policies in stochastic control under weak continuity conditions, Certified reinforcement learning with logic guidance, Approximation of discounted minimax Markov control problems and zero-sum Markov games using Hausdorff and Wasserstein distances, On Finite Approximations to Markov Decision Processes with Recursive and Nonlinear Discounting, Unnamed Item, Quantitative model-checking of controlled discrete-time Markov processes, Relevant states and memory in Markov chain bootstrapping and simulation, Computable approximations for average Markov decision processes in continuous time, A perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costs, Stochastic approximations of constrained discounted Markov decision processes, Approximation of average cost Markov decision processes using empirical distributions and concentration inequalities, A stability result for linear Markovian stochastic optimization problems
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Lipschitz continuity of value functions in Markovian decision processes
- Simulation-based algorithms for Markov decision processes.
- Stochastic optimal control. The discrete time case
- Adaptive Markov control processes
- Convergence of Dynamic Programming Models
- Convergence of discretization procedures in dynamic programming
- Approximations of Dynamic Programs, I
- Using Randomization to Break the Curse of Dimensionality
- Discrete-Time Controlled Markov Processes with Average Cost Criterion: A Survey
- Learning Near-Optimal Policies with Bellman-Residual Minimization Based Fitted Policy Iteration and a Single Sample Path
- Approximate Dynamic Programming