Pages that link to "Item:Q3593009"
From MaRDI portal
The following pages link to Simulation‐based Uniform Value Function Estimates of Markov Decision Processes (Q3593009):
Displaying 7 items.
- Simulation-based optimization of Markov decision processes: an empirical process theory approach (Q608432) (← links)
- Near optimality of quantized policies in stochastic control under weak continuity conditions (Q892326) (← links)
- A survey of some simulation-based algorithms for Markov decision processes (Q937352) (← links)
- On the convergence of reinforcement learning with Monte Carlo exploring starts (Q2665181) (← links)
- Empirical dynamic programming (Q2806811) (← links)
- Computing optimal policies for Markovian decision processes using simulation (Q2808287) (← links)
- Bias and Variance Approximation in Value Function Estimates (Q3116079) (← links)