Pages that link to "Item:Q1937498"
From MaRDI portal
The following pages link to Approximate stochastic annealing for online control of infinite horizon Markov decision processes (Q1937498):
- An incremental off-policy search in a model-free Markov decision process using a single sample path (Q1621868)
- Model-Based Annealing Random Search with Stochastic Averaging (Q5270721)
- A Q-learning algorithm for Markov decision processes with continuous state spaces (Q6569411)