Pages that link to "Item:Q1870309"
From MaRDI portal
The following pages link to "From perturbation analysis to Markov decision processes and reinforcement learning" (Q1870309):
Displaying 8 items.
- Performance optimization of queueing systems with perturbation realization (Q439492)
- Stochastic control via direct comparison (Q633815)
- A unified approach to Markov decision problems and performance sensitivity analysis with discounted and average criteria: multichain cases (Q705478)
- Continuous-time Markov decision processes with \(n\)th-bias optimality criteria (Q963964)
- Basic ideas for event-based optimization of Markov systems (Q1773104)
- Policy iteration based feedback control (Q2440692)
- Reinforcement learning in finite MDPs: PAC analysis (Q2880979)
- Error bounds of optimization algorithms for semi-Markov decision processes (Q3625261)