Pages that link to "Item:Q2051259"
From MaRDI portal
The following pages link to Concentration bounds for temporal difference learning with linear function approximation: the case of batch data and uniform sampling (Q2051259):
Displaying 4 items.
- Relative loss bounds for temporal-difference learning (Q1397415) (← links)
- A concentration bound for \(\operatorname{LSPE}( \lambda )\) (Q2677709) (← links)
- Risk-Sensitive Reinforcement Learning via Policy Gradient Search (Q5102286) (← links)
- Concentration of Contractive Stochastic Approximation and Reinforcement Learning (Q5870773) (← links)