Pages that link to "Item:Q5928992"
From MaRDI portal
The following pages link to On the convergence of temporal-difference learning with linear function approximation (Q5928992):
Displaying 15 items.
- Multiscale Q-learning with linear function approximation (Q312650)
- Asymptotic analysis of value prediction by well-specified and misspecified models (Q448322)
- Proximal algorithms and temporal difference methods for solving fixed point problems (Q721950)
- A policy gradient method for semi-Markov decision processes with application to call admission control (Q859693)
- A generalized Kalman filter for fixed point approximation and efficient temporal-difference learning (Q859737)
- Natural actor-critic algorithms (Q1049136)
- Relative loss bounds for temporal-difference learning (Q1397415)
- The convergence of \(TD(\lambda)\) for general \(\lambda\) (Q1812934)
- Restricted gradient-descent algorithm for value-function approximation in reinforcement learning (Q2389624)
- Basis function adaptation in temporal difference reinforcement learning (Q2485935)
- Chaotic dynamics and convergence analysis of temporal difference algorithms with bang-bang control (Q2800471)
- Flow shop scheduling with reinforcement learning (Q2868184)
- On the Convergence of Stochastic Iterative Dynamic Programming Algorithms (Q4323346)
- Asymptotic analysis of temporal-difference learning algorithms with constant step-sizes (Q5898263)
- Asymptotic analysis of temporal-difference learning algorithms with constant step-sizes (Q5920615)