Pages that link to "Item:Q2320580"
From MaRDI portal
The following pages link to TD-regularized actor-critic methods (Q2320580):
Displaying 5 items.
- td-reg (Q46402) (← links)
- Improve generated adversarial imitation learning with reward variance regularization (Q2673321) (← links)
- Hyperbolically Discounted Temporal Difference Learning (Q3568377) (← links)
- Optimistic reinforcement learning by forward Kullback-Leibler divergence optimization (Q6077011) (← links)
- On the sample complexity of actor-critic method for reinforcement learning with function approximation (Q6134324) (← links)