Pages that link to "Item:Q1784532"
From MaRDI portal
The following pages link to Real-time reinforcement learning by sequential actor-critics and experience replay (Q1784532):
Displaying 11 items.
- Autonomous reinforcement learning with experience replay (Q461126) (← links)
- Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems (Q463819) (← links)
- TD-regularized actor-critic methods (Q2320580) (← links)
- Reward-weighted regression with sample reuse for direct policy search in reinforcement learning (Q2887009) (← links)
- Experience selection in deep reinforcement learning for control (Q4558146) (← links)
- Artificial Intelligence and Soft Computing - ICAISC 2004 (Q4666259) (← links)
- Deep reinforcement learning via good choice resampling experience replay memory (Q4687986) (← links)
- Efficient Sample Reuse in Policy Gradients with Parameter-Based Exploration (Q5378202) (← links)
- Recursive estimation in piecewise affine systems using parameter identifiers and concurrent learning (Q5382990) (← links)
- Safe adaptive output-feedback optimal control of a class of linear systems (Q6577225) (← links)
- A Lie group PMP approach for optimal stabilization and tracking control of autonomous underwater vehicles (Q6664699) (← links)