Pages that link to "Item:Q5378202"
From MaRDI portal
The following pages link to Efficient Sample Reuse in Policy Gradients with Parameter-Based Exploration (Q5378202):
Displaying 6 items.
- Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation (Q889297) (← links)
- Model-based reinforcement learning with dimension reduction (Q2281680) (← links)
- Reward-weighted regression with sample reuse for direct policy search in reinforcement learning (Q2887009) (← links)
- An Incremental Fast Policy Search Using a Single Sample Path (Q5045345) (← links)
- Policy search for active fault diagnosis with partially observable state (Q6496669) (← links)
- Learning under nonstationarity: covariate shift and class-balance change (Q6607922) (← links)