Reward-Weighted Regression with Sample Reuse for Direct Policy Search in Reinforcement Learning

From MaRDI portal
Publication:2887009