Off-policy RL algorithms can be sample-efficient for continuous control via sample multiple reuse
Publication:6539379
DOI: 10.1016/j.ins.2024.120371
MaRDI QID: Q6539379
Xiu Li, Zongqing Lu, Le Wan, Jiafei Lyu
Publication date: 14 May 2024
Published in: Information Sciences