Distributed multi-agent temporal-difference learning with full neighbor information
From MaRDI portal
Publication:4995743
DOI10.1007/s11768-020-00016-wzbMath1474.68268OpenAlexW3101698686MaRDI QIDQ4995743
Jiangping Hu, Zhinan Peng, Rui Luo, Bijoy Kumar Ghosh
Publication date: 1 July 2021
Published in: Control Theory and Technology (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1007/s11768-020-00016-w
Learning and adaptive systems in artificial intelligence (68T05) Distributed algorithms (68W15) Agent technology and artificial intelligence (68T42)
This page was built for publication: Distributed multi-agent temporal-difference learning with full neighbor information