TD3-BC-PPO: twin delayed DDPG-based and behavior cloning-enhanced proximal policy optimization for dynamic optimization affine formation
Publication:6579244
DOI: 10.1016/j.jfranklin.2024.107018
zbMath: 1543.93194
MaRDI QID: Q6579244
Unnamed Author, Xinyu Xu, Tianrun Liu
Publication date: 25 July 2024
Published in: Journal of the Franklin Institute
Artificial neural networks and deep learning (68T07); Adaptive control/observation systems (93C40); Multi-agent systems (93A16)