scientific article
Publication:3624133
zbMath: 1182.68261; arXiv: 1111.0062; MaRDI QID: Q3624133
Matthijs T. J. Spaan, Frans A. Oliehoek, Nikos Vlassis
Publication date: 28 April 2009
Full work available at URL: https://arxiv.org/abs/1111.0062
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Related Items (8)
Controlling a Fleet of Unmanned Aerial Vehicles to Collect Uncertain Information in a Threat Environment
Improving coordination in small-scale multi-agent deep reinforcement learning through memory-driven communication
A unified framework for stochastic optimization
Online planning for multi-agent systems with bounded communication
Multi-agent reinforcement learning algorithm to solve a partially-observable multi-agent problem in disaster response
Unnamed Item
A Sufficient Statistic for Influence in Structured Multiagent Environments
A leader-follower partially observed, multiobjective Markov game