${{\cal Q} {\cal D}}$-Learning: A Collaborative Distributed Strategy for Multi-Agent Reinforcement Learning Through ${\rm Consensus} + {\rm Innovations}$
DOI10.1109/TSP.2013.2241057zbMath1393.94293arXiv1205.0047OpenAlexW1918371733MaRDI QIDQ4578509
José M. F. Moura, Soummya Kar, H. Vincent Poor
Publication date: 22 August 2018
Published in: IEEE Transactions on Signal Processing (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/1205.0047
Applications of stochastic analysis (to PDEs, etc.) (60H30) Signal theory (characterization, reconstruction, filtering, etc.) (94A12) Stochastic approximation (62L20) Markov and semi-Markov decision processes (90C40)
Related Items (8)
This page was built for publication: ${{\cal Q} {\cal D}}$-Learning: A Collaborative Distributed Strategy for Multi-Agent Reinforcement Learning Through ${\rm Consensus} + {\rm Innovations}$