Distributed Policy Evaluation Under Multiple Behavior Strategies
From MaRDI portal
Publication:2982737
DOI10.1109/TAC.2014.2368731zbMath1360.68714arXiv1312.7606OpenAlexW2144672231MaRDI QIDQ2982737
Ali H. Sayed, Jianshu Chen, Sergio Valcarcel Macua, S. Zazo
Publication date: 16 May 2017
Published in: IEEE Transactions on Automatic Control (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/1312.7606
Learning and adaptive systems in artificial intelligence (68T05) Markov and semi-Markov decision processes (90C40) Distributed algorithms (68W15)
Related Items (6)
Scalable Reinforcement Learning for Multiagent Networked Systems ⋮ Linear convergence of primal-dual gradient methods and their performance in distributed optimization ⋮ Distributed consensus-based multi-agent temporal-difference learning ⋮ Finite-Time Performance of Distributed Temporal-Difference Learning with Linear Function Approximation ⋮ A Finite Time Analysis of Temporal Difference Learning with Linear Function Approximation ⋮ Multi-agent reinforcement learning: a selective overview of theories and algorithms
This page was built for publication: Distributed Policy Evaluation Under Multiple Behavior Strategies