Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Distributed Policy Evaluation Under Multiple Behavior Strategies - MaRDI portal

Distributed Policy Evaluation Under Multiple Behavior Strategies

From MaRDI portal

Publication:2982737

Jump to:navigation, search

DOI10.1109/TAC.2014.2368731zbMath1360.68714arXiv1312.7606OpenAlexW2144672231MaRDI QIDQ2982737

Ali H. Sayed, Jianshu Chen, Sergio Valcarcel Macua, S. Zazo

Publication date: 16 May 2017

Published in: IEEE Transactions on Automatic Control (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1312.7606

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05) Markov and semi-Markov decision processes (90C40) Distributed algorithms (68W15)

Related Items (6)

Scalable Reinforcement Learning for Multiagent Networked Systems ⋮ Linear convergence of primal-dual gradient methods and their performance in distributed optimization ⋮ Distributed consensus-based multi-agent temporal-difference learning ⋮ Finite-Time Performance of Distributed Temporal-Difference Learning with Linear Function Approximation ⋮ A Finite Time Analysis of Temporal Difference Learning with Linear Function Approximation ⋮ Multi-agent reinforcement learning: a selective overview of theories and algorithms

This page was built for publication: Distributed Policy Evaluation Under Multiple Behavior Strategies

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:2982737&oldid=15990552"