Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Multiagent learning using a variable learning rate - MaRDI portal

Multiagent learning using a variable learning rate

From MaRDI portal

Publication:1605410

Jump to:navigation, search

DOI10.1016/S0004-3702(02)00121-2zbMath0995.68075OpenAlexW2120327309MaRDI QIDQ1605410

Michael Bowling, Manuela M. Veloso

Publication date: 15 July 2002

Published in: Artificial Intelligence (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1016/s0004-3702(02)00121-2

zbMATH Keywords

game theory reinforcement learning multiagent learning

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)

Related Items

Belief and truth in hypothesised behaviours, FUZZY STATE AGGREGATION AND POLICY HILL CLIMBING FOR STOCHASTIC ENVIRONMENTS, EAQR: a multiagent Q-learning algorithm for coordination of multiple agents, Autonomous agents modelling other agents: a comprehensive survey and open problems, AWESOME: a general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents, A general criterion and an algorithmic framework for learning in multi-agent systems, Introduction to the special issue on learning and computational game theory, GENERAL PROOF OF CONVERGENCE OF THE NASH-Q-LEARNING ALGORITHM, Exploration-exploitation in multi-agent learning: catastrophe theory meets game theory, Continuous learning methods in two-buyer pricing problem, Learning equilibrium in bilateral bargaining games, Model Checking for Safe Navigation Among Humans, Learning to compete, coordinate, and cooperate in repeated games using reinforcement learning, Multi-agent machine learning in self-organizing systems, Learning efficient Nash equilibria in distributed systems, $Q$-Learning in a Stochastic Stackelberg Game between an Uninformed Leader and a Naive Follower, Decentralized reinforcement learning of robot behaviors, SOLVING CONSTRAINED OPTIMIZATION PROBLEMS USING PROBABILITY COLLECTIVES AND A PENALTY FUNCTION APPROACH, On-policy concurrent reinforcement learning, Single-leader-multiple-follower games with boundedly rational agents, A distributed algorithm to obtain repeated games equilibria with discounting, Sharing in teams of heterogeneous, collaborative learning agents, Negotiating team formation using deep reinforcement learning, When autonomous agents model other agents: an appeal for altered judgment coupled with mouths, ears, and a little more tape, Analysis of Hannan consistent selection for Monte Carlo tree search in simultaneous move games, Unnamed Item, COOPERATIVE LEARNING BY POLICY-SHARING IN MULTIPLE AGENTS, Perspectives on multiagent learning, Learning with policy prediction in continuous state-action multi-agent decision processes, An adjustment scheme for nonlinear pricing problem with two buyers, Multi-agent reinforcement learning: a selective overview of theories and algorithms, A Probability Collectives Approach for Multi-Agent Distributed and Cooperative Optimization with Tolerance for Agent Failure

Cites Work

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:1605410&oldid=13908058"