DeepStack: Expert-level artificial intelligence in heads-up no-limit poker - MaRDI portal

Publication:4645965


DOI: 10.1126/science.aam6960
zbMath: 1403.68202
arXiv: 1701.01724
OpenAlex: W2574978968
Wikidata: Q47952679
Scholia: Q47952679
MaRDI QID: Q4645965

Michael Bowling, Viliam Lisý, Kevin Waugh, Martin J. Schmid, Matej Moravčík, Dustin Morrill, Michael Johanson, Nolan Bard, Neil Burch, Trevor Davis

Publication date: 11 January 2019

Published in: Science

Full work available at URL: https://arxiv.org/abs/1701.01724

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)
Probabilistic games; gambling (91A60)

Related Items

Successful Nash equilibrium agent for a three-player imperfect-information game
Computing human-understandable strategies: deducing fundamental rules of poker strategy
Distinguishing luck from skill through statistical simulation: a case study
Rethinking formal models of partially observable multiagent decision making
Counterfactuals as modal conditionals, and their probability
Value functions for depth-limited solving in zero-sum imperfect-information games
Solving zero-sum one-sided partially observable stochastic games
A multivariate Riesz basis of ReLU neural networks
DCENet: a dynamic correlation evolve network for short-term traffic prediction
Approximating maxmin strategies in imperfect recall games using A-loss recall property
Committing to correlated strategies with multiple leaders
Evaluating Strategic Structures in Multi-Agent Inverse Reinforcement Learning
Limited lookahead in imperfect-information games
Identifying behaviorally robust strategies for normal form games under varying forms of uncertainty
Faster algorithms for extensive-form game solving via improved smoothing functions
The Hanabi challenge: a new frontier for AI research
Analysis of Hannan consistent selection for Monte Carlo tree search in simultaneous move games
Automated construction of bounded-loss imperfect-recall abstractions in extensive-form games
Deep reinforcement learning with emergent communication for coalitional negotiation games
Generosity, selfishness and exploitation as optimal greedy strategies for resource sharing
Multi-agent reinforcement learning: a selective overview of theories and algorithms
World-class interpretable poker
Mathematical consistency and long-term behaviour of a dynamical system with a self-organising vector field
CECMLP: new cipher-based evaluating collaborative multi-layer perceptron scheme in federated learning
Robust and resource-efficient identification of two hidden layer neural networks
Computing Large Market Equilibria Using Abstractions

Uses Software

DeepStack
