DeepStack: Expert-level artificial intelligence in heads-up no-limit poker
From MaRDI portal
Publication:4645965
DOI10.1126/science.aam6960zbMath1403.68202arXiv1701.01724OpenAlexW2574978968WikidataQ47952679 ScholiaQ47952679MaRDI QIDQ4645965
Michael Bowling, Viliam Lisý, Kevin Waugh, Martin J. Schmid, Matej Moravčík, Dustin Morrill, Michael Johanson, Nolan Bard, Neil Burch, Trevor Davis
Publication date: 11 January 2019
Published in: Science (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/1701.01724
Learning and adaptive systems in artificial intelligence (68T05) Probabilistic games; gambling (91A60)
Related Items
Successful Nash equilibrium agent for a three-player imperfect-information game, Computing human-understandable strategies: deducing fundamental rules of poker strategy, Distinguishing luck from skill through statistical simulation: a case study, Rethinking formal models of partially observable multiagent decision making, Counterfactuals as modal conditionals, and their probability, Value functions for depth-limited solving in zero-sum imperfect-information games, Solving zero-sum one-sided partially observable stochastic games, A multivariate Riesz basis of ReLU neural networks, DCENet: a dynamic correlation evolve network for short-term traffic prediction, Approximating maxmin strategies in imperfect recall games using A-loss recall property, Committing to correlated strategies with multiple leaders, Evaluating Strategic Structures in Multi-Agent Inverse Reinforcement Learning, Limited lookahead in imperfect-information games, Identifying behaviorally robust strategies for normal form games under varying forms of uncertainty, Faster algorithms for extensive-form game solving via improved smoothing functions, The Hanabi challenge: a new frontier for AI research, Analysis of Hannan consistent selection for Monte Carlo tree search in simultaneous move games, Automated construction of bounded-loss imperfect-recall abstractions in extensive-form games, Deep reinforcement learning with emergent communication for coalitional negotiation games, Generosity, selfishness and exploitation as optimal greedy strategies for resource sharing, Multi-agent reinforcement learning: a selective overview of theories and algorithms, World-class interpretable poker, Mathematical consistency and long-term behaviour of a dynamical system with a self-organising vector field, CECMLP: new cipher-based evaluating collaborative multi-layer perceptron scheme in federated learning, Robust and resource-efficient identification of two hidden layer neural networks, Computing Large Market Equilibria Using Abstractions
Uses Software