scientific article; zbMATH DE number 7306889
Publication:5148991
Michael L. Littman, Lucas Lehnert
Publication date: 5 February 2021
Full work available at URL: https://arxiv.org/abs/1901.11437
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Related Items (1)
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Equivalence notions and model minimization in Markov decision processes
- \({\mathcal Q}\)-learning
- Bisimulation Metrics for Continuous Markov Decision Processes
- A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
- Learning Theory and Kernel Machines