scientific article; zbMATH DE number 7370594
Publication:4998982
Toshiki Kataoka, Yasuhiro Fujita, Prabhat Nagarajan, Takahiro Ishikawa
Publication date: 9 July 2021
Full work available at URL: https://arxiv.org/abs/1912.03905
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Related Items (2)
Uses Software
Cites Work
- QT-Opt
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
- A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
- Unnamed Item