
scientific article; zbMATH DE number 7370594

From MaRDI portal
Publication:4998982

MaRDI QIDQ4998982

Toshiki Kataoka, Yasuhiro Fujita, Prabhat Nagarajan, Takahiro Ishikawa

Publication date: 9 July 2021

Full work available at URL: https://arxiv.org/abs/1912.03905

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

zbMATH Keywords

reinforcement learning, reproducibility, open source software, deep reinforcement learning, Chainer


Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)


Related Items (2)

Unnamed Item ⋮ ChainerRL


Uses Software

  • GitHub
  • Dopamine
  • rlpyt
  • RLlib
  • Catalyst.RL
  • AlphaZero
  • Stable Baselines
  • Baselines
  • QT-Opt



Cites Work

  • QT-Opt
  • Simple statistical gradient-following algorithms for connectionist reinforcement learning
  • Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
  • A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
  • Unnamed Item





Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4998982&oldid=19452525"
This page was last edited on 8 February 2024, at 09:59.