Mathematical Research Data Initiative
Main page
Recent changes
Random page
Help about MediaWiki
Create a new Item
Create a new Property
Create a new EntitySchema
Merge two items
In other projects
Discussion
View source
View history
Purge
English
Log in

scientific article; zbMATH DE number 6982300

From MaRDI portal
Publication:4558146
Jump to:navigation, search

zbMath1437.68150MaRDI QIDQ4558146

Robert Babuška, Tim de Bruin, Jens Kober, Karl Tuyls

Publication date: 21 November 2018

Full work available at URL: http://jmlr.csail.mit.edu/papers/v19/17-131.html

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

zbMATH Keywords

controlreinforcement learningroboticsdeep learningexperience replay


Mathematics Subject Classification ID

Applications of statistics in engineering and industry; control charts (62P30) Learning and adaptive systems in artificial intelligence (68T05)


Related Items (1)

An approach to solving optimal control problems of nonlinear systems by introducing detail-reward mechanism in deep reinforcement learning


Uses Software

  • Torch
  • Adam
  • GitHub
  • Baselines
  • VIME



Cites Work

  • Unnamed Item
  • Unnamed Item
  • Unnamed Item
  • Approximate dynamic programming with a fuzzy parameterization
  • Random sampling with a reservoir
  • A theory of the learnable
  • Sequential Decision Making With Coherent Risk
  • Stochastic gradient descent, weighted sampling, and the randomized Kaczmarz algorithm




This page was built for publication:

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4558146&oldid=18688333"
Tools
What links here
Related changes
Special pages
Printable version
Permanent link
Page information
MaRDI portal item
This page was last edited on 7 February 2024, at 11:12.
Privacy policy
About MaRDI portal
Disclaimers
Imprint
Powered by MediaWiki