Mathematical Research Data Initiative
Main page
Recent changes
Random page
Help about MediaWiki
Create a new Item
Create a new Property
Create a new EntitySchema
Merge two items
In other projects
Discussion
View source
View history
Purge
English
Log in

scientific article; zbMATH DE number 7415125

From MaRDI portal
Jump to:navigation, search

MaRDI QIDQ5159474

Jan Peters, Pascal Klink, Carlo D'Eramo, Boris Belousov, Hany Abdulsamad, Joni Pajarinen

Publication date: 27 October 2021

Full work available at URL: https://arxiv.org/abs/2102.13176

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

zbMATH Keywords

reinforcement learningcurriculum learningself-paced learningRL-as-inferencetempered inference


Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)


Related Items

Unnamed Item


Uses Software

  • CMA-ES
  • SciPy
  • GitHub
  • MuJoCo
  • Stable Baselines
  • VIREL
  • VIME


Cites Work

  • Unnamed Item
  • Unnamed Item
  • Unnamed Item
  • Unnamed Item
  • Unnamed Item
  • Unnamed Item
  • Optimization by Simulated Annealing
  • Nearly unbiased variable selection under minimax concave penalty
  • A theoretical understanding of self-paced learning
  • Convex Optimization: Algorithms and Complexity
  • Using Expectation-Maximization for Reinforcement Learning
  • Introduction to Derivative-Free Optimization
  • BRINGING UP ROBOT: FUNDAMENTAL MECHANISMS FOR CREATING A SELF-MOTIVATED, SELF-ORGANIZING ARCHITECTURE
  • Probabilistic numerics and uncertainty in computations
  • Computer Vision
  • Simulating normalizing constants: From importance sampling to bridge sampling to path sampling
Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:5159474&oldid=19714410"
Tools
What links here
Related changes
Special pages
Printable version
Permanent link
Page information
MaRDI portal item
This page was last edited on 8 February 2024, at 16:18.
Privacy policy
About MaRDI portal
Disclaimers
Imprint
Powered by MediaWiki