Mathematical Research Data Initiative
Main page
Recent changes
Random page
Help about MediaWiki
Create a new Item
Create a new Property
Merge two items
In other projects
Discussion
View source
View history
Purge
English
Log in

Accelerating reinforcement learning with a directional-Gaussian-smoothing evolution strategy

From MaRDI portal
Publication:2055215
Jump to:navigation, search

DOI10.3934/era.2021075zbMath1484.37099arXiv2002.09077OpenAlexW3200056662MaRDI QIDQ2055215

Yanyan Li

Publication date: 3 December 2021

Published in: Electronic Research Archive (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/2002.09077


zbMATH Keywords

stochastic controlreinforcement learningnon-convex optimizationGauss-Hermite quadratureGaussian smoothinghigh-dimensional optimization


Mathematics Subject Classification ID

Continuous-time Markov processes on general state spaces (60J25) Learning and adaptive systems in artificial intelligence (68T05) Stochastic learning and adaptive control (93E35) Approximation methods and numerical treatment of dynamical systems (37M99)



Uses Software

  • CMA-ES
  • OpenAI Gym
  • PyTorch
  • GitHub
  • Ray
  • Baselines
  • Pybullet



Cites Work

  • Enhanced variable-fidelity surrogate-based optimization framework by Gaussian process regression and fuzzy clustering
  • Random gradient-free minimization of convex functions
  • Unnamed Item
  • Unnamed Item
  • Unnamed Item
  • Unnamed Item




This page was built for publication: Accelerating reinforcement learning with a directional-Gaussian-smoothing evolution strategy

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:2055215&oldid=14536077"
Tools
What links here
Related changes
Special pages
Printable version
Permanent link
Page information
MaRDI portal item
This page was last edited on 1 February 2024, at 19:56.
Privacy policy
About MaRDI portal
Disclaimers
Imprint
Powered by MediaWiki