Mathematical Research Data Initiative

scientific article; zbMATH DE number 6982305

From MaRDI portal
Publication:4558153

zbMath: 1437.68147, arXiv: 1606.09197, MaRDI QID: Q4558153

Hany Abdulsamad, Gerhard Neumann, Jan Peters, Abbas Abdolmaleki, Riad Akrour

Publication date: 21 November 2018

Full work available at URL: https://arxiv.org/abs/1606.09197

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

zbMATH Keywords

reinforcement learning, robotics, trajectory optimization, policy optimization


Mathematics Subject Classification ID

  • Learning and adaptive systems in artificial intelligence (68T05)
  • Markov and semi-Markov decision processes (90C40)
  • Statistical aspects of information-theoretic topics (62B10)


Related Items (6)

Unnamed Item ⋮ Unnamed Item ⋮ Experiments with Tractable Feedback in Robotic Planning Under Uncertainty: Insights over a Wide Range of Noise Regimes ⋮ Compatible natural gradient policy search ⋮ TD-regularized actor-critic methods ⋮ Unnamed Item


Uses Software

  • GitHub
  • Baselines
  • PILCO


Cites Work

  • Policy gradient in Lipschitz Markov decision processes
  • Model-based contextual policy search for data-efficient generalization of robot skills
  • Algorithms for Reinforcement Learning
  • Unnamed Item
  • Unnamed Item
  • Unnamed Item
  • Unnamed Item


