scientific article; zbMATH DE number 6982305
From MaRDI portal
Publication:4558153
zbMath1437.68147arXiv1606.09197MaRDI QIDQ4558153
Hany Abdulsamad, Gerhard Neumann, Jan Peters, Abbas Abdolmaleki, Riad Akrour
Publication date: 21 November 2018
Full work available at URL: https://arxiv.org/abs/1606.09197
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Learning and adaptive systems in artificial intelligence (68T05) Markov and semi-Markov decision processes (90C40) Statistical aspects of information-theoretic topics (62B10)
Related Items (6)
Unnamed Item ⋮ Unnamed Item ⋮ Experiments with Tractable Feedback in Robotic Planning Under Uncertainty: Insights over a Wide Range of Noise Regimes ⋮ Compatible natural gradient policy search ⋮ TD-regularized actor-critic methods ⋮ Unnamed Item
Uses Software
Cites Work
This page was built for publication: