scientific article - MaRDI portal

Publication:2896181

zbMath: 1242.68254 ⋮ MaRDI QID: Q2896181

Evangelos A. Theodorou, Jonas Buchli, Stefan Schaal

Publication date: 13 July 2012

Full work available at URL: http://www.jmlr.org/papers/v11/theodorou10a.html

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

zbMATH Keywords

stochastic optimal control ⋮ reinforcement learning ⋮ parameterized policies

Mathematics Subject Classification ID

Classification and discrimination; cluster analysis (statistical aspects) (62H30) ⋮ Learning and adaptive systems in artificial intelligence (68T05) ⋮ Optimal stochastic control (93E20)

Related Items (26)

Adaptive importance sampling for control and inference ⋮ Probabilistic inference for determining options in reinforcement learning ⋮ Nonlinear stochastic receding horizon control: stability, robustness and Monte Carlo methods for control approximation ⋮ Active inference and agency: optimal control without cost functions ⋮ Path-integral-based reinforcement learning algorithm for goal-directed locomotion of snake-shaped robot ⋮ PI-ELM: reinforcement learning-based adaptable policy improvement for dynamical system ⋮ Incremental nonlinear stability analysis of stochastic systems perturbed by Lévy noise ⋮ A novel online gait optimization approach for biped robots with point-feet ⋮ Unnamed Item ⋮ Predictive control of linear discrete-time Markovian jump systems by learning recurrent patterns ⋮ Action selection in growing state spaces: control of network structure growth ⋮ Preference-based reinforcement learning: a formal framework and a policy iteration algorithm ⋮ Applications of variable discounting dynamic programming to iterated function systems and related problems ⋮ Optimization of market stochastic dynamics ⋮ A survey of inverse reinforcement learning: challenges, methods and progress ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Stochastic differential games: a sampling approach via FBSDEs ⋮ Kernel dynamic policy programming: applicable reinforcement learning to robot systems with high dimensional states ⋮ Phase portraits as movement primitives for fast humanoid robot control ⋮ Numerical Trajectory Optimization for Stochastic Mechanical Systems ⋮ Whence the Expected Free Energy? ⋮ Unnamed Item ⋮ Closing the gap: combining task specification and reinforcement learning for compliant vegetable cutting ⋮ A multilevel approach for stochastic nonlinear optimal control ⋮ Iterative Path Integral Approach to Nonlinear Stochastic Optimal Control Under Compound Poisson Noise
