scientific article
Publication:2896181
zbMath: 1242.68254; MaRDI QID: Q2896181
Evangelos A. Theodorou, Jonas Buchli, Stefan Schaal
Publication date: 13 July 2012
Full work available at URL: http://www.jmlr.org/papers/v11/theodorou10a.html
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Classification and discrimination; cluster analysis (statistical aspects) (62H30) ⋮ Learning and adaptive systems in artificial intelligence (68T05) ⋮ Optimal stochastic control (93E20)
Related Items (26)
Adaptive importance sampling for control and inference ⋮ Probabilistic inference for determining options in reinforcement learning ⋮ Nonlinear stochastic receding horizon control: stability, robustness and Monte Carlo methods for control approximation ⋮ Active inference and agency: optimal control without cost functions ⋮ Path-integral-based reinforcement learning algorithm for goal-directed locomotion of snake-shaped robot ⋮ PI-ELM: reinforcement learning-based adaptable policy improvement for dynamical system ⋮ Incremental nonlinear stability analysis of stochastic systems perturbed by Lévy noise ⋮ A novel online gait optimization approach for biped robots with point-feet ⋮ Unnamed Item ⋮ Predictive control of linear discrete-time Markovian jump systems by learning recurrent patterns ⋮ Action selection in growing state spaces: control of network structure growth ⋮ Preference-based reinforcement learning: a formal framework and a policy iteration algorithm ⋮ Applications of variable discounting dynamic programming to iterated function systems and related problems ⋮ Optimization of market stochastic dynamics ⋮ A survey of inverse reinforcement learning: challenges, methods and progress ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Stochastic differential games: a sampling approach via FBSDEs ⋮ Kernel dynamic policy programming: applicable reinforcement learning to robot systems with high dimensional states ⋮ Phase portraits as movement primitives for fast humanoid robot control ⋮ Numerical Trajectory Optimization for Stochastic Mechanical Systems ⋮ Whence the Expected Free Energy? ⋮ Unnamed Item ⋮ Closing the gap: combining task specification and reinforcement learning for compliant vegetable cutting ⋮ A multilevel approach for stochastic nonlinear optimal control ⋮ Iterative Path Integral Approach to Nonlinear Stochastic Optimal Control Under Compound Poisson Noise