scientific article; zbMATH DE number 1759703
From MaRDI portal
Publication:4536713
zbMath0989.68518MaRDI QIDQ4536713
Publication date: 6 August 2002
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Related Items
Unnamed Item ⋮ From Reinforcement Learning to Deep Reinforcement Learning: An Overview ⋮ Underestimation estimators to Q-learning ⋮ A penalized h-likelihood variable selection algorithm for generalized linear regression models with random effects ⋮ Reinforcement learning endowed with safe veto policies to learn the control of linked-multicomponent robotic systems