Hessian matrix distribution for Bayesian policy gradient reinforcement learning
From MaRDI portal
Publication:545311
DOI10.1016/j.ins.2011.01.001zbMath1216.68224OpenAlexW2046727971MaRDI QIDQ545311
TaeChoong Chung, Ngo Anh Vien, Hwanjo Yu
Publication date: 22 June 2011
Published in: Information Sciences (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.ins.2011.01.001
Markov decision processreinforcement learningBayesian policy gradientHessian matrix distributionMonte-Carlo policy gradientpolicy gradient
Related Items (1)
Cites Work
- A generic architecture for adaptive agents based on reinforcement learning
- Natural actor-critic algorithms
- Bayes-Hermite quadrature
- Fuzzy control rules extraction from perception-based information using computing with words
- Adaptive evolutionary programming based on reinforcement learning
- Adaptive stock trading with dynamic asset allocation using reinforcement learning
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
This page was built for publication: Hessian matrix distribution for Bayesian policy gradient reinforcement learning