Stochastic Policy Gradient Ascent in Reproducing Kernel Hilbert Spaces
From MaRDI portal
Publication:4957593
DOI10.1109/TAC.2020.3029317zbMath1471.93259arXiv1807.11274OpenAlexW3092621452MaRDI QIDQ4957593
Alejandro Ribeiro, Austin Small, Juan Andrés Bazerque, Santiago Paternain
Publication date: 9 September 2021
Published in: IEEE Transactions on Automatic Control (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/1807.11274
Markov and semi-Markov decision processes (90C40) Control/observation systems in abstract spaces (93C25) Stochastic systems in control theory (general) (93E03)
This page was built for publication: Stochastic Policy Gradient Ascent in Reproducing Kernel Hilbert Spaces