Policy Gradient for Continuing Tasks in Discounted Markov Decision Processes
Publication:6075992
DOI: 10.1109/TAC.2022.3163085
OpenAlex: W4226204016
MaRDI QID: Q6075992
Juan Andrés Bazerque, Alejandro Ribeiro, Santiago Paternain
Publication date: 21 September 2023
Published in: IEEE Transactions on Automatic Control
Full work available at URL: https://doi.org/10.1109/tac.2022.3163085