Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes
From MaRDI portal
Publication:5092299
DOI10.1109/TAC.2021.3108121OpenAlexW3198564127WikidataQ114147847 ScholiaQ114147847MaRDI QIDQ5092299
Abhay Karandikar, Prasanna Chaporkar, Vivek S. Borkar, Arghyadip Roy
Publication date: 28 July 2022
Published in: IEEE Transactions on Automatic Control (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/1912.10325
Related Items (1)
This page was built for publication: Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes