Universal Reinforcement Learning
From MaRDI portal
Publication:5281503
DOI10.1109/TIT.2010.2043762zbMath1368.68280OpenAlexW2123742287MaRDI QIDQ5281503
Ciamac Cyrus Moallemi, Benjamin van Roy, Vivek Francis Farias, Tsachy Weissman
Publication date: 27 July 2017
Published in: IEEE Transactions on Information Theory (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1109/tit.2010.2043762
Related Items (2)
Off-policy evaluation in partially observed Markov decision processes under sequential ignorability ⋮ Approximation of average cost Markov decision processes using empirical distributions and concentration inequalities
This page was built for publication: Universal Reinforcement Learning