Online Markov Decision Processes With Kullback–Leibler Control Cost
Publication:2983139
DOI: 10.1109/TAC.2014.2301558
zbMath: 1360.90277
arXiv: 1401.3198
MaRDI QID: Q2983139
Rebecca Willett, Maxim Raginsky, Peng Guan
Publication date: 16 May 2017
Published in: IEEE Transactions on Automatic Control
Full work available at URL: https://arxiv.org/abs/1401.3198
Related Items (7)
On the Origins of Imperfection and Apparent Non-rationality
Optimal design of priors constrained by external predictors
Axiomatisation of fully probabilistic design revisited
Unnamed Item
Model-based preference quantification
Ordinary Differential Equation Methods for Markov Decision Processes and Application to Kullback–Leibler Control Cost
On a probabilistic approach to synthesize control policies from example datasets