Learning and Planning for Time-Varying MDPs Using Maximum Likelihood Estimation
From MaRDI portal
Publication:6330080
arXiv1911.12976MaRDI QIDQ6330080
Publication date: 29 November 2019
Learning and adaptive systems in artificial intelligence (68T05) Markov and semi-Markov decision processes (90C40) Problem solving in the context of artificial intelligence (heuristics, search strategies, etc.) (68T20) Empirical decision procedures; empirical Bayes procedures (62C12)
This page was built for publication: Learning and Planning for Time-Varying MDPs Using Maximum Likelihood Estimation