scientific article
zbMath1235.68169MaRDI QIDQ2880921
Hui Li, Xuejun Liao, Lawrence Carin
Publication date: 17 April 2012
Full work available at URL: http://www.jmlr.org/papers/v10/li09b.html
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
reinforcement learningDirichlet processespartially observable Markov decision processesmulti-task learningregionalized policy representation
Bayesian problems; characterization of Bayes procedures (62C10) Learning and adaptive systems in artificial intelligence (68T05) Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.) (60J20)
Related Items (1)
This page was built for publication: