Mathematical Research Data Initiative
Main page
Recent changes
Random page
Help about MediaWiki
Create a new Item
Create a new Property
Merge two items
In other projects
MaRDI portal item
Discussion
View source
View history
Purge
English
Log in

scientific article

From MaRDI portal
Publication:2880921
Jump to:navigation, search

zbMath1235.68169MaRDI QIDQ2880921

Hui Li, Xuejun Liao, Lawrence Carin

Publication date: 17 April 2012

Full work available at URL: http://www.jmlr.org/papers/v10/li09b.html

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

zbMATH Keywords

reinforcement learningDirichlet processespartially observable Markov decision processesmulti-task learningregionalized policy representation


Mathematics Subject Classification ID

Bayesian problems; characterization of Bayes procedures (62C10) Learning and adaptive systems in artificial intelligence (68T05) Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.) (60J20)


Related Items (1)

Partially observable collaborative model for optimizing personalized treatment selection







This page was built for publication:

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:2880921&oldid=15834260"
Tools
What links here
Related changes
Special pages
Printable version
Permanent link
Page information
This page was last edited on 3 February 2024, at 19:31.
Privacy policy
About MaRDI portal
Disclaimers
Imprint
Powered by MediaWiki