scientific article
From MaRDI portal
Publication:3096166
zbMath1225.68169MaRDI QIDQ3096166
Balázs Csanád Csáji, László Monostori
Publication date: 8 November 2011
Full work available at URL: http://www.jmlr.org/papers/v9/csaji08a.html
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Markov decision processesreinforcement learningchanging environmentsvalue function bounds\((\epsilon\delta )\)-MDPsstochastic iterative algorithms
Learning and adaptive systems in artificial intelligence (68T05) Markov and semi-Markov decision processes (90C40)
Related Items (2)
This page was built for publication: