scientific article
From MaRDI portal
Publication:3174029
zbMath1222.93210MaRDI QIDQ3174029
Vivek S. Borkar, Madhukar Akarapu, Shalabh Bhatnagar
Publication date: 12 October 2011
Full work available at URL: http://www.jmlr.org/papers/v7/bhatnagar06a.html
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Markov decision processesreinforcement learningoptimal control conditioned on a rare eventsimulation based algorithmsSPSA with deterministic perturbations
Markov chains (discrete-time Markov processes on discrete state spaces) (60J10) Stochastic systems in control theory (general) (93E03)
Related Items (1)
This page was built for publication: