scientific article
From MaRDI portal
Publication:3148833
zbMath0992.68090MaRDI QIDQ3148833
Leonid Peshkin, Sayan Mukherjee
Publication date: 22 September 2002
Full work available at URL: http://link.springer.de/link/service/series/0558/bibs/2111/21110616
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Related Items (3)
Simulation-based optimization of Markov decision processes: an empirical process theory approach ⋮ Autonomous reinforcement learning with experience replay ⋮ Real-time reinforcement learning by sequential actor-critics and experience replay
This page was built for publication: