Learning the variance of the reward-to-go (Q2810778)
scientific article; zbMATH DE number 6589427
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Learning the variance of the reward-to-go | scientific article; zbMATH DE number 6589427 | |
Statements
6 June 2016
reinforcement learning
Markov decision processes
variance estimation
simulation
temporal differences
Learning the variance of the reward-to-go (English)