Learning the variance of the reward-to-go (Q2810778)

scientific article; zbMATH DE number 6589427

Language	Label	Description	Also known as
English	Learning the variance of the reward-to-go	scientific article; zbMATH DE number 6589427

Statements

instance of

0 references

0 references

0 references

0 references

6 June 2016

0 references

full work available at URL

http://jmlr.csail.mit.edu/papers/v17/14-335.html

0 references

zbMATH Keywords

reinforcement learning

0 references

Markov decision processes

0 references

variance estimation

0 references

simulation

0 references

temporal differences

0 references

MaRDI profile type

Publication

0 references

title

Learning the variance of the reward-to-go (English)

0 references

published in

Journal of Machine Learning Research (JMLR)

0 references

Identifiers

zbMATH Open document ID

1360.68713

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:2810778