Learning action probabilities from delayed reinforcement
From MaRDI portal
Publication:4278272
DOI10.1080/00207729308949639zbMath0798.68143OpenAlexW2165244003MaRDI QIDQ4278272
Publication date: 31 October 1994
Published in: International Journal of Systems Science (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1080/00207729308949639
Cites Work
This page was built for publication: Learning action probabilities from delayed reinforcement