Estimating the value of a discounted reward process

Publication:1196212

DOI: 10.1016/0167-6377(92)90002-K; zbMATH: 0771.62043; OpenAlex: W2044252233; MaRDI QID: Q1196212

Martin L. Puterman, Moshe Haviv

Publication date: 17 December 1992

Published in: Operations Research Letters

Full work available at URL: https://doi.org/10.1016/0167-6377(92)90002-k

zbMATH Keywords

differential equation; simulations; unbiased estimator; discounted reward process; expected total discounted return; expected total discounted reward; expected total undiscounted return; independent negative binomial stopping times; sampling; cumulative sums of the rewards; variance properties

Mathematics Subject Classification ID

Estimation in multivariate analysis (62H12); Inference from stochastic processes (62M99); Markov and semi-Markov decision processes (90C40)
