Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
scientific article - MaRDI portal

scientific article

From MaRDI portal

Publication:3312038

Jump to:navigation, search

zbMath0529.90092MaRDI QIDQ3312038

Manfred Schäl

Publication date: 1984

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

zbMATH Keywords

average reward criterion discounted reward criterion compact state and action spaces asymptotically optimal policy consistent asymptotic estimator incompletely known law of motion sequential Markov decision models

Mathematics Subject Classification ID

Inference from stochastic processes (62M99) Statistical decision theory (62C99) Markov and semi-Markov decision processes (90C40)

Related Items (2)

Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes ⋮ Adaptive discounted control for piecewise deterministic Markov processes

This page was built for publication:

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:3312038&oldid=16539777"