Bounds for the regret loss in dynamic programming under adaptive control
DOI: 10.1007/BF01916897 · zbMath: 0502.90085 · OpenAlex: W2006974531 · MaRDI QID: Q3968777
Publication date: 1983
Published in: Zeitschrift für Operations Research
Full work available at URL: https://doi.org/10.1007/bf01916897
Related Items (7)
- Continuous dependence of stochastic control models on the noise distribution
- On truncations and perturbations of Markov decision problems with an application to queueing network overflow control
- Nonparametric adaptive control of discrete-time partially observable stochastic systems
- Estimation and control in discounted stochastic dynamic programming
- Generalized Lipschitz-continuity of integrals with respect to a parameter of the integrating probability measure
- First-order sensitivity of the optimal value in a Markov decision model with respect to deviations in the transition probability function
- Adaptive control of Markov processes with incomplete state information and unknown parameters
Cites Work
- Stochastic optimal control. The discrete time case
- Adaptive control of Markov chains, I: Finite parameter set
- The average-optimal adaptive control of a Markov renewal model in presence of an unknown parameter
- Strong consistency of a modified maximum likelihood estimator for controlled Markov chains
- Strongly consistent estimation in a controlled Markov renewal model
- Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal
- Estimation and control in Markov chains