A LEARNING ALGORITHM FOR DISCRETE-TIME STOCHASTIC CONTROL

Publication:2711577

DOI: 10.1017/S0269964800142081
zbMATH: 1029.93065
OpenAlex: W1965413238
MaRDI QID: Q2711577

Author: Vivek S. Borkar

Publication date: 2 February 2004

Published in: Probability in the Engineering and Informational Sciences

Full work available at URL: https://doi.org/10.1017/s0269964800142081

zbMATH Keywords

nonlinear control; learning; almost sure convergence; compact state space; discrete-time stochastic control; compact action space; \(Q\)-learning algorithm; simulation based algorithm

Mathematics Subject Classification ID

Nonlinear systems in control theory (93C10); Discrete-time control/observation systems (93C55); Stochastic learning and adaptive control (93E35)

Related Items (1)

Stochastic approximation algorithms: overview and recent trends.
