Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Empirical Q-Value Iteration - MaRDI portal

Empirical Q-Value Iteration

From MaRDI portal

Publication:5856670

Jump to:navigation, search

DOI10.1287/stsy.2019.0062zbMath1461.68184arXiv1412.0180OpenAlexW3092476885MaRDI QIDQ5856670

Dileep Kalathil, Rahul Jain, Vivek S. Borkar

Publication date: 29 March 2021

Published in: Stochastic Systems (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1412.0180

zbMATH Keywords

simulation dynamic programming stochastic approximations empirical methods

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05) Dynamic programming (90C39) Stochastic approximation (62L20) Markov and semi-Markov decision processes (90C40)

Related Items (1)

Some Limit Properties of Markov Chains Induced by Recursive Stochastic Algorithms

Cites Work

This page was built for publication: Empirical Q-Value Iteration

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:5856670&oldid=30712720"