Linear least-squares algorithms for temporal difference learning
DOI: 10.1007/BF00114723
zbMath: 0845.68091
MaRDI QID: Q1911340
Steven J. Bradtke, Andrew G. Barto
Publication date: 10 June 1996
Published in: Machine Learning
Learning and adaptive systems in artificial intelligence (68T05) ⋮ Parallel algorithms in computer science (68W10)
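This paper introduced the least-squares family of temporal difference algorithms (LSTD). As a quick orientation, below is a minimal sketch of the batch LSTD(0) estimate in its common textbook form: accumulate A = Σ φ(s)(φ(s) − γφ(s'))ᵀ and b = Σ φ(s)·r over observed transitions, then solve Aθ = b for the linear value-function weights. The `lstd0` function, the transition-tuple format, the discount factor, the ridge term, and the toy chain are illustrative assumptions, not the paper's exact presentation.

```python
import numpy as np

def lstd0(transitions, phi, gamma=0.95, ridge=1e-6):
    """Batch LSTD(0): solve A theta = b built from logged transitions.

    transitions: iterable of (s, r, s_next, done) tuples (illustrative format).
    phi:         feature map, phi(s) -> 1-D numpy array of length d.
    ridge:       small diagonal term so the solve stays well posed on short runs.
    """
    d = len(phi(transitions[0][0]))
    A = np.zeros((d, d))
    b = np.zeros(d)
    for s, r, s_next, done in transitions:
        f = phi(s)
        f_next = np.zeros(d) if done else phi(s_next)
        A += np.outer(f, f - gamma * f_next)  # A += phi (phi - gamma phi')^T
        b += r * f                            # b += r * phi
    # Solve for the value-function weights; the ridge keeps A invertible.
    return np.linalg.solve(A + ridge * np.eye(d), b)

# Tiny illustrative 3-state chain with tabular (one-hot) features.
phi = lambda s: np.eye(3)[s]
data = [(0, 0.0, 1, False), (1, 0.0, 2, False), (2, 1.0, 2, True)] * 50
theta = lstd0(data, phi)
print(theta)  # estimated state values under the logged behavior
```

Unlike incremental TD(0), this batch solve uses each sample once and has no step-size parameter; the paper also develops a recursive (RLS-style) variant of the same idea.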
Related Items
A Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement Learning ⋮ Temporal difference-based policy iteration for optimal control of stochastic systems ⋮ Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning ⋮ Model-Based Reinforcement Learning for Partially Observable Games with Sampling-Based State Estimation ⋮ Regularized feature selection in reinforcement learning ⋮ Variance Regularization in Sequential Bayesian Optimization ⋮ Finite-Time Performance of Distributed Temporal-Difference Learning with Linear Function Approximation ⋮ A Finite Time Analysis of Temporal Difference Learning with Linear Function Approximation ⋮ Off-Policy Estimation of Long-Term Average Outcomes With Applications to Mobile Health
Cites Work
- Recursive estimation and time-series analysis. An introduction
- Instrumental variable methods for system identification
- Asynchronous stochastic approximation and Q-learning
- Practical issues in temporal difference learning
- \({\mathcal Q}\)-learning
- The convergence of \(TD(\lambda)\) for general \(\lambda\)
- On the Convergence of Stochastic Iterative Dynamic Programming Algorithms
- A Stochastic Approximation Method