New Versions of Gradient Temporal-Difference Learning (Q6093230)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: New Versions of Gradient Temporal-Difference Learning |
scientific article; zbMATH DE number 7746652
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | New Versions of Gradient Temporal-Difference Learning |
scientific article; zbMATH DE number 7746652 |
Statements
New Versions of Gradient Temporal-Difference Learning (English)
0 references
6 October 2023
0 references
convergence
0 references
optimization
0 references
reinforcement learning (RL)
0 references
saddle-point problem
0 references
stability
0 references
temporal-difference (TD) learning
0 references
0.88171244
0 references
0.88087416
0 references
0 references
0.8525688
0 references