Rollout sampling approximate policy iteration (Q2036256)
From MaRDI portal
scientific article; zbMATH DE number 7364052
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Rollout sampling approximate policy iteration | scientific article; zbMATH DE number 7364052 | |
Statements
Rollout sampling approximate policy iteration (English)
28 June 2021
reinforcement learning
approximate policy iteration
rollouts
bandit problems
classification
sample complexity