Bellman's principle of optimality and deep reinforcement learning for time-varying tasks (Q5043501)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Bellman's principle of optimality and deep reinforcement learning for time-varying tasks |
scientific article; zbMATH DE number 7596504
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Bellman's principle of optimality and deep reinforcement learning for time-varying tasks |
scientific article; zbMATH DE number 7596504 |
Statements
Bellman's principle of optimality and deep reinforcement learning for time-varying tasks (English)
0 references
6 October 2022
0 references
Bellman's principle
0 references
finite-horizon optimal control
0 references
deep reinforcement learning
0 references
model-free control
0 references
0 references
0.88154995
0 references
0.88154995
0 references
0.86190176
0 references
0.85830086
0 references
0.85502553
0 references
0.85178155
0 references
0.84772223
0 references