Optimistic reinforcement learning by forward Kullback-Leibler divergence optimization (Q6077011)
From MaRDI portal
scientific article; zbMATH DE number 7751345
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Optimistic reinforcement learning by forward Kullback-Leibler divergence optimization |
scientific article; zbMATH DE number 7751345 |
Statements
Optimistic reinforcement learning by forward Kullback-Leibler divergence optimization (English)
0 references
17 October 2023
0 references
reinforcement learning
0 references
control as probabilistic inference
0 references
Kullback-Leibler divergence
0 references
optimistic learning
0 references