Control randomisation approach for policy gradient and application to reinforcement learning in optimal switching (Q6657507)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Control randomisation approach for policy gradient and application to reinforcement learning in optimal switching |
scientific article; zbMATH DE number 7962330
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Control randomisation approach for policy gradient and application to reinforcement learning in optimal switching |
scientific article; zbMATH DE number 7962330 |
Statements
Control randomisation approach for policy gradient and application to reinforcement learning in optimal switching (English)
0 references
6 January 2025
0 references
reinforcement learning in continuous time
0 references
policy gradient
0 references
control randomization
0 references
actor-critic algorithms
0 references
optimal switching
0 references
0 references
0 references
0 references
0 references
0 references