Optimal Scheduling of Entropy Regularizer for Continuous-Time Linear-Quadratic Reinforcement Learning (Q6180253)

From MaRDI portal
scientific article; zbMATH DE number 7791452

    Statements

    Optimal Scheduling of Entropy Regularizer for Continuous-Time Linear-Quadratic Reinforcement Learning (English)
    19 January 2024
    continuous-time reinforcement learning
    linear-quadratic
    entropy regularization
    exploratory control
    proximal policy update
    regret analysis
