Reliability of internal prediction/estimation and its application. I: Adaptive action selection reflecting reliability of value function - MaRDI portal


Reliability of internal prediction/estimation and its application. I: Adaptive action selection reflecting reliability of value function (Q1886590)

From MaRDI portal





Language: English
Label: Reliability of internal prediction/estimation and its application. I: Adaptive action selection reflecting reliability of value function
Description: scientific article; zbMATH DE number 2116578

    Statements

    Reliability of internal prediction/estimation and its application. I: Adaptive action selection reflecting reliability of value function (English)
    18 November 2004
    Internal prediction
    Reliability
    Model-free reinforcement learning
    TD learning
    Discount rate
    Exploration-exploitation balance
    Temperature parameter
    Meta-learning

    Identifiers