Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Approximating Markov decision processes using expected state transitions - MaRDI portal

Deprecated: Use of MediaWiki\Skin\SkinTemplate::injectLegacyMenusIntoPersonalTools was deprecated in Please make sure Skin option menus contains `user-menu` (and possibly `notifications`, `user-interface-preferences`, `user-page`) 1.46. [Called from MediaWiki\Skin\SkinTemplate::getPortletsTemplateData in /var/www/html/w/includes/Skin/SkinTemplate.php at line 691] in /var/www/html/w/includes/Debug/MWDebug.php on line 372

Deprecated: Use of MediaWiki\Skin\BaseTemplate::getPersonalTools was deprecated in 1.46 Call $this->getSkin()->getPersonalToolsForMakeListItem instead (T422975). [Called from Skins\Chameleon\Components\NavbarHorizontal\PersonalTools::getHtml in /var/www/html/w/skins/chameleon/src/Components/NavbarHorizontal/PersonalTools.php at line 66] in /var/www/html/w/includes/Debug/MWDebug.php on line 372

Deprecated: Use of QuickTemplate::(get/html/text/haveData) with parameter `personal_urls` was deprecated in MediaWiki Use content_navigation instead. [Called from MediaWiki\Skin\QuickTemplate::get in /var/www/html/w/includes/Skin/QuickTemplate.php at line 131] in /var/www/html/w/includes/Debug/MWDebug.php on line 372

Approximating Markov decision processes using expected state transitions (Q1075954)

From MaRDI portal





scientific article; zbMATH DE number 3952525
Language Label Description Also known as
English
Approximating Markov decision processes using expected state transitions
scientific article; zbMATH DE number 3952525

    Statements

    Approximating Markov decision processes using expected state transitions (English)
    0 references
    1985
    0 references
    For discounted Markov decision processes approximative solutions are obtained when in the optimality equation the expectation is replaced by the value at the expected new state. The approximate value and the true value for the approximative policy are compared to the unknown optimal value by establishing various types of bounds using e.g. Lipschitz conditions, Taylor expansions, convexity, monotonicity, and normal approximation. These bounds are demonstrated by a random walk model and by an inventory problem.
    0 references
    expected state approximation
    0 references
    discounted Markov decision processes approximative solutions
    0 references
    bounds
    0 references
    random walk model
    0 references
    0 references

    Identifiers