Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Action-dependent stopping times and Markov decision process with unbounded rewards - MaRDI portal

Action-dependent stopping times and Markov decision process with unbounded rewards (Q1158111)

From MaRDI portal

Jump to:navigation, search

This is the item page for this Wikibase entity, intended for internal use and editing purposes.

Please use this page instead for the normal view: Action-dependent stopping times and Markov decision process with unbounded rewards

scientific article; zbMATH DE number 3739321

Language	Label	Description	Also known as
English	Action-dependent stopping times and Markov decision process with unbounded rewards	scientific article; zbMATH DE number 3739321

Statements

scholarly article

0 references

Action-dependent stopping times and Markov decision process with unbounded rewards (English)

0 references

0 references

publication date

1981

0 references

zbMATH Keywords

successive-approximation method

0 references

semi Markov decision processes

0 references

unbounded rewards

0 references

actions-dependent stopping time

0 references

algorithm

0 references

equal-row- sum property

0 references

lower bounds

0 references

elimination of non-optimal actions

0 references

upper bounds

0 references

J. A. E. E. Van Nunen

0 references

Shaler jun. Stidham

0 references

MaRDI profile type

0 references

Note—A Test for Nonoptimal Actions in Undiscounted Finite Markov Decision Chains

0 references

Markov decision processes and strongly excessive functions

0 references

0 references

Applying a New Device in the Optimization of Exponential Queuing Systems

0 references

On Dynamic Programming with Unbounded Rewards

0 references

Discounting, Ergodicity and Convergence for Markov Decision Processes

0 references

0 references

A set of successive approximation methods for discounted Markovian decision problems

0 references

0 references

Note—A Note on Dynamic Programming with Unbounded Rewards

0 references

Successive approximations for Markov decision processes and Markov games with unbounded rewards

0 references

On theory and algorithms for Markov decision problems with the total reward criterion

0 references

Bounds and Transformations for Discounted Finite Markov Decision Chains

0 references

0 references

Iterative solution of the functional equations of undiscounted Markov renewal programming

0 references

0 references

Technical Note—An Equivalence Between Continuous and Discrete Time Markov Decision Processes

0 references

Markov programming by successive approximations with respect to weighted supremum norms

0 references

full work available at URL

https://doi.org/10.1007/bf01783952

0 references

Identifiers

zbMATH Open document ID

0 references

10.1007/BF01783952

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

zbMATH DE Number

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:1158111

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Item:Q1158111&oldid=42919960"