On Dynamic Programming with Unbounded Rewards - MaRDI portal

On Dynamic Programming with Unbounded Rewards

Publication:4068423

DOI: 10.1287/mnsc.21.11.1225; zbMath: 0309.90017; OpenAlex: W2109174772; MaRDI QID: Q4068423

Steven A. Lippman

Publication date: 1975

Published in: Management Science

Full work available at URL: https://doi.org/10.1287/mnsc.21.11.1225




Related Items

Time and Ratio Expected Average Cost Optimality for Semi-Markov Control Processes on Borel Spaces
A Contraction Theorem in Inventory Problems
Finite-state approximations for denumerable state discounted Markov decision processes
Strong bounds on perturbations
Myopia and R&D/production complementarities
Optimal QoS control of interacting service stations
Zero-sum continuous-time Markov games with unbounded transition and discounted payoff rates
Control of economic systems under the process of data improvement
Controlled semi-Markov models - the discounted case
Robustness inequality for Markov control processes with unbounded costs
Estimation and control in discounted stochastic dynamic programming
The optimal frequency of information purchases
IDENTIFICATION OF DISCRETE CHOICE DYNAMIC PROGRAMMING MODELS WITH NONPARAMETRIC DISTRIBUTION OF UNOBSERVABLES
Arbitrary state semi-Markov decision processes
On theory and algorithms for Markov decision problems with the total reward criterion
Continuous time shock Markov decision processes with discounted criterion
A Verification Theorem for Threshold-Indexability of Real-State Discounted Restless Bandits
Action-dependent stopping times and Markov decision process with unbounded rewards
Time-average and asymptotically optimal flow control policies in networks with multiple transmitters
Semi-Markov decision processes with a reachable state-subset
Discrete type shock semi-Markov decision processes with Borel state space
Average cost optimal policies for Markov control processes with Borel state space and unbounded costs
Existence and uniqueness theorems for the optimal inventory equation: The back-logging case
Markov programming by successive approximations with respect to weighted supremum norms
Markov decision processes and strongly excessive functions
Condition-based maintenance policies under imperfect maintenance at scheduled and unscheduled opportunities
Recursive utility and the Ramsey problem
Two person zero-sum semi-Markov games with unknown holding times distribution on one side: A discounted payoff criterion
Mixed Markov decision processes in a semi-Markov environment with discounted criterion
Controlled semi-Markov models under long-run average rewards
Stochastic Inventory Models with Limited Production Capacity and Periodically Varying Parameters
Unnamed Item
Average Cost Semi-Markov Decision Processes and the Control of Queueing Systems
Optimal adaptive control of priority assignment in queueing systems
Finite state approximations for denumerable state infinite horizon discounted Markov decision processes with unbounded rewards
Technological expectations and adoption of improved technology
Unnamed Item
Nonstationary value-iteration and adaptive control of discounted semi-Markov processes