Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
On Dynamic Programming with Unbounded Rewards - MaRDI portal

On Dynamic Programming with Unbounded Rewards

From MaRDI portal

Publication:4068423

Jump to:navigation, search

DOI10.1287/mnsc.21.11.1225zbMath0309.90017OpenAlexW2109174772MaRDI QIDQ4068423

Steven A. Lippman

Publication date: 1975

Published in: Management Science (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1287/mnsc.21.11.1225

Mathematics Subject Classification ID

Queueing theory (aspects of probability theory) (60K25) Inventory, storage, reservoirs (90B05) Markov processes (60J99) Hamilton-Jacobi theories (49L99)

Related Items

Time and Ratio Expected Average Cost Optimality for Semi-Markov Control Processes on Borel Spaces ⋮ A Contraction Theorem in Inventory Problems ⋮ Finite-state approximations for denumerable state discounted Markov decision processes ⋮ Strong bounds on perturbations ⋮ Myopia and \(R\& D\)/production complementarities ⋮ Optimal QoS control of interacting service stations ⋮ Zero-sum continuous-time Markov games with unbounded transition and discounted payoff rates ⋮ Control of economic systems under the process of data improvement ⋮ Controlled semi-Markov models - the discounted case ⋮ Robustness inequality for Markov control processes with unbounded costs ⋮ Estimation and control in discounted stochastic dynamic programming ⋮ The optimal frequency of information purchases ⋮ IDENTIFICATION OF DISCRETE CHOICE DYNAMIC PROGRAMMING MODELS WITH NONPARAMETRIC DISTRIBUTION OF UNOBSERVABLES ⋮ Arbitrary state semi-Markov decision processes ⋮ On theory and algorithms for Markov decision problems with the total reward criterion ⋮ Continuous time shock markov decision processes with discounted criterion ⋮ A Verification Theorem for Threshold-Indexability of Real-State Discounted Restless Bandits ⋮ Action-dependent stopping times and Markov decision process with unbounded rewards ⋮ Time-average and asymptotically optimal flow control policies in networks with multiple transmitters ⋮ Semi-Markov decision processes with a reachable state-subset ⋮ Discrete type shock semi-markov decision processes with borel state space ⋮ Average cost optimal policies for Markov control processes with Borel state space and unbounded costs ⋮ Existence and uniqueness theorems for the optimal inventory equation: The back-logging case ⋮ Markov programming by successive approximations with respect to weighted supremum norms ⋮ Markov decision processes and strongly excessive functions ⋮ Condition-based maintenance policies under imperfect maintenance at scheduled and unscheduled opportunities ⋮ Recursive utility and the Ramsey problem ⋮ Two person zero-sum semi-Markov games with unknown holding times distribution on one side: A discounted payoff criterion ⋮ Mixed Markov decision processes in a semi-Markov environment with discounted criterion ⋮ Controlled semi-Markov models under long-run average rewards ⋮ Stochastic Inventory Models with Limited Production Capacity and Periodically Varying Parameters ⋮ Unnamed Item ⋮ Average Cost Semi-Markov Decision Processes and the Control of Queueing Systems ⋮ Optimal adaptive control of priority assignment in queueing systems ⋮ Finite state approximations for denumerable state infinite horizon discounted Markov decision processes with unbounded rewards ⋮ Technological expectations and adoption of improved technology ⋮ Unnamed Item ⋮ Nonstationary value-iteration and adaptive control of discounted semi- Markov processes

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4068423&oldid=17810514"