Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal

From MaRDI portal
Publication:4077764

DOI10.1007/BF00532612zbMath0316.90080OpenAlexW323116121MaRDI QIDQ4077764

Manfred Schäl

Publication date: 1975

Published in: Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1007/bf00532612




Related Items (only showing first 100 items - show all)

Dynamic moral hazard without commitmentSemicontinuous nonstationary stochastic gamesA polynomial time bound for Howard's policy improvement algorithmSufficient conditions for optimality of a \((z,c^ -,c^ +)\)-sampling plan in multistage Bayesian acceptance samplingConditions for the solvability of the linear programming formulation for constrained discounted Markov decision processesOptimal assignment policy of a single server attended by two queuesRecurrence conditions for Markov decision processes with Borel state space: A surveyDensity estimation and adaptive control of Markov processes: Average and discounted criteriaScheduling in a multi-class series of queues with deterministic service timesOptimale Innovationspolitik bei unvollständiger Information. (Optimal innovation policy under incomplete information)Normal fields of functionals and optimal measurable sectionsOptimal learning with costly adjustmentRecursive utility and optimal growth under uncertaintyValue iteration in average cost Markov control processes on Borel spacesA strategic market game with secured lendingConvex analytic approach to constrained discounted Markov decision processes with non-constant discount factorsConstrained Markov control processes with randomized discounted cost criteria: infinite linear programming approachGeometry of information structures, strategic measures and associated stochastic control topologiesOn compactness of the space of policies in stochastic dynamic programmingMarkov decision processes with iterated coherent risk measuresOptimality of monotonic policies for two-action Markovian decision processes, with applications to control of queues with delayed informationSample path methods in the control of queuesThe policy iteration algorithm for average continuous control of piecewise deterministic Markov processesTwo-person zero-sum stochastic games with varying discount factorsMarkov control models with unknown random state-action-dependent discount factorsRobust Markov control processesConvex Analysis in Decentralized Stochastic Control, Strategic Measures, and Optimal SolutionsDynamic programming for discrete-time stochastic systems of a general typeMarkov decision processes with state-dependent discount factors and unbounded rewards/costsOn theory and algorithms for Markov decision problems with the total reward criterionMarkov renewal decision processes with finite horizonConditions for characterizing the structure of optimal strategies in infinite-horizon dynamic programsOn the expected total reward with unbounded returns for Markov decision processesOn discounted dynamic programming with unbounded returnsDiscounted dynamic programming with unbounded returns: application to economic modelsSequential stochastic control (single or multi-agent) problems nearly admit change of measures with independent measurement\(n\)-person dynamic strategic market gamesA natural extension of the MacQueen extrapolationBias and overtaking equilibria for zero-sum stochastic differential gamesA note on the \({\sigma}\)-compactness of sets of probability measures on metric spacesOptimal policies in multiproduct inventory modelsOn two-state quality control under Markovian deteriorationOptimal dividend policy in discrete timeHow to stay in a set or Koenig's lemma for random pathsAdaptive control of constrained Markov chains: Criteria and policiesDenumerable semi-Markov decision chains with small interest ratesOn variable discounting in dynamic programming: applications to resource extraction and other economic modelsAverage optimality for risk-sensitive control with general state spacePolicy iteration algorithms for zero-sum stochastic differential games with long-run average payoff criteriaOptimal dynamic load distribution in a class of flow-type flexible manufacturing systemsStochastic dynamic programming with non-linear discountingMaximizing the probability of visiting a set infinitely often for a countable state space Markov decision processZero-sum Markov games with random state-actions-dependent discount factors: existence of optimal strategiesStochastic optimal growth with bounded or unbounded utility and with bounded or unbounded shocksAdaptive control of stochastic systems with unknown disturbance distribution: discounted criteriaA note on negative dynamic programming for risk-sensitive controlDynamic programming with state-dependent discountingA linear-quadratic Gaussian approach to dynamic information acquisitionSome structured dynamic programs arising in economicsStochastic games with unbounded payoffs: applications to robust control in economicsPolicy iteration for continuous-time average reward Markov decision processes in Polish spacesContinuous-time Markov decision processes with state-dependent discount factorsThe discounted method and equivalence of average criteria for risk-sensitive Markov decision processes on Borel spacesCompactness of the space of non-randomized policies in countable-state sequential decision processesSemi-Markov control models with partially known holding times distribution: discounted and average criteriaStochastic growth with short-run prediction of shocksNonstationary discrete-time deterministic and stochastic control systems: bounded and unbounded casesManufacturing lead-time rules: customer retention versus tardiness costsOn stopped decision processes with discrete time parameterAverage cost optimal policies for Markov control processes with Borel state space and unbounded costsNonstationary discrete-time deterministic and stochastic control systems with infinite horizonGeneralised discounting in dynamic programming with unbounded returnsMeasurable selection theorems for optimization problemsA dynamic game approach to distributionally robust safety specifications for stochastic systemsA general stochastic fixed-point theorem for continuous random operators on stochastic domainsA survey of Markov decision models for control of networks of queuesDynamic programming method for deterministic discrete processes of general formSemicontinuous nonstationary stochastic games. IIOn piecewise deterministic Markov control processes: Control of jumps and of risk processes in insuranceAdaptive control of diffusion processes with a discounted reward criterionSensitivity analysis of multisector optimal economic dynamicsCharacterizations of overtaking optimality for controlled diffusion processesMDPs with setwise continuous transition probabilitiesA transformation method for stochastic control problems with partial observationsLimiting optimal discounted-cost control of a class of time-varying stochastic systemsZum Problem des zweiarmigen Bernoulli-Banditen mit einer bekannten Erfolgswahrscheinlichkeit und unendlich vielen SpielenConvex analytic method revisited: further optimality results and performance of deterministic policies in average cost stochastic controlOn a stopping rule for a class of sequential decision problemsA strategic market game with active bankruptcyAdaptive control of discounted Markov decision chainsSemi-Markov decision processes with variance minimization criterionDiscounted robust control for Markov diffusion processesControl of arrivals to two queues in seriesZero-sum semi-Markov games with state-action-dependent discount factorsA note on Optimal control of a queueing system with two heterogeneous serversNonstationary value-iteration and adaptive control of discounted semi- Markov processesOvertaking optimality for controlled Markov-modulated diffusionsLock and no-lock mortgage plans: Is it only a matter of risk shifting?Stationary policies and Markov policies in Borel dynamic programmingNonzero-sum stochastic differential games with additive structure and average payoffs



Cites Work


This page was built for publication: Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal