Average Cost Markov Decision Processes with Weakly Continuous Transition Probabilities

From MaRDI portal
Publication:2925348

DOI10.1287/moor.1120.0555zbMath1297.90173arXiv1202.4122OpenAlexW1966208686MaRDI QIDQ2925348

Eugene A. Feinberg, Nina V. Zadoianchuk (Zadoyanchuk), Pavlo O. Kasyanov

Publication date: 21 October 2014

Published in: Mathematics of Operations Research (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1202.4122




Related Items

LP based upper and lower bounds for Cesàro and Abel limits of the optimal values in problems of control of stochastic discrete time systemsUnbounded dynamic programming via the Q-transformA Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable PoliciesSTOCHASTIC SETUP-COST INVENTORY MODEL WITH BACKORDERS AND QUASICONVEX COST FUNCTIONSA useful technique for piecewise deterministic Markov decision processesMarkov Decision Processes with Incomplete Information and Semiuniform Feller Transition ProbabilitiesReduction of total-cost and average-cost MDPs with weakly continuous transition probabilities to discounted mdpsOptimality Conditions for Partially Observable Markov Decision ProcessesExamples concerning Abel and Cesàro limitsNear optimality of quantized policies in stochastic control under weak continuity conditionsFormalization of methods for the development of autonomous artificial intelligence systemsContinuity of discounted values and the structure of optimal policies for <scp>periodic‐review</scp> inventory systems with setup costsA survey of average cost problems in deterministic discrete-time control systemsA note on the existence of optimal stationary policies for average Markov decision processes with countable statesFatou's Lemma in Its Classical Form and Lebesgue's Convergence Theorems for Varying Measures with Applications to Markov Decision ProcessesAverage Cost Optimality Inequality for Markov Decision Processes with Borel Spaces and Universally Measurable PoliciesConvergence of probability measures and Markov decision models with incomplete informationContinuity of minima: local resultsAverage Cost Markov Decision Processes with Semi-Uniform Feller Transition ProbabilitiesConstrained Markov decision processes in Borel spaces: from discounted to average optimalitySolutions of the average cost optimality equation for Markov decision processes with weakly continuous kernel: the fixed-point approach revisitedUnnamed ItemPlanning for the long run: programming with patient, Pareto responsive preferencesUniform Fatou's lemmaOn the vanishing discount factor approach for Markov decision processes with weakly continuous transition probabilitiesBerge's theorem for noncompact image setsPartially Observable Total-Cost Markov Decision Processes with Weakly Continuous Transition ProbabilitiesFatou's Lemma for Weakly Converging Measures under the Uniform Integrability ConditionOn the Minimum Pair Approach for Average Cost Markov Decision Processes with Countable Discrete Action Spaces and Strictly Unbounded CostsMDPs with setwise continuous transition probabilitiesOn structural properties of optimal average cost functions in Markov decision processes with Borel spaces and universally measurable policiesConvex analytic method revisited: further optimality results and performance of deterministic policies in average cost stochastic controlOn Convergence of Value Iteration for a Class of Total Cost Markov Decision ProcessesOn the reduction of total‐cost and average‐cost MDPs to discounted MDPsAverage optimality for continuous-time Markov decision processes under weak continuity conditionsBerge's maximum theorem for noncompact image setsStructure of optimal policies to periodic-review inventory models with convex costs and backorders for all values of discount factorsContinuity of equilibria for two-person zero-sum games with noncompact action sets and unbounded payoffsOn the optimality equation for average cost Markov decision processes and its validity for inventory control



Cites Work