On dynamic programming: Compactness of the space of policies

From MaRDI portal
Publication:1221981

DOI10.1016/0304-4149(75)90031-9zbMath0317.60025OpenAlexW2070935253WikidataQ126298569 ScholiaQ126298569MaRDI QIDQ1221981

Manfred Schäl

Publication date: 1975

Published in: Stochastic Processes and their Applications (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1016/0304-4149(75)90031-9




Related Items

Semicontinuous nonstationary stochastic gamesContinuity Properties of Value Functions in Information Structures for Zero-Sum and General Games and Stochastic TeamsAn equilibrium existence result for games with incomplete information and indeterminate outcomesPerfect equilibria in games of incomplete informationConstrained discounted stochastic gamesThe Expected Total Cost Criterion for Markov Decision Processes under Constraints: A Convex Analytic ApproachOptimal Control of Piecewise Deterministic Markov ProcessesConditions for the solvability of the linear programming formulation for constrained discounted Markov decision processesBayesian learning and convergence to rational expectationsOptimal learning with costly adjustmentStationary Markov Nash Equilibria for Nonzero-Sum Constrained ARAT Markov GamesGeometry of information structures, strategic measures and associated stochastic control topologiesSelf-fulfilling expectations in stochastic processes of temporary equilibriaOn compactness of the space of policies in stochastic dynamic programmingMarkov Decision Processes with Incomplete Information and Semiuniform Feller Transition ProbabilitiesSemi-uniform Feller stochastic kernelsZero-sum games involving teams against teams: existence of equilibria, and comparison and regularity in informationEquivalent conditions for weak continuity of nonlinear filtersThe martingale problem method revisitedOn the expected total reward with unbounded returns for Markov decision processesExtreme Occupation Measures in Markov Decision Processes with an Absorbing StateNash equilibria for total expected reward absorbing Markov games: the constrained and unconstrained casesAbsorbing Markov decision processesA Universal Dynamic Program and Refined Existence Results for Decentralized Stochastic ControlA Convex Programming Approach for Discrete-Time Markov Decision Processes under the Expected Total Reward CriterionConditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimalMaximizing the probability of visiting a set infinitely often for a countable state space Markov decision processSufficiency of Deterministic Policies for Atomless Discounted and Uniformly Absorbing MDPs with Multiple CriteriaCompactness of the space of non-randomized policies in countable-state sequential decision processesEssential stability of the alpha cores of finite games with incomplete informationMultiobjective Stopping Problem for Discrete-Time Markov Processes: Convex Analytic ApproachConstrained discounted Markov decision processes with Borel state spacesEquilibria in infinite games of incomplete informationConstrained and Unconstrained Optimal Discounted Control of Piecewise Deterministic Markov ProcessesOn the Existence of Nash Equilibrium in Bayesian GamesSemicontinuous nonstationary stochastic games. IILarge deviations principle for discrete-time mean-field gamesConstrained Markov Decision Processes with Expected Total Reward CriteriaMarkov decision processes under ambiguityComparison of Information Structures for Zero-Sum Games and a Partial Converse to Blackwell Ordering in Standard Borel SpacesOptimality, equilibrium, and curb sets in decision problems without commitmentNowak's Theorem on Probability Measures Induced by Strategies RevisitedConvex analytic method revisited: further optimality results and performance of deterministic policies in average cost stochastic controlMultiple objective nonatomic Markov decision processes with total reward criteriaOn maximizing the average time at a goalConstrained Markovian decision processes: The dynamic programming approachExistence of optimal policy for time non-homogeneous discounted Markovian decision programmingStrategic measures in optimal control problems for stochastic sequences



Cites Work