The variance of discounted Markov decision processes
From MaRDI portal
Publication:4739691
DOI10.2307/3213832zbMath0503.90091OpenAlexW2313791856MaRDI QIDQ4739691
Publication date: 1982
Published in: Journal of Applied Probability (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.2307/3213832
semi-Markov decision processpolicy improvementvariance formulafinite Markov decision processhigher moments formulasvalue of single-stage rewards
Decision theory (91B06) Continuous-time Markov processes on general state spaces (60J25) Statistical decision theory (62C99) Markov and semi-Markov decision processes (90C40)
Related Items
Markov decision processes with a minimum-variance criterion, Utility, probabilistic constraints, mean and variance of discounted rewards in Markov decision processes, A variance minimization problem for a Markov decision process, Analyzing operational risk-reward trade-offs for start-ups, Augmenting Markov Cohort Analysis to Compute (Co)Variances: Implications for Strength of Cost-Effectiveness, First passage risk probability minimization for piecewise deterministic Markov decision processes, Target-level criterion in Markov decision processes, Trading performance for stability in Markov decision processes, On using discrete random models within decision support systems, Markov Decision Problems Where Means Bound Variances, Finite-horizon variance penalised Markov decision processes, Multi-objective discounted Markov decision processes with expectation and variance criteria, Risk-sensitive control of Markov decision processes: a moment-based approach with target distributions, Variance minimization for constrained discounted continuous-time MDPs with exponentially distributed stopping times, Optimization of Markov decision processes under the variance criterion, A mean-variance optimization problem for discounted Markov decision processes, Mean-variance problems for finite horizon semi-Markov decision processes, First Passage Optimality and Variance Minimisation of Markov Decision Processes with Varying Discount Factors, Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model, The risk probability criterion for discounted continuous-time Markov decision processes, Risk-Sensitive Reinforcement Learning via Policy Gradient Search, Variance-constrained actor-critic algorithms for discounted and average reward MDPs, A unified algorithm framework for mean-variance optimization in discounted Markov decision processes, Risk-averse optimization of reward-based coherent risk measures, Safety-constrained reinforcement learning with a distributional safety critic, Risk-averse dynamic pricing using mean-semivariance optimization, Mean-variance optimization of discrete time discounted Markov decision processes, The optimal unbiased value estimator and its relation to LSTD, TD and MC, A note on the dynamic liquidity trading problem with a mean-variance objective, Threshold probability of non-terminal type in finite horizon Markov decision processes, Near-optimal PAC bounds for discounted MDPs, Optimal threshold probability in undiscounted Markov decision processes with a target set., Risk-Constrained Reinforcement Learning with Percentile Risk Criteria, Variance-penalized Markov decision processes: dynamic programming and reinforcement learning techniques, Stopped decision processes in conjunction with general utility, On the total reward variance for continuous-time Markov reward chains, Efficient algorithms for risk-sensitive Markov decision processes with limited budget, Mean, variance and probabilistic criteria in finite Markov decision processes: A review, An Inequality for Variances of the Discounted Rewards, Minimizing risk models in Markov decision processes with policies depending on target values, On the General Utility of Discounted Markov Decision Processes, Optimal policy for minimizing risk models in Markov decision processes, Semi-Markov decision processes with variance minimization criterion, Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning, Solution strategies for variance minimization problems, On mean reward variance in semi-Markov processes, Algorithmic aspects of mean-variance optimization in Markov decision processes, Notes on average Markov decision processes with a minimum-variance criterion