A unified algorithm framework for mean-variance optimization in discounted Markov decision processes
From MaRDI portal
Publication:6096629
DOI10.1016/j.ejor.2023.06.022arXiv2201.05737MaRDI QIDQ6096629
Publication date: 15 September 2023
Published in: European Journal of Operational Research (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/2201.05737
dynamic programmingMarkov decision processbilevel optimizationBellman local-optimality equationdiscounted mean-variance
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Analysis and improvement of policy gradient estimation
- Mean-variance analysis of option contracts in a two-echelon supply chain
- Advances in prospect theory: cumulative representation of uncertainty
- Continuous-time mean-variance portfolio selection: a stochastic LQ framework
- Mean-variance optimization of discrete time discounted Markov decision processes
- A mean-variance optimization problem for discounted Markov decision processes
- A possibilistic mean-semivariance-entropy model for multi-period portfolio selection with transaction costs
- Optimization of Markov decision processes under the variance criterion
- Survey on multi-period mean-variance portfolio selection model
- Optimal Dynamic Portfolio Selection: Multiperiod Mean-Variance Formulation
- Multilevel Optimization Modeling for Risk-Averse Stochastic Programming
- Sensitivity Analysis for Mean-Variance Portfolio Problems
- Variance-Penalized Markov Decision Processes
- Mean-Variance Tradeoffs in an Undiscounted MDP
- Mean-Variance Tradeoffs in an Undiscounted MDP: The Unichain Case
- Markowitz's Mean-Variance Portfolio Selection with Regime Switching: A Continuous-Time Model
- The variance of discounted Markov decision processes
- Sample-Path Optimality and Variance-Minimization of Average Cost Markov Control Processes
- Portfolio Optimization with Nonparametric Value at Risk: A Block Coordinate Descent Method
- Variance-penalized Markov decision processes: dynamic programming and reinforcement learning techniques