Reinforcement learning with dynamic convex risk measures
From MaRDI portal
Publication:6196296
DOI10.1111/mafi.12388arXiv2112.13414OpenAlexW4225482307MaRDI QIDQ6196296
Unnamed Author, Sebastian Jaimungal
Publication date: 14 March 2024
Published in: Mathematical Finance (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/2112.13414
reinforcement learningtrading strategiestime-consistencyactor-critic algorithmrobot controldynamic risk measurespolicy gradientfinancial hedging
Statistical methods; risk measures (91G70) Learning and adaptive systems in artificial intelligence (68T05) Optimal stochastic control (93E20)
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Dual representation of minimal supersolutions of convex BSDEs
- Risk-averse dynamic programming for Markov decision processes
- Deep learning-based numerical methods for high-dimensional parabolic partial differential equations and backward stochastic differential equations
- Convex measures of risk and trading constraints
- Reinforcement learning and stochastic optimisation
- Minimizing spectral risk measures applied to Markov decision processes
- Markov decision processes with recursive risk measures
- Dynamic coherent risk measures
- Conditional and dynamic convex risk measures
- Coherent Measures of Risk
- Dynamic assessment indices
- Markov decision processes with iterated coherent risk measures
- Percentile Optimization for Markov Decision Processes with Parameter Uncertainty
- Risk Measures and Comonotonicity: A Review
- Approximate Integration of Stochastic Differential Equations
- Risk-Constrained Reinforcement Learning with Percentile Risk Criteria
- Approximate Value Iteration for Risk-Aware Markov Decision Processes
- Solving high-dimensional partial differential equations using deep learning
- Deep learning volatility: a deep neural network perspective on pricing and calibration in (rough) volatility models
- Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon
- Dynamic Risk Measures
- Sequential Decision Making With Coherent Risk
- A Closed-Form Solution for Options with Stochastic Volatility with Applications to Bond and Currency Options
- Risk-Sensitive Reinforcement Learning
- Envelope Theorems for Arbitrary Choice Sets
- DISTRIBUTION‐INVARIANT RISK MEASURES, INFORMATION, AND DYNAMIC CONSISTENCY
- Distributionally Robust Markov Decision Processes and Their Connection to Risk Measures
- Deep Q-Learning for Nash Equilibria: Nash-DQN
- Approximation by superpositions of a sigmoidal function
- Stochastic finance. An introduction in discrete time
This page was built for publication: Reinforcement learning with dynamic convex risk measures