Recent advances in reinforcement learning in finance
From MaRDI portal
Publication:6146668
DOI10.1111/mafi.12382arXiv2112.04553MaRDI QIDQ6146668
Benjamin M. Hambly, Renyuan Xu, Huining Yang
Publication date: 31 January 2024
Published in: Mathematical Finance (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/2112.04553
Learning and adaptive systems in artificial intelligence (68T05) Research exposition (monographs, survey articles) pertaining to game theory, economics, and finance (91-02) Derivative securities (option pricing, hedging, etc.) (91G20) Portfolio theory (91G10) Financial markets (91G15)
Related Items (1)
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- The Pricing of Options and Corporate Liabilities
- Dealing with the inventory risk: a solution to the market making problem
- Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model
- Optimal mean-variance portfolio selection
- An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes
- Continuous-time mean-variance portfolio selection: a stochastic LQ framework
- Near-optimal reinforcement learning in polynomial time
- Risk-sensitive reinforcement learning
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- \({\mathcal Q}\)-learning
- Optimal portfolios of a small investor in a limit order market: a shadow price approach
- Error bounds for constant step-size \(Q\)-learning
- A selective overview of deep learning
- Markowitz Revisited: Mean-Variance Models in Financial Portfolio Analysis
- Optimal Dynamic Portfolio Selection: Multiperiod Mean-Variance Formulation
- 10.1162/153244303765208377
- Statistical Learning Theory: Models, Concepts, and Results
- PAC Bounds for Discounted MDPs
- DYNAMIC INDIFFERENCE VALUATION VIA CONVEX RISK MEASURES
- High-frequency trading in a limit order book
- OnActor-Critic Algorithms
- Optimal order placement in limit order markets
- Risk-Constrained Reinforcement Learning with Percentile Risk Criteria
- Empirical properties of asset returns: stylized facts and statistical issues
- Applications of Hilbert–Huang transform to non‐stationary financial time series analysis
- 10.1162/1532443041827907
- Optimal Portfolio Liquidation with Limit Orders
- Equal risk pricing of derivatives with deep hedging
- Learning a functional control for high-frequency finance
- Robust Risk-Aware Reinforcement Learning
- Time-consistent strategies for multi-period mean-variance portfolio optimization with the serially correlated returns
- What is the value of the cross-sectional approach to deep reinforcement learning?
- Machine Learning in Finance
- Quant GANs: deep generation of financial time series
- Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies
- Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon
- Deep Reinforcement Learning for Market Making in Corporate Bonds: Beating the Curse of Dimensionality
- Deep hedging
- The QLBS Q-Learner goes NuQLear: fitted Q iteration, inverse RL, and option portfolios
- A Closed-Form Solution for Options with Stochastic Volatility with Applications to Bond and Currency Options
- On Overfitting and Asymptotic Bias in Batch Reinforcement Learning with Partial Observability
- Risk-Sensitive Reinforcement Learning
- Q-Learning with Linear Function Approximation
- Fallacy of the log-normal approximation to optimal portfolio decision-making over many periods
- Optimal high-frequency trading with limit and market orders
- Some aspects of the sequential design of experiments
- Continuous‐time mean–variance portfolio selection: A reinforcement learning framework
- Bypassing the Monster: A Faster and Simpler Optimal Algorithm for Contextual Bandits Under Realizability
This page was built for publication: Recent advances in reinforcement learning in finance