Recent advances in reinforcement learning in finance

DOI10.1111/mafi.12382arXiv2112.04553MaRDI QIDQ6146668

Benjamin M. Hambly, Renyuan Xu, Huining Yang

Publication date: 31 January 2024

Published in: Mathematical Finance (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/2112.04553

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05) Research exposition (monographs, survey articles) pertaining to game theory, economics, and finance (91-02) Derivative securities (option pricing, hedging, etc.) (91G20) Portfolio theory (91G10) Financial markets (91G15)

Related Items (1)

Dynamics of market making algorithms in dealer markets: Learning and tacit collusion

Cites Work

Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
The Pricing of Options and Corporate Liabilities
Dealing with the inventory risk: a solution to the market making problem
Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model
Optimal mean-variance portfolio selection
An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes
Continuous-time mean-variance portfolio selection: a stochastic LQ framework
Near-optimal reinforcement learning in polynomial time
Risk-sensitive reinforcement learning
Simple statistical gradient-following algorithms for connectionist reinforcement learning
\({\mathcal Q}\)-learning
Optimal portfolios of a small investor in a limit order market: a shadow price approach
Error bounds for constant step-size \(Q\)-learning
A selective overview of deep learning
Markowitz Revisited: Mean-Variance Models in Financial Portfolio Analysis
Optimal Dynamic Portfolio Selection: Multiperiod Mean-Variance Formulation
10.1162/153244303765208377
Statistical Learning Theory: Models, Concepts, and Results
PAC Bounds for Discounted MDPs
DYNAMIC INDIFFERENCE VALUATION VIA CONVEX RISK MEASURES
High-frequency trading in a limit order book
OnActor-Critic Algorithms
Optimal order placement in limit order markets
Risk-Constrained Reinforcement Learning with Percentile Risk Criteria
Empirical properties of asset returns: stylized facts and statistical issues
Applications of Hilbert–Huang transform to non‐stationary financial time series analysis
10.1162/1532443041827907
Optimal Portfolio Liquidation with Limit Orders
Equal risk pricing of derivatives with deep hedging
Learning a functional control for high-frequency finance
Robust Risk-Aware Reinforcement Learning
Time-consistent strategies for multi-period mean-variance portfolio optimization with the serially correlated returns
What is the value of the cross-sectional approach to deep reinforcement learning?
Machine Learning in Finance
Quant GANs: deep generation of financial time series
Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies
Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon
Deep Reinforcement Learning for Market Making in Corporate Bonds: Beating the Curse of Dimensionality
Deep hedging
The QLBS Q-Learner goes NuQLear: fitted Q iteration, inverse RL, and option portfolios
A Closed-Form Solution for Options with Stochastic Volatility with Applications to Bond and Currency Options
On Overfitting and Asymptotic Bias in Batch Reinforcement Learning with Partial Observability
Risk-Sensitive Reinforcement Learning
Q-Learning with Linear Function Approximation
Fallacy of the log-normal approximation to optimal portfolio decision-making over many periods
Optimal high-frequency trading with limit and market orders
Some aspects of the sequential design of experiments
Continuous‐time mean–variance portfolio selection: A reinforcement learning framework
Bypassing the Monster: A Faster and Simpler Optimal Algorithm for Contextual Bandits Under Realizability

This page was built for publication: Recent advances in reinforcement learning in finance