Regularity and Stability of Feedback Relaxed Controls
From MaRDI portal
Publication:3382776
DOI10.1137/20M1312435zbMath1471.93100arXiv2001.03148OpenAlexW3199075411MaRDI QIDQ3382776
Christoph Reisinger, Yu-Fei Zhang
Publication date: 22 September 2021
Published in: SIAM Journal on Control and Optimization (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/2001.03148
Hamilton-Jacobi-Bellman equationreinforcement learningLipschitz stabilityexploration and exploitationsensitivity equationfeedback relaxed control
Related Items (3)
Exploratory HJB Equations and Their Convergence ⋮ Reinforcement Learning for Linear-Convex Models with Jumps via Stability Analysis of Feedback Controls ⋮ Optimal Scheduling of Entropy Regularizer for Continuous-Time Linear-Quadratic Reinforcement Learning
Uses Software
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Discontinuous Galerkin finite element methods for time-dependent Hamilton-Jacobi-Bellman equations with Cordes coefficients
- Uniqueness to elliptic and parabolic Hamilton-Jacobi-Bellman equations with non-smooth boundary
- Adapted Wasserstein distances and stability in mathematical finance
- Proto-derivative formulas for basic subgradient mappings in mathematical programming
- Smoothing methods for convex inequalities and linear complementarity problems
- A non-interior continuation method for generalized linear complementarity problems
- Continuity of utility maximization under weak convergence
- A neural network-based policy iteration algorithm with global \(H^2\)-superlinear convergence for stochastic games on domains
- All adapted topologies are equal
- Extended weak convergence and utility maximisation with proportional transaction costs
- Recursive approximation of the high dimensional max function
- Controlled Markov processes and viscosity solutions
- Generalized Hamilton--Jacobi--Bellman Equations with Dirichlet Boundary Condition and Stochastic Exit Time Optimal Control Problem
- Some Convergence Results for Howard's Algorithm
- A smoothing-out technique for min—max optimization
- Convergence of Dynamic Programming Models
- Elliptic Partial Differential Equations of Second Order
- A strong comparison result for the bellman equation arising in stochastic exit time control problems and its applications
- Existence and uniqueness theorems for solutions of McKean–Vlasov stochastic equations
- Robustness to Incorrect System Models in Stochastic Control
- Discontinuous Galerkin Finite Element Approximation of Hamilton--Jacobi--Bellman Equations with Cordes Coefficients
- Convex Analysis
- Continuous‐time mean–variance portfolio selection: A reinforcement learning framework
This page was built for publication: Regularity and Stability of Feedback Relaxed Controls