Control randomisation approach for policy gradient and application to reinforcement learning in optimal switching (Q6657507)

scientific article; zbMATH DE number 7962330

Language	Label	Description	Also known as
English	Control randomisation approach for policy gradient and application to reinforcement learning in optimal switching	scientific article; zbMATH DE number 7962330

Statements

instance of

scholarly article

0 references

title

Control randomisation approach for policy gradient and application to reinforcement learning in optimal switching (English)

0 references

0 references

0 references

0 references

Applied Mathematics and Optimization

0 references

publication date

6 January 2025

0 references

zbMATH Keywords

reinforcement learning in continuous time

0 references

policy gradient

0 references

control randomization

0 references

actor-critic algorithms

0 references

optimal switching

0 references

MaRDI profile type

Publication

0 references

cites work

A stochastic target formulation for optimal switching problems in finite horizon

0 references

Monte-Carlo Valuation of American Options: Facts and New Algorithms to Improve Existing Methods

0 references

Valuation of energy storage: an optimal switching approach

0 references

Randomized Optimal Stopping Problem in Continuous time and Reinforcement Learning Algorithm

0 references

Probabilistic representation and approximation for coupled systems of variational inequalities

0 references

Randomized and backward SDE representation for optimal control of non-Markovian SDEs

0 references

Representation of non-Markovian optimal stopping problems by constrained BSDEs with a single jump

0 references

On the Starting and Stopping Problem: Application in Reversible Investments

0 references

Multivariate point processes: predictable projection, Radon-Nikodym derivatives, representation of martingales

0 references

Backward SDEs with constrained jumps and quasi-variational inequalities

0 references

Feynman-Kac representation for Hamilton-Jacobi-Bellman IPDE

0 references

Valuation of power plants by utility indifference and numerical computation

0 references

Q5149240

0 references

Reservoir optimization and Machine Learning methods

0 references

Identifiers

DOI

10.1007/s00245-024-10207-5

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:6657507