Real-Time Reinforcement Learning of Constrained Markov Decision Processes with Weak Derivatives (Q4925757)

From MaRDI portal





scientific article; zbMATH DE number 6174824
Language Label Description Also known as
English
Real-Time Reinforcement Learning of Constrained Markov Decision Processes with Weak Derivatives
scientific article; zbMATH DE number 6174824

    Statements

    12 June 2013
    0 references
    stochastic approximation algorithm
    0 references
    spherical coordinate parametrization
    0 references
    asymptotic bias
    0 references
    math.OC
    0 references

    Identifiers