Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning (Q6401740)

From MaRDI portal





preprint article from arXiv
Language Label Description Also known as
English
Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning
preprint article from arXiv

    Statements

    10 June 2022
    0 references
    cs.LG
    0 references
    math.OC
    0 references
    Ruida Zhou
    0 references
    Tao Liu
    0 references
    Dileep Kalathil
    0 references
    P. R. Kumar
    0 references
    Chao Tian
    0 references

    Identifiers

    0 references