Policy Mirror Descent for Reinforcement Learning: Linear Convergence, New Sampling Complexity, and Generalized Problem Classes (Q6359420)

From MaRDI portal





scientific article
Language Label Description Also known as
English
Policy Mirror Descent for Reinforcement Learning: Linear Convergence, New Sampling Complexity, and Generalized Problem Classes
scientific article

    Statements

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references