Global optimality guarantees for policy gradient methods (Q6655175)

scientific article; zbMATH DE number 7960288

    Statements

    Global optimality guarantees for policy gradient methods (English)
    20 December 2024
    reinforcement learning
    policy gradient methods
    policy iteration
    dynamic programming
    gradient dominance

    Identifiers