Occupancy information ratio: infinite-horizon, information-directed, parameterized policy search (Q6652398)

scientific article; zbMATH DE number 7957555

    Statements

    Occupancy information ratio: infinite-horizon, information-directed, parameterized policy search (English)
    12 December 2024
    reinforcement learning
    policy gradient methods
    nonconvex optimization