Maximum a Posteriori Policy Optimisation (Q6303189)

From MaRDI portal





preprint article from arXiv
Language Label Description Also known as
English
Maximum a Posteriori Policy Optimisation
preprint article from arXiv

    Statements

    14 June 2018
    0 references
    cs.LG
    0 references
    cs.AI
    0 references
    cs.IT
    0 references
    cs.RO
    0 references
    math.IT
    0 references
    stat.ML
    0 references
    Abbas Abdolmaleki
    0 references
    Jost Tobias Springenberg
    0 references
    Yuval Tassa
    0 references
    Remi Munos
    0 references
    Nicolas Heess
    0 references
    Martin Riedmiller
    0 references

    Identifiers

    0 references