Concentration bounds for temporal difference learning with linear function approximation: the case of batch data and uniform sampling (Q2051259)

From MaRDI portal

scientific article; zbMATH DE number 7432813
Language: English
Label: Concentration bounds for temporal difference learning with linear function approximation: the case of batch data and uniform sampling
Description: scientific article; zbMATH DE number 7432813

    Statements

    Concentration bounds for temporal difference learning with linear function approximation: the case of batch data and uniform sampling (English)
    24 November 2021

    Identifiers