Discrete multiarmed bandits and multiparameter processes (Q1317211)

From MaRDI portal





scientific article; zbMATH DE number 528195
Language Label Description Also known as
English
Discrete multiarmed bandits and multiparameter processes
scientific article; zbMATH DE number 528195

    Statements

    Discrete multiarmed bandits and multiparameter processes (English)
    0 references
    0 references
    21 April 1994
    0 references
    The author reformulates the multiarmed bandit problem in discrete time as an optimal stochastic control problem for a multiparameter process. Within this framework, the dynamic allocation index, the so-called Gittins index, becomes a multiparameter process, and it is shown how it leads to optimal solutions. The main advantage of such an approach is that it provides a convenient and elegant representation of switching strategies by using the notion of optimal increasing paths or strategies over a partially ordered set.
    0 references
    multiarmed bandit problem
    0 references
    dynamic allocation index
    0 references
    Gittins index
    0 references
    switching strategies
    0 references

    Identifiers