Asymptotic behavior of a hierarchical system of learning automata (Q1062760)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Asymptotic behavior of a hierarchical system of learning automata |
scientific article; zbMATH DE number 3915630
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Asymptotic behavior of a hierarchical system of learning automata |
scientific article; zbMATH DE number 3915630 |
Statements
Asymptotic behavior of a hierarchical system of learning automata (English)
0 references
1985
0 references
Learning automata arranged in a two-level hierarchy are considered. The automata operate in a stationary random environment and update their action probabilities according to the linear-reward-\(\epsilon\)-penalty algorithm at each level. Unlike some hierarchical systems previously proposed, no information transfer exists from one level to another, and yet the hierarchy possesses good convergence properties. Using weak- convergence concepts it is shown that for large time and small values of parameters in the algorithm, the evolution of the optimal path probability can be represented by a diffusion whose parameters can be computed explicitly.
0 references
stochastic automaton
0 references
stationary random environment
0 references
convergence
0 references
optimal path probability
0 references
diffusion
0 references
0 references