Use of Stochastic Automata for Parameter Self-Optimization with Multimodal Performance Criteria
From MaRDI portal
Publication:5575971
DOI10.1109/TSSC.1969.300228zbMath0184.19401MaRDI QIDQ5575971
Kumpati S. Narendra, I. J. Shapiro
Publication date: 1969
Published in: IEEE Transactions on Systems Science and Cybernetics (Search for Journal in Brave)
Related Items (23)
Achieving Unbounded Resolution inFinitePlayer Goore Games Using Stochastic Automata, and Its Applications ⋮ When can the two-armed bandit algorithm be trusted? ⋮ Distributed dynamic reinforcement of efficient outcomes in multiagent coordination and network formation ⋮ Regret bounds for Narendra-Shapiro bandit algorithms ⋮ How Fast Is the Bandit? ⋮ On ergodic two-armed bandits ⋮ Probabilistic automata ⋮ Convergence in models with bounded expected relative hazard rates ⋮ Nonconvergence to saddle boundary points under perturbed reinforcement learning ⋮ A strategy for controlling nonlinear systems using a learning automaton ⋮ Learning behavior of stochastic automata in the last stage of learning ⋮ Theoretical considerations of the parameter self-optimization by stochastic automata ⋮ Choice of optimal subset of numbers using a learning automaton ⋮ Stochastic automata and learning systems. ⋮ Learning automata algorithms for pattern classification. ⋮ An application of the stochastic automaton to the investment game ⋮ Combinatorial optimization by stochastic automata ⋮ A cooperative game of a pair of learning automata ⋮ Optimal non-linear reinforcement schemes for stochastic automata ⋮ epsilon-optimality of a general class of learning algorithms ⋮ A learning automata based algorithm for optimization of continuous complex functions ⋮ Reinforcement learning with internal expectation for the random neural network ⋮ On conditional optimality of a class of learning automata in random environments
This page was built for publication: Use of Stochastic Automata for Parameter Self-Optimization with Multimodal Performance Criteria