A \(Sarsa(\lambda)\) algorithm based on double-layer fuzzy reasoning
From MaRDI portal
Publication:473823
DOI10.1155/2013/561026zbMath1299.68169OpenAlexW2026500801WikidataQ59026577 ScholiaQ59026577MaRDI QIDQ473823
Qiming Fu, Xiang Mu, Wei Huang, Quan Liu, Yong-Gang Zhang
Publication date: 24 November 2014
Published in: Mathematical Problems in Engineering (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1155/2013/561026
Learning and adaptive systems in artificial intelligence (68T05) Reasoning under uncertainty in the context of artificial intelligence (68T37)
Cites Work
- Simulation-based algorithms for Markov decision processes.
- Reinforcement distribution in fuzzy Q-learning
- The convergence of \(TD(\lambda)\) for general \(\lambda\)
- Type-2 fuzzy logic: Theory and applications (Cover title: Contributions to fuzzy and rough sets theories and their applications)
- An analysis of temporal-difference learning with function approximation