Pages that link to "Item:Q2862464"
From MaRDI portal
The following pages link to Convergent learning algorithms for unknown reward games (Q2862464):
Displaying 13 items.
- Learning to compete, coordinate, and cooperate in repeated games using reinforcement learning (Q413845)
- Revisiting log-linear learning: asynchrony, completeness and payoff-based implementation (Q423755)
- Hedging under uncertainty: regret minimization meets exponentially fast convergence (Q681880)
- Convergent multiple-timescales reinforcement learning algorithms in normal form games (Q1429103)
- Stochastic learning in multi-agent optimization: communication and payoff-based approaches (Q1716626)
- Learning in games with continuous action sets and unknown payoff functions (Q1717237)
- On convergence rates of game theoretic reinforcement learning algorithms (Q1737909)
- A strategic learning algorithm for state-based games (Q2173896)
- A reinforcement learning scheme for the equilibrium of the in-vehicle route choice problem based on congestion game (Q2287675)
- Analysis of Hannan consistent selection for Monte Carlo tree search in simultaneous move games (Q2303656)
- AWESOME: a general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents (Q2384141)
- Optimal Off-line Experimentation for Games (Q4991767)
- Evaluation and learning in two-player symmetric games via best and better responses (Q6095620)