Pages that link to "Item:Q2520139"
From MaRDI portal
The following pages link to Mechanisms with learning for stochastic multi-armed bandit problems (Q2520139):
Displaying 9 items.
- Exploration and exploitation of scratch games (Q374139) (← links)
- A quality assuring, cost optimal multi-armed bandit mechanism for expertsourcing (Q1690964) (← links)
- A reliability-aware multi-armed bandit approach to learn and select users in demand response (Q2207171) (← links)
- On the value of learning for Bernoulli bandits with unknown parameters (Q2730296) (← links)
- An Efficient Algorithm for Learning with Semi-bandit Feedback (Q2859220) (← links)
- Machine learning and nonparametric bandit theory (Q4850249) (← links)
- Memory-Constrained No-Regret Learning in Adversarial Multi-Armed Bandits (Q5103511) (← links)
- ON THE IDENTIFICATION AND MITIGATION OF WEAKNESSES IN THE KNOWLEDGE GRADIENT POLICY FOR MULTI-ARMED BANDITS (Q5358114) (← links)
- (Q5744820) (← links)