Pages that link to "Item:Q2006767"
From MaRDI portal
The following pages link to Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards (Q2006767):
Displaying 6 items.
- A bandit process with delayed responses (Q1573129)
- Randomized allocation with nonparametric estimation for a multi-armed bandit problem with covariates (Q1848931)
- Kernel estimation and model combination in a bandit problem with covariates (Q2834477)
- Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback (Q4596721)
- Adaptive Algorithm for Multi-Armed Bandit Problem with High-Dimensional Covariates (Q6567892)
- Integrating multi-armed bandit with local search for MaxSAT (Q6665727)