The following pages link to (Q5405185):
Displaying 20 items.
- Doubly robust policy evaluation and optimization (Q252797)
- Optimal learning for sequential sampling with non-parametric beliefs (Q742143)
- Optimizing infill drilling decisions using multi-armed bandits: application in a long-term, multi-element stockpile (Q1719844)
- A tutorial on Gaussian process regression: modelling, exploring, and exploiting functions (Q1735988)
- Contextual dependent click bandit algorithm for web recommendation (Q1790952)
- Interactive Thompson sampling for multi-objective multi-armed bandits (Q1990281)
- Forecasting the unemployment rate over districts with the use of distinct methods (Q2697073)
- On the Prior Sensitivity of Thompson Sampling (Q2831392)
- Bayesian Incentive-Compatible Bandit Exploration (Q3387959)
- (Q4558161)
- (Q4558206)
- (Q5053193)
- Are Humans Bayesian in the Optimization of Black-Box Functions? (Q5122270)
- MNL-Bandit: A Dynamic Learning Approach to Assortment Selection (Q5129205)
- (Q5149015)
- Learning to Optimize via Posterior Sampling (Q5247618)
- Randomized allocation with arm elimination in a bandit problem with covariates (Q5965323)
- Thompson sampling for networked control over unknown channels (Q6566766)
- Adaptive Algorithm for Multi-Armed Bandit Problem with High-Dimensional Covariates (Q6567892)
- Multi-armed bandit experiments in the online service economy (Q6574679)