Pages that link to "Item:Q2810878"

What links here

⧼whatlinkshere-whatlinkshere-target⧽

Page:

⧼whatlinkshere-whatlinkshere-ns⧽

Namespace:

Invert selection

⧼whatlinkshere-whatlinkshere-filter⧽

Hide transclusions

Hide links

Hide redirects

The following pages link to An information-theoretic analysis of Thompson sampling (Q2810878):

Displaying 26 items.

Generalizations of maximal inequalities to arbitrary selection rules (Q1640918) (← links)
Foraging decisions as multi-armed bandit problems: applying reinforcement learning algorithms to foraging data (Q1730108) (← links)
Exploratory distributions for convex functions (Q1737974) (← links)
Improved regret for zeroth-order adversarial bandit convex optimisation (Q2035748) (← links)
Dismemberment and design for controlling the replication variance of regret for the multi-armed bandit (Q2081727) (← links)
Probabilistic bisection with spatial metamodels (Q2184151) (← links)
Adaptive policies for perimeter surveillance problems (Q2286935) (← links)
On the Prior Sensitivity of Thompson Sampling (Q2831392) (← links)
A Tutorial on Thompson Sampling (Q4556183) (← links)
Multi-Armed Bandit for Species Discovery: A Bayesian Nonparametric Approach (Q4690972) (← links)
Learning to Optimize via Information-Directed Sampling (Q4969321) (← links)
Matching While Learning (Q4994180) (← links)
Bandit Theory: Applications to Learning Healthcare Systems and Clinical Trials (Q5072150) (← links)
Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning (Q5089723) (← links)
Nonstationary Bandits with Habituation and Recovery Dynamics (Q5144777) (← links)
Efficient Simulation of High Dimensional Gaussian Vectors (Q5219708) (← links)
Derivative-free optimization methods (Q5230522) (← links)
On the Worth of Perfect Information in Bandits with Random Discounting (Q5458027) (← links)
Satisficing in Time-Sensitive Bandit Learning (Q5870357) (← links)
Entropy Regularization for Mean Field Games with Learning (Q5870374) (← links)
A Bayesian approach to (online) transfer learning: theory and algorithms (Q6066772) (← links)
Information theory for ranking and selection (Q6093553) (← links)
Reward Maximization Through Discrete Active Inference (Q6136191) (← links)
Lower bounds on the noiseless worst-case complexity of efficient global optimization (Q6536837) (← links)
Generalized probabilistic bisection for stochastic root finding (Q6600075) (← links)
Occupancy information ratio: infinite-horizon, information-directed, parameterized policy search (Q6652398) (← links)