Pages that link to "Item:Q1614793"

From MaRDI portal

← Optimal learning and experimentation in bandit problems. (Q1614793)

The following pages link to Optimal learning and experimentation in bandit problems. (Q1614793):

Displaying 40 items.

Optimal experimental design for a class of bandit problems (Q618180) (← links)
Analyzing bandit-based adaptive operator selection mechanisms (Q647443) (← links)
Periodic learning about a hidden state variable (Q690170) (← links)
A comparative study of ad hoc techniques and evolutionary methods for multi-armed bandit problems (Q949395) (← links)
Online regret bounds for Markov decision processes with deterministic transitions (Q982638) (← links)
Response adaptive designs that incorporate switching costs and constraints (Q997275) (← links)
A Bayesian analysis of human decision-making on bandit problems (Q1042313) (← links)
An experimental analysis of the bandit problem (Q1361094) (← links)
Learning by doing and the value of optimal experimentation (Q1606181) (← links)
Keeping your options open (Q1657578) (← links)
A common value experimentation with multiarmed bandits (Q1720971) (← links)
The K-armed bandit problem with multiple priors (Q1736953) (← links)
Optimal stopping for Brownian motion with applications to sequential analysis and option pricing (Q1763432) (← links)
Exploration and correlation (Q2002350) (← links)
Gittins' theorem under uncertainty (Q2076662) (← links)
The multi-armed bandit problem: an efficient nonparametric solution (Q2176624) (← links)
On the optimal amount of experimentation in sequential decision problems (Q2267618) (← links)
Choosing a good toolkit. I: Prior-free heuristics (Q2291801) (← links)
Optimal learning of a set: or how to edit a journal if you must (Q2446253) (← links)
On the value of learning for Bernoulli bandits with unknown parameters (Q2730296) (← links)
Online learning methods for networking (Q2799529) (← links)
Optimal learning with non-Gaussian rewards (Q2806349) (← links)
Optimal sequential exploration: bandits, clairvoyants, and wildcats (Q2846423) (← links)
Variance Regularization in Sequential Bayesian Optimization (Q3387910) (← links)
A Learning Approach for Interactive Marketing to a Customer Segment (Q3392140) (← links)
Online Regret Bounds for Markov Decision Processes with Deterministic Transitions (Q3529915) (← links)
INDEXABILITY OF BANDIT PROBLEMS WITH RESPONSE DELAYS (Q3585147) (← links)
(Q4558161) (← links)
Machine learning and nonparametric bandit theory (Q4850249) (← links)
On Incomplete Learning and Certainty-Equivalence Control (Q4971399) (← links)
The Local Time Method for Targeting and Selection (Q4971570) (← links)
Optimistic Gittins Indices (Q5060515) (← links)
Learning in Combinatorial Optimization: What and How to Explore (Q5144784) (← links)
Algorithms for recursive delegation (Q5145458) (← links)
Optimal Learning by Experimentation (Q5202789) (← links)
Bandits and Experts in Metric Spaces (Q5215459) (← links)
ON THE IDENTIFICATION AND MITIGATION OF WEAKNESSES IN THE KNOWLEDGE GRADIENT POLICY FOR MULTI-ARMED BANDITS (Q5358114) (← links)
Corrected random walk approximations to free boundary problems in optimal stopping (Q5426468) (← links)
Sequential Generalized Likelihood Ratios and Adaptive Treatment Allocation for Optimal Sequential Selection (Q5478885) (← links)
Optimal anytime regret with two experts (Q6062702) (← links)

Retrieved from "https://mardi.schubotz.org/wiki/Special:WhatLinksHere/Item:Q1614793"