Pages that link to "Item:Q5959973"
From MaRDI portal
The following pages link to Finite-time analysis of the multiarmed bandit problem (Q5959973):
Displaying 50 items.
- Analysis of Hannan consistent selection for Monte Carlo tree search in simultaneous move games (Q2303656) (← links)
- A bad arm existence checking problem: how to utilize asymmetric problem structure? (Q2303673) (← links)
- sampling based automatic modulation classifier (Q2333091) (← links)
- On the almost sure convergence of adaptive allocation procedures (Q2348729) (← links)
- Multi-armed bandit processes with optimal selection of the operating times (Q2387146) (← links)
- On learning and branching: a survey (Q2408515) (← links)
- Good arm identification via bandit feedback (Q2425222) (← links)
- Pure exploration in finitely-armed and continuous-armed bandits (Q2431430) (← links)
- Preference-based reinforcement learning: evolutionary direct policy search using a preference-based racing algorithm (Q2514758) (← links)
- Maximin effects in inhomogeneous large-scale data (Q2515497) (← links)
- Multi-armed bandits based on a variant of simulated annealing (Q2520136) (← links)
- Mechanisms with learning for stochastic multi-armed bandit problems (Q2520139) (← links)
- Maximizing revenue for publishers using header bidding and ad exchange auctions (Q2661631) (← links)
- Distributed cooperative decision making in multi-agent multi-armed bandits (Q2663944) (← links)
- An index-based deterministic convergent optimal algorithm for constrained multi-armed bandit problems (Q2665165) (← links)
- Deep neural networks algorithms for stochastic control problems on finite horizon: numerical applications (Q2671220) (← links)
- Bayesian optimization with partially specified queries (Q2673324) (← links)
- Customization of J. Bather's UCB strategy for a Gaussian multiarmed bandit (Q2689638) (← links)
- Quantum greedy algorithms for multi-armed bandits (Q2693852) (← links)
- Non-asymptotic analysis of a new bandit algorithm for semi-bounded rewards (Q2788426) (← links)
- Truthful mechanisms with implicit payment computation (Q2796397) (← links)
- Optimal learning with non-Gaussian rewards (Q2806349) (← links)
- On modification of population-based search algorithms for convergence in stochastic combinatorial optimization (Q2808309) (← links)
- Online collaborative filtering on graphs (Q2830757) (← links)
- On the solution of stochastic optimization and variational problems in imperfect information regimes (Q2832894) (← links)
- A Competitive Rate Allocation Game (Q2840913) (← links)
- (Q3121140) (← links)
- Linearly Parameterized Bandits (Q3169099) (← links)
- Optimal Learning with Local Nonlinear Parametric Models over Continuous Designs (Q3303989) (← links)
- On Solving Finite State Multi-Armed Bandit Problem by Linear Programming (Q3360683) (← links)
- Polynomial-Time Algorithms for Multiple-Arm Identification with Full-Bandit Feedback (Q3386400) (← links)
- Differentially Private and Budget-Limited Bandit Learning over Matroids (Q3386800) (← links)
- Bayesian Incentive-Compatible Bandit Exploration (Q3387959) (← links)
- Optimal Information Blending with Measurements in the L² Sphere (Q3465948) (← links)
- Tuning Bandit Algorithms in Stochastic Environments (Q3520056) (← links)
- Online Regret Bounds for Markov Decision Processes with Deterministic Transitions (Q3529915) (← links)
- Active Learning in Multi-armed Bandits (Q3529929) (← links)
- Reward-Modulated Hebbian Learning of Decision Making (Q3568365) (← links)
- Pure Exploration in Multi-armed Bandits Problems (Q3648740) (← links)
- (Q3734915) (← links)
- Some memoryless bandit policies (Q4408546) (← links)
- Optimal Learning for Nonlinear Parametric Belief Models Over Multidimensional Continuous Spaces (Q4554064) (← links)
- (Q4558158) (← links)
- (Q4558161) (← links)
- (Q4558206) (← links)
- (Q4558474) (← links)
- (Q4558552) (← links)
- Optimal Learning for Stochastic Optimization with Nonlinear Parametric Belief Models (Q4586173) (← links)
- Learning the distribution with largest mean: two bandit frameworks (Q4606431) (← links)
- Gaussian processes for computer experiments (Q4606435) (← links)