Pages that link to "Item:Q5959973"

From MaRDI portal

The following pages link to Finite-time analysis of the multiarmed bandit problem (Q5959973):

Displaying 50 items.

Analysis of Hannan consistent selection for Monte Carlo tree search in simultaneous move games (Q2303656) (← links)
A bad arm existence checking problem: how to utilize asymmetric problem structure? (Q2303673) (← links)
sampling based automatic modulation classifier (Q2333091) (← links)
On the almost sure convergence of adaptive allocation procedures (Q2348729) (← links)
Multi-armed bandit processes with optimal selection of the operating times (Q2387146) (← links)
On learning and branching: a survey (Q2408515) (← links)
Good arm identification via bandit feedback (Q2425222) (← links)
Pure exploration in finitely-armed and continuous-armed bandits (Q2431430) (← links)
Preference-based reinforcement learning: evolutionary direct policy search using a preference-based racing algorithm (Q2514758) (← links)
Maximin effects in inhomogeneous large-scale data (Q2515497) (← links)
Multi-armed bandits based on a variant of simulated annealing (Q2520136) (← links)
Mechanisms with learning for stochastic multi-armed bandit problems (Q2520139) (← links)
Maximizing revenue for publishers using header bidding and ad exchange auctions (Q2661631) (← links)
Distributed cooperative decision making in multi-agent multi-armed bandits (Q2663944) (← links)
An index-based deterministic convergent optimal algorithm for constrained multi-armed bandit problems (Q2665165) (← links)
Deep neural networks algorithms for stochastic control problems on finite horizon: numerical applications (Q2671220) (← links)
Bayesian optimization with partially specified queries (Q2673324) (← links)
Customization of J. Bather's UCB strategy for a Gaussian multiarmed bandit (Q2689638) (← links)
Quantum greedy algorithms for multi-armed bandits (Q2693852) (← links)
Non-asymptotic analysis of a new bandit algorithm for semi-bounded rewards (Q2788426) (← links)
Truthful mechanisms with implicit payment computation (Q2796397) (← links)
Optimal learning with non-Gaussian rewards (Q2806349) (← links)
On modification of population-based search algorithms for convergence in stochastic combinatorial optimization (Q2808309) (← links)
Online collaborative filtering on graphs (Q2830757) (← links)
On the solution of stochastic optimization and variational problems in imperfect information regimes (Q2832894) (← links)
A Competitive Rate Allocation Game (Q2840913) (← links)
(Q3121140) (← links)
Linearly Parameterized Bandits (Q3169099) (← links)
Optimal Learning with Local Nonlinear Parametric Models over Continuous Designs (Q3303989) (← links)
On Solving Finite State Multi-Armed Bandit Problem by Linear Programming (Q3360683) (← links)
Polynomial-Time Algorithms for Multiple-Arm Identification with Full-Bandit Feedback (Q3386400) (← links)
Differentially Private and Budget-Limited Bandit Learning over Matroids (Q3386800) (← links)
Bayesian Incentive-Compatible Bandit Exploration (Q3387959) (← links)
Optimal Information Blending with Measurements in the $L^2$ Sphere (Q3465948) (← links)
Tuning Bandit Algorithms in Stochastic Environments (Q3520056) (← links)
Online Regret Bounds for Markov Decision Processes with Deterministic Transitions (Q3529915) (← links)
Active Learning in Multi-armed Bandits (Q3529929) (← links)
Reward-Modulated Hebbian Learning of Decision Making (Q3568365) (← links)
Pure Exploration in Multi-armed Bandits Problems (Q3648740) (← links)
(Q3734915) (← links)
Some memoryless bandit policies (Q4408546) (← links)
Optimal Learning for Nonlinear Parametric Belief Models Over Multidimensional Continuous Spaces (Q4554064) (← links)
(Q4558158) (← links)
(Q4558161) (← links)
(Q4558206) (← links)
(Q4558474) (← links)
(Q4558552) (← links)
Optimal Learning for Stochastic Optimization with Nonlinear Parametric Belief Models (Q4586173) (← links)
Learning the distribution with largest mean: two bandit frameworks (Q4606431) (← links)
Gaussian processes for computer experiments (Q4606435) (← links)