Pages that link to "Item:Q2806349"
From MaRDI portal
The following pages link to Optimal learning with non-Gaussian rewards (Q2806349):
Displaying 9 items.
- Lévy bandits: Multi-armed bandits driven by Lévy processes (Q1901088) (← links)
- Undiscounted bandit games (Q2212738) (← links)
- Optimal stopping problems in Lévy models with random observations (Q2334743) (← links)
- Optimal learning with Q-aggregation (Q2448729) (← links)
- ∊-Optimal nonlinear reinforcement scheme under a nonstationary multiteacher environment (Q3219111) (← links)
- Optimal Learning for Stochastic Optimization with Nonlinear Parametric Belief Models (Q4586173) (← links)
- A Framework of Learning Through Empirical Gain Maximization (Q5004380) (← links)
- Learning Preferences Under Noise and Loss Aversion: An Optimization Approach (Q5166275) (← links)
- ON THE IDENTIFICATION AND MITIGATION OF WEAKNESSES IN THE KNOWLEDGE GRADIENT POLICY FOR MULTI-ARMED BANDITS (Q5358114) (← links)