Regret lower bound and optimal algorithm for high-dimensional contextual linear bandit
From MaRDI portal
Publication:2074307
DOI10.1214/21-EJS1909zbMath1493.62477arXiv2109.11612OpenAlexW3202768707MaRDI QIDQ2074307
Yun Yang, Ke Li, Naveen Naidu Narisetty
Publication date: 9 February 2022
Published in: Electronic Journal of Statistics (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/2109.11612
Cites Work
- Unnamed Item
- Nearly unbiased variable selection under minimax concave penalty
- The lower tail of random quadratic forms with applications to ordinary least squares
- Statistics for high-dimensional data. Methods, theory and applications.
- Woodroofe's one-armed bandit problem revisited
- Fast learning rates for plug-in classifiers
- Reinforcement learning with immediate rewards and linear hypotheses
- Reconstruction From Anisotropic Random Measurements
- Linearly Parameterized Bandits
- Decoding by Linear Programming
- Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties
- High-Dimensional Statistics
- 10.1162/153244303321897663
- Online Decision Making with High-Dimensional Covariates
- A linear response bandit problem
- Finite-time analysis of the multiarmed bandit problem
This page was built for publication: Regret lower bound and optimal algorithm for high-dimensional contextual linear bandit