Online Network Revenue Management Using Thompson Sampling
Publication:5131540
DOI: 10.1287/opre.2018.1755; zbMath: 1446.90095; OpenAlex: W3122960105; MaRDI QID: Q5131540
He Wang, Kris Johnson Ferreira, David Simchi-Levi
Publication date: 8 November 2020
Published in: Operations Research
Full work available at URL: https://hdl.handle.net/1721.1/125757
Management decision making, including multiple objectives (90B50); Deterministic network models in operations research (90B10); Microeconomic theory (price theory and economic markets) (91B24)
Related Items (21)
Dynamic Learning and Market Making in Spread Betting Markets with Informed Bettors ⋮ Stochastic Optimization for Dynamic Pricing ⋮ Online Linear Programming: Dual Convergence, New Algorithms, and Regret Bounds ⋮ Constant Regret Resolving Heuristics for Price-Based Revenue Management ⋮ Online Resource Allocation with Personalized Learning ⋮ Multiproduct Pricing with Discrete Price Sets ⋮ Online weakly DR-submodular optimization with stochastic long-term constraints ⋮ Coordinating Pricing and Inventory Replenishment with Nonparametric Demand Learning ⋮ Bandits with Global Convex Constraints and Objective ⋮ A Data-Driven Functionally Robust Approach for Simultaneous Pricing and Order Quantity Decisions with Unknown Demand Function ⋮ Dynamic Inventory and Price Controls Involving Unknown Demand on Discrete Nonperishable Items ⋮ Simple Bayesian Algorithms for Best-Arm Identification ⋮ Algorithms for Online Matching, Assortment, and Pricing with Tight Weight-Dependent Competitive Ratios ⋮ Dynamic pricing with finite price sets: a non-parametric approach ⋮ Nonparametric Self-Adjusting Control for Joint Learning and Optimization of Multiproduct Pricing with Finite Resource Capacity ⋮ Technical Note—On Revenue Management with Strategic Customers Choosing When and What to Buy ⋮ Technical Note—Joint Learning and Optimization of Multi-Product Pricing with Finite Resource Capacity and Unknown Demand Parameters ⋮ Matching While Learning ⋮ Nonparametric Learning Algorithms for Joint Pricing and Inventory Control with Lost Sales and Censored Demand ⋮ Unnamed Item ⋮ A Primal–Dual Learning Algorithm for Personalized Dynamic Pricing with an Inventory Constraint
Cites Work
- Simple Policies for Dynamic Pricing with Imperfect Forecasts
- Close the Gaps: A Learning-While-Doing Algorithm for Single-Product Revenue Management Problems
- A Re-Solving Heuristic with Bounded Revenue Loss for Network Revenue Management with Customer Choice
- Dynamic Pricing with an Unknown Demand Model: Asymptotically Optimal Semi-Myopic Policies
- Dynamic Pricing for Nonperishable Products with Demand Learning
- Dynamic Pricing Without Knowing the Demand Function: Risk Bounds and Near-Optimal Algorithms
- Dynamic Pricing with a Prior on Market Response
- Thompson Sampling: An Asymptotically Optimal Finite-Time Analysis
- Linearly Parameterized Bandits
- Performance of an LP-Based Control for Revenue Management with Unknown Demand Parameters
- Asymptotic Behavior of an Allocation Policy for Revenue Management
- Optimal Dynamic Pricing of Inventories with Stochastic Demand over Finite Horizons
- A Multiproduct Dynamic Pricing Problem and Its Applications to Network Yield Management
- Blind Network Revenue Management
- Learning to Optimize via Posterior Sampling
- Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems
- On the Asymptotic Behavior of Bayes' Estimates in the Discrete Case
- Finite-time analysis of the multiarmed bandit problem