Dynamic Learning and Decision Making via Basis Weight Vectors
From MaRDI portal
Publication:5095179
DOI10.1287/opre.2021.2240zbMath1497.91101OpenAlexW4210906234WikidataQ114058133 ScholiaQ114058133MaRDI QIDQ5095179
Publication date: 5 August 2022
Published in: Operations Research (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1287/opre.2021.2240
approximate dynamic programmingbasis representation of functionsdynamic pricing with learninglearning and doinglinear contextual bandits
Uses Software
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- A survey of solution techniques for the partially observed Markov decision process
- A survey of algorithmic methods for partially observed Markov decision processes
- Partially Observed Markov Decision Processes
- Information Relaxations, Duality, and Convex Stochastic Dynamic Programs
- Approximate Dynamic Programming
- Information Relaxations and Duality in Stochastic Dynamic Programs
- Dynamic Pricing for Nonperishable Products with Demand Learning
- Dynamic Pricing Without Knowing the Demand Function: Risk Bounds and Near-Optimal Algorithms
- Dynamic Pricing with a Prior on Market Response
- Partially Observable Markov Decision Processes: A Geometric Technique and Analysis
- A Partially Observed Markov Decision Process for Dynamic Pricing
- Dynamic Assortment with Demand Learning for Seasonal Consumer Goods
- Dynamic Pricing Under a General Parametric Choice Model
- Linearly Parameterized Bandits
- A Learning Approach for Interactive Marketing to a Customer Segment
- Investment Timing with Incomplete Information and Multiple Means of Learning
- Controlling a Stochastic Process with Unknown Parameters
- State of the Art—A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms
- Optimal Experimentation in a Changing Environment
- The Optimal Control of Partially Observable Markov Processes over a Finite Horizon
- 10.1162/153244303321897663
- On Incomplete Learning and Certainty-Equivalence Control
- Dynamic Selling Mechanisms for Product Differentiation and Learning
- Implementation and Parallelization of a Reverse-Search Algorithm for Minkowski Sums
- Some aspects of the sequential design of experiments
- Sequential Tests of Statistical Hypotheses
This page was built for publication: Dynamic Learning and Decision Making via Basis Weight Vectors