Dynamic Learning and Decision Making via Basis Weight Vectors

DOI10.1287/opre.2021.2240zbMath1497.91101OpenAlexW4210906234WikidataQ114058133 ScholiaQ114058133MaRDI QIDQ5095179

Publication date: 5 August 2022

Published in: Operations Research (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1287/opre.2021.2240

zbMATH Keywords

approximate dynamic programming basis representation of functions dynamic pricing with learning learning and doing linear contextual bandits

Mathematics Subject Classification ID

Decision theory (91B06) Dynamic programming (90C39)

Uses Software

Minksum

Cites Work

Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
A survey of solution techniques for the partially observed Markov decision process
A survey of algorithmic methods for partially observed Markov decision processes
Partially Observed Markov Decision Processes
Information Relaxations, Duality, and Convex Stochastic Dynamic Programs
Approximate Dynamic Programming
Information Relaxations and Duality in Stochastic Dynamic Programs
Dynamic Pricing for Nonperishable Products with Demand Learning
Dynamic Pricing Without Knowing the Demand Function: Risk Bounds and Near-Optimal Algorithms
Dynamic Pricing with a Prior on Market Response
Partially Observable Markov Decision Processes: A Geometric Technique and Analysis
A Partially Observed Markov Decision Process for Dynamic Pricing
Dynamic Assortment with Demand Learning for Seasonal Consumer Goods
Dynamic Pricing Under a General Parametric Choice Model
Linearly Parameterized Bandits
A Learning Approach for Interactive Marketing to a Customer Segment
Investment Timing with Incomplete Information and Multiple Means of Learning
Controlling a Stochastic Process with Unknown Parameters
State of the Art—A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms
Optimal Experimentation in a Changing Environment
The Optimal Control of Partially Observable Markov Processes over a Finite Horizon
10.1162/153244303321897663
On Incomplete Learning and Certainty-Equivalence Control
Dynamic Selling Mechanisms for Product Differentiation and Learning
Implementation and Parallelization of a Reverse-Search Algorithm for Minkowski Sums
Some aspects of the sequential design of experiments
Sequential Tests of Statistical Hypotheses

This page was built for publication: Dynamic Learning and Decision Making via Basis Weight Vectors