Pages that link to "Item:Q1771225"
From MaRDI portal
The following pages link to A reinforcement learning algorithm based on policy iteration for average reward: Empirical results with yield management and convergence analysis (Q1771225):
Displaying 10 items.
- Simulation optimization for revenue management of airlines with cancellations and overbooking (Q858602) (← links)
- Dynamic cruise ship revenue management (Q992627) (← links)
- Least squares approximate policy iteration for learning bid prices in choice-based revenue management (Q1652041) (← links)
- Minimizing mean weighted tardiness in unrelated parallel machine scheduling with reinforcement learning (Q1762118) (← links)
- Reinforcement learning for joint pricing, lead-time and scheduling decisions in make-to-order systems (Q1926824) (← links)
- Convergence of deep fictitious play for stochastic differential games (Q2170300) (← links)
- Integrated revenue management approaches for capacity control with planned upgrades (Q2253355) (← links)
- Look-ahead control of conveyor-serviced production station by using potential-based online policy iteration (Q3654586) (← links)
- A perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costs (Q5227201) (← links)
- A reinforcement learning approach to distribution-free capacity allocation for sea cargo revenue management (Q6092091) (← links)