Pages that link to "Item:Q1427588"
From MaRDI portal
The following pages link to Reinforcement learning for long-run average cost. (Q1427588):
Displaying 16 items.
- A reinforcement-learning approach for admission control in distributed network service systems (Q266057) (← links)
- Approximate dynamic programming for capacity allocation in the service industry (Q439484) (← links)
- A policy gradient method for semi-Markov decision processes with application to call admission control (Q859693) (← links)
- Reinforcement learning algorithms with function approximation: recent advances and applications (Q903601) (← links)
- Minimizing mean weighted tardiness in unrelated parallel machine scheduling with reinforcement learning (Q1762118) (← links)
- A reinforcement learning algorithm based on policy iteration for average reward: Empirical results with yield management and convergence analysis (Q1771225) (← links)
- Average cost temporal-difference learning (Q1805802) (← links)
- Reinforcement learning for joint pricing, lead-time and scheduling decisions in make-to-order systems (Q1926824) (← links)
- A performance-centred approach to optimising maintenance of complex systems (Q2030609) (← links)
- A sojourn-based approach to semi-Markov reinforcement learning (Q2149523) (← links)
- Application of reinforcement learning to the game of Othello (Q2462546) (← links)
- Reinforcement learning based algorithms for average cost Markov decision processes (Q2643632) (← links)
- Learning algorithms for Markov decision processes with average cost (Q2753225) (← links)
- Reinforcement learning: a tutorial survey and recent advances (Q2901057) (← links)
- Solving Semi-Markov Decision Problems Using Average Reward Reinforcement Learning (Q3116659) (← links)
- Look-ahead control of conveyor-serviced production station by using potential-based online policy iteration (Q3654586) (← links)