Pages that link to "Item:Q1095048"
From MaRDI portal
The following pages link to A unified approach to adaptive control of average reward Markov decision processes (Q1095048):
Displaying 5 items.
- A unified approach to time-aggregated Markov decision processes (Q259403) (← links)
- Adaptive aggregation for reinforcement learning in average reward Markov decision processes (Q378753) (← links)
- Estimation and control in multichain processes (Q1176867) (← links)
- Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes (Q3984139) (← links)
- Optimal Adaptive Policies for Markov Decision Processes (Q4339383) (← links)