Pages that link to "Item:Q2151247"
From MaRDI portal
The following pages link to Value iteration for long-run average reward in Markov decision processes (Q2151247):
Displaying 12 items.
- Economic design of memory-type control charts: the fallacy of the formula proposed by Lorenzen and Vance (1986) (Q1995869) (← links)
- Multi-objective optimization of long-run average and total rewards (Q2044201) (← links)
- Markov automata with multiple objectives (Q2151241) (← links)
- Value iteration for simple stochastic games: stopping criterion and learning algorithm (Q2672267) (← links)
- Approximate Value Iteration with Temporally Extended Actions (Q2941739) (← links)
- Long-Run Rewards for Markov Automata (Q3303930) (← links)
- Value Iteration in a Class of Communicating Markov Decision Chains with the Average Cost Criterion (Q3837277) (← links)
- (Q5688839) (← links)
- (Q5875366) (← links)
- (Q5875369) (← links)
- PAC Statistical Model Checking of Mean Payoff in Discrete- and Continuous-Time MDP (Q6487329) (← links)
- Correct approximation of stationary distributions (Q6535371) (← links)