Pages that link to "Item:Q982638"
From MaRDI portal
The following pages link to Online regret bounds for Markov decision processes with deterministic transitions (Q982638):
Displaying 4 items.
- A perpetual search for talents across overlapping generations: a learning process (Q898767) (← links)
- Simple regret optimization in online planning for Markov decision processes (Q2921080) (← links)
- Online Markov Decision Processes (Q3169063) (← links)
- Markov Decision Processes with Arbitrary Reward Processes (Q3169064) (← links)