Pages that link to "Item:Q2375396"
From MaRDI portal
The following pages link to An information-theoretic analysis of return maximization in reinforcement learning (Q2375396):
Displaying 3 items.
- Trading utility and uncertainty: applying the value of information to resolve the exploration-exploitation dilemma in reinforcement learning (Q2094051) (← links)
- The asymptotic equipartition property in reinforcement learning and its relation to return maximization (Q2488678) (← links)
- An information-theoretic analysis of Thompson sampling (Q2810878) (← links)