Pages that link to "Item:Q2880979"
From MaRDI portal
The following pages link to Reinforcement learning in finite MDPs: PAC analysis (Q2880979):
Displaying 21 items.
- Extreme state aggregation beyond Markov decision processes (Q329613) (← links)
- Hybrid answer set programming (Q392277) (← links)
- Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model (Q399890) (← links)
- Knows what it knows: a framework for self-aware learning (Q413843) (← links)
- Near-optimal PAC bounds for discounted MDPs (Q465258) (← links)
- Reducing reinforcement learning to KWIK online regression (Q616761) (← links)
- An analysis of model-based interval estimation for Markov decision processes (Q959899) (← links)
- Reinforcement learning with immediate rewards and linear hypotheses (Q1762980) (← links)
- An information-theoretic analysis of return maximization in reinforcement learning (Q2375396) (← links)
- Efficient PAC learning for episodic tasks with acyclic state spaces (Q2465672) (← links)
- Provably efficient learning with typed parametric models (Q2880957) (← links)
- PAC Bounds for Discounted MDPs (Q3164829) (← links)
- Solving for Best Responses and Equilibria in Extensive-Form Games with Reinforcement Learning Methods (Q3299845) (← links)
- Variance Reduced Value Iteration and Faster Algorithms for Solving Markov Decision Processes (Q4607932) (← links)
- (Q4998915) (← links)
- (Q5053310) (← links)
- (Q5149240) (← links)
- (Q5214220) (← links)
- Identity concealment games: how I learned to stop revealing and love the coincidences (Q6119741) (← links)
- Recent advances in reinforcement learning in finance (Q6146668) (← links)
- Controlling estimation error in reinforcement learning via reinforced operation (Q6556447) (← links)