Pages that link to "Item:Q2834459"

What links here

⧼whatlinkshere-whatlinkshere-target⧽

Page:

⧼whatlinkshere-whatlinkshere-ns⧽

Namespace:

Invert selection

⧼whatlinkshere-whatlinkshere-filter⧽

Hide transclusions

Hide links

Hide redirects

The following pages link to Regularized policy iteration with nonparametric function spaces (Q2834459):

Displaying 15 items.

Model selection in reinforcement learning (Q415618) (← links)
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path (Q1009248) (← links)
On high-order differentiability of the policy function (Q1341453) (← links)
Multi-agent reinforcement learning: a selective overview of theories and algorithms (Q2094040) (← links)
Batch policy learning in average reward Markov decision processes (Q2112817) (← links)
Policy mirror descent for reinforcement learning: linear convergence, new sampling complexity, and generalized problem classes (Q2687069) (← links)
Analysis of classification-based policy iteration algorithms (Q2810787) (← links)
A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications (Q2887630) (← links)
Learning When-to-Treat Policies (Q5857115) (← links)
Off-Policy Estimation of Long-Term Average Outcomes With Applications to Mobile Health (Q5857153) (← links)
A mathematical perspective of machine learning (Q6118171) (← links)
A multiagent reinforcement learning framework for off-policy evaluation in two-sided markets (Q6138596) (← links)
Projected state-action balancing weights for offline reinforcement learning (Q6183753) (← links)
Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization (Q6631700) (← links)
Optimal policy evaluation using kernel-based temporal difference methods (Q6656605) (← links)