Pages that link to "Item:Q2220059"
From MaRDI portal
The following pages link to Learning parametric policies and transition probability models of Markov decision processes from data (Q2220059):
Displaying 5 items.
- Introduction to internally consistent modeling, aggregation, inference, and policy (Q472739) (← links)
- Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation (Q889297) (← links)
- \(L^\ast\)-based learning of Markov decision processes (extended version) (Q1982638) (← links)
- Provably efficient learning with typed parametric models (Q2880957) (← links)
- Learning Markov Models Via Low-Rank Optimization (Q5106374) (← links)