Pages that link to "Item:Q286519"
From MaRDI portal
The following pages link to A constrained optimization perspective on actor-critic algorithms and application to network routing (Q286519):
Displaying 5 items.
- An online actor-critic algorithm with function approximation for constrained Markov decision processes (Q438776) (← links)
- An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes (Q616967) (← links)
- Controller exploitation-exploration reinforcement learning architecture for computing near-optimal policies (Q2318167) (← links)
- On linear and super-linear convergence of natural policy gradient algorithm (Q2670744) (← links)
- Scalable $\epsilon$-Optimal Decision-Making and Stochastic Routing in Large Networks via Distributed Supervision of Probabilistic Automata (Q2930967) (← links)