Pages that link to "Item:Q438776"
From MaRDI portal
The following pages link to An online actor-critic algorithm with function approximation for constrained Markov decision processes (Q438776):
Displaying 13 items.
- A constrained optimization perspective on actor-critic algorithms and application to network routing (Q286519)
- Multiscale Q-learning with linear function approximation (Q312650)
- An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes (Q616967)
- Event-based optimization approach for solving stochastic decision problems with probabilistic constraint (Q828677)
- Variance-constrained actor-critic algorithms for discounted and average reward MDPs (Q1689603)
- Suboptimal control for nonlinear systems with disturbance via integral sliding mode control and policy iteration (Q2178900)
- Learning algorithms for finite horizon constrained Markov decision processes (Q2468856)
- An actor-critic algorithm for constrained Markov decision processes (Q2504518)
- Risk-Constrained Reinforcement Learning with Percentile Risk Criteria (Q4558492)
- Queueing Network Controls via Deep Reinforcement Learning (Q5084497)
- An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions (Q5380403)
- On the sample complexity of actor-critic method for reinforcement learning with function approximation (Q6134324)
- Optimal deterministic controller synthesis from steady-state distributions (Q6156635)