The following pages link to (Q5744808):
Displaying 49 items.
- An iterative scheme of safe reinforcement learning for nonlinear systems via barrier certificate generation (Q832198)
- Enforcing almost-sure reachability in POMDPs (Q832296)
- Probabilistic guarantees for safe deep reinforcement learning (Q1996032)
- Risk-averse autonomous systems: a brief history and recent developments from the perspective of optimal control (Q2082497)
- Risk-averse policy optimization via risk-neutral policy optimization (Q2082514)
- Multi-agent reinforcement learning: a selective overview of theories and algorithms (Q2094040)
- Reinforcement learning: an industrial perspective (Q2094053)
- Learning-based vs model-free adaptive control of a MAV under wind gust (Q2101765)
- Lifted model checking for relational MDPs (Q2102421)
- A predictive safety filter for learning-based control of constrained nonlinear dynamical systems (Q2665095)
- Reinforcement learning control of constrained dynamic systems with uniformly ultimate boundedness stability guarantee (Q2665179)
- Sim-to-lab-to-real: safe reinforcement learning with shielding and generalization guarantees (Q2680780)
- Deceptive Reinforcement Learning Under Adversarial Manipulations on Cost Signals (Q3297666)
- Experience selection in deep reinforcement learning for control (Q4558146)
- Safe Exploration of State and Action Spaces in Reinforcement Learning (Q4899127)
- An Interpretable Graph-Based Mapping of Trustworthy Machine Learning Research (Q5050960)
- Learning for Constrained Optimization: Identifying Optimal Active Constraint Sets (Q5084662)
- (Q5089265)
- Nonconvex Policy Search Using Variational Inequalities (Q5380851)
- Constrained Multiagent Markov Decision Processes: a Taxonomy of Problems and Algorithms (Q5856487)
- Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning (Q5870485)
- Robust Control for Dynamical Systems with Non-Gaussian Noise via Formal Abstractions (Q5881801)
- Markov decision processes with burstiness constraints (Q6087483)
- Safe reinforcement learning: A control barrier function optimization approach (Q6089847)
- Smoothing policies and safe policy gradients (Q6097096)
- Risk-averse optimization of reward-based coherent risk measures (Q6098851)
- Dynamic shielding for reinforcement learning in black-box environments (Q6103158)
- Risk-aware controller for autonomous vehicles using model-based collision prediction and reinforcement learning (Q6103673)
- Model Checking for Safe Navigation Among Humans (Q6104810)
- Safety-constrained reinforcement learning with a distributional safety critic (Q6106435)
- Off‐policy model‐based end‐to‐end safe reinforcement learning (Q6117696)
- Inverse reinforcement learning through logic constraint inference (Q6134331)
- Certified reinforcement learning with logic guidance (Q6136089)
- Multi-task safe reinforcement learning for navigating intersections in dense traffic (Q6136441)
- Conditionally Elicitable Dynamic Risk Measures for Deep Reinforcement Learning (Q6143823)
- Safe multi-agent reinforcement learning for multi-robot control (Q6161500)
- Approval-directed agency and the decision theory of Newcomb-like problems (Q6182772)
- Almost surely safe exploration and exploitation for deep reinforcement learning with state safety estimation (Q6495127)
- A learner-verifier framework for neural network controllers and certificates of stochastic systems (Q6535337)
- A survey of learning criteria going beyond the usual risk (Q6535427)
- Reinforcement learning for linear exponential quadratic Gaussian problem (Q6540839)
- Probabilistic counterexample guidance for safer reinforcement learning (Q6546466)
- A reinforcement learning based dynamic room pricing model for hotel industry (Q6557738)
- Finding the optimal exploration-exploitation trade-off online through Bayesian risk estimation and minimization (Q6566614)
- Federated reinforcement learning for robot motion planning with zero-shot generalization (Q6574451)
- Verifying the generalization of deep learning to out-of-distribution domains (Q6611966)
- Sampled-data funnel control and its use for safe continual learning (Q6636452)
- A stabilizing reinforcement learning approach for sampled systems with partially unknown models (Q6646984)
- Safe reinforcement learning-based control using deep deterministic policy gradient algorithm and slime mould algorithm with experimental tower crane system validation (Q6665123)