The following pages link to (Q4626283):
Displaying 50 items.
- Adaptive importance sampling for control and inference (Q290478) (← links)
- Using reinforcement learning to find an optimal set of features (Q316296) (← links)
- Machine learning in agent-based stochastic simulation: inferential theory and evaluation in transportation logistics (Q356384) (← links)
- Reinforcement learning: exploration-exploitation dilemma in multi-agent foraging task (Q505118) (← links)
- Attack allocation on remote state estimation in multi-systems: structural results and asymptotic solution (Q680517) (← links)
- Imitation learning of car driving skills with decision trees and random forests (Q747406) (← links)
- Approximate-optimal control algorithm for constrained zero-sum differential games through event-triggering mechanism (Q783330) (← links)
- Collective behavior of artificial intelligence population: transition from optimization to game (Q784096) (← links)
- Event-based optimization approach for solving stochastic decision problems with probabilistic constraint (Q828677) (← links)
- A convex optimization approach to dynamic programming in continuous state and action spaces (Q831365) (← links)
- Model-free reinforcement learning for branching Markov decision processes (Q832301) (← links)
- Tutorial series on brain-inspired computing. IV: Reinforcement learning: machine learning and natural learning (Q867508) (← links)
- Multi-objective optimization of water-using systems (Q877675) (← links)
- Real-time dynamic programming for Markov decision processes with imprecise probabilities (Q901046) (← links)
- Perception control (Q1591784) (← links)
- A Markovian mechanism of proportional resource allocation in the incentive model as a dynamic stochastic inverse Stackelberg game (Q1634389) (← links)
- Unfazed by both the bull and bear: strategic exploration in dynamic environments (Q1651798) (← links)
- Open problems in universal induction \& intelligence (Q1662486) (← links)
- On the computability of Solomonoff induction and AIXI (Q1704559) (← links)
- Reinforcement learning for a class of continuous-time input constrained optimal control problems (Q1716659) (← links)
- Shape constraints in economics and operations research (Q1730901) (← links)
- Refinement of the four-dimensional GLV method on elliptic curves (Q1746951) (← links)
- Post-quantum static-static key agreement using multiple protocol instances (Q1746952) (← links)
- Efficient reductions in cyclotomic rings -- application to Ring LWE based FHE schemes (Q1746962) (← links)
- Reinforcement learning with via-point representation (Q1883866) (← links)
- A projected primal-dual gradient optimal control method for deep reinforcement learning (Q1980960) (← links)
- Pitfalls in quantifying exploration in reward-based motor learning and how to avoid them (Q1981976) (← links)
- Methods for improving the efficiency of swarm optimization algorithms. A survey (Q1982839) (← links)
- Qualitative case-based reasoning and learning (Q1989398) (← links)
- The concept of constructing an artificial dispatcher intelligent system based on deep reinforcement learning for the automatic control system of electric networks (Q1995375) (← links)
- Deep reinforcement learning with temporal logics (Q1996007) (← links)
- Clustering in block Markov chains (Q1996780) (← links)
- An online-learning-based evolutionary many-objective algorithm (Q1999048) (← links)
- Learning output reference model tracking for higher-order nonlinear systems with unknown dynamics (Q2004902) (← links)
- Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards (Q2006767) (← links)
- Adaptive learning in large populations (Q2007697) (← links)
- Models and measures of animal aggregation and dispersal (Q2010864) (← links)
- Bayesian adversarial multi-node bandit for optimal smart grid protection against cyber attacks (Q2021298) (← links)
- A linear programming methodology for approximate dynamic programming (Q2023646) (← links)
- Machine learning for combinatorial optimization: a methodological tour d'horizon (Q2029358) (← links)
- Deep hedging of long-term financial derivatives (Q2038257) (← links)
- On satisficing in quantitative games (Q2044188) (← links)
- Markov decision processes with dynamic transition probabilities: an analysis of shooting strategies in basketball (Q2044233) (← links)
- On the finite horizon optimal switching problem with random lag (Q2045122) (← links)
- Negotiating team formation using deep reinforcement learning (Q2046007) (← links)
- The voice of optimization (Q2051238) (← links)
- Concentration bounds for temporal difference learning with linear function approximation: the case of batch data and uniform sampling (Q2051259) (← links)
- Importance sampling in reinforcement learning with an estimated behavior policy (Q2051319) (← links)
- Boltzmann distributed replicator dynamics: population games in a microgrid context (Q2052487) (← links)
- Accelerating reinforcement learning with a directional-Gaussian-smoothing evolution strategy (Q2055215) (← links)