The following pages link to (Q3997575):
Displaying 50 items.
- Learning strict Nash equilibria through reinforcement (Q2441216) (← links)
- Markovian stochastic approximation with expanding projections (Q2448703) (← links)
- Transient and asymptotic dynamics of reinforcement learning in games (Q2462286) (← links)
- Adaptive stepsize selection for tracking in a regime-switching environment (Q2470042) (← links)
- Extensions of stochastic optimization results to problems with system failure probability functions (Q2471082) (← links)
- A behavioral stock market model (Q2482683) (← links)
- Algorithms and networks for accelerated convergence of adaptive LDA (Q2485059) (← links)
- Sufficient and necessary condition for the convergence of stochastic approximation algorithms (Q2489838) (← links)
- A theory on flat histogram Monte Carlo algorithms (Q2492779) (← links)
- Design and analysis of linear precoders under a mean square error criterion. II: MMSE designs and conclusions (Q2503496) (← links)
- Avoidance of traps in stochastic approximation (Q2503522) (← links)
- Linear stochastic approximation driven by slowly varying Markov chains (Q2503529) (← links)
- An actor-critic algorithm for constrained Markov decision processes (Q2504518) (← links)
- A dual purpose principal and minor component flow (Q2504574) (← links)
- Gradient free parameter estimation for hidden Markov models with intractable likelihoods (Q2516386) (← links)
- A Liapounov bound for solutions of the Poisson equation (Q2563941) (← links)
- Attainability of boundary points under reinforcement learning (Q2577444) (← links)
- A partial history of the early development of continuous-time nonlinear stochastic systems theory (Q2628408) (← links)
- Two-timescale stochastic gradient descent in continuous time with applications to joint online parameter estimation and optimal sensor placement (Q2692526) (← links)
- Convergence of Markovian stochastic approximation with discontinuous dynamics (Q2799358) (← links)
- Optimal Delaunay and Voronoi Quantization Schemes for Pricing American Style Options (Q2917431) (← links)
- Weak Convergence Rates of Population Versus Single-Chain Stochastic Approximation MCMC Algorithms (Q2939266) (← links)
- Asymptotically optimal quantization schemes for Gaussian processes on Hilbert spaces (Q3085571) (← links)
- (Q4220656) (← links)
- (Q4362979) (← links)
- Optimal order placement in limit order markets (Q4555056) (← links)
- ESTIMATING STRUCTURAL PARAMETERS IN REGRESSION MODELS WITH ADAPTIVE LEARNING (Q4599617) (← links)
- Stochastic Gradient Descent in Continuous Time (Q4607057) (← links)
- Coupling a stochastic approximation version of EM with an MCMC procedure (Q4671811) (← links)
- Averaging analysis of a point process adaptive algorithm (Q4822473) (← links)
- A universal procedure for parametric frailty models (Q4825481) (← links)
- (Q4909837) (← links)
- A Resampling-Based Stochastic Approximation Method for Analysis of Large Geostatistical Data (Q4916950) (← links)
- Stochastic approximation schemes for economic capital and risk margin computations (Q4967869) (← links)
- Online linear and quadratic discriminant analysis with adaptive forgetting for streaming classification (Q4969831) (← links)
- On Information Distortions in Online Ratings (Q4971363) (← links)
- (Q4999045) (← links)
- (Q4999096) (← links)
- On the fast convergence of random perturbations of the gradient flow (Q5000006) (← links)
- Score-Based Parameter Estimation for a Class of Continuous-Time State Space Models (Q5005015) (← links)
- Finite-Time Analysis and Restarting Scheme for Linear Two-Time-Scale Stochastic Approximation (Q5009779) (← links)
- Convergence of Recursive Stochastic Algorithms Using Wasserstein Divergence (Q5018894) (← links)
- Continuous-time reinforcement learning for robust control under worst-case uncertainty (Q5028006) (← links)
- Approximating quasi-stationary distributions with interacting reinforced random walks (Q5030240) (← links)
- Stochastic approximation Hamiltonian Monte Carlo (Q5033464) (← links)
- Simple and Optimal Methods for Stochastic Variational Inequalities, II: Markovian Noise and Policy Evaluation in Reinforcement Learning (Q5081106) (← links)
- Constant step stochastic approximations involving differential inclusions: stability, long-run convergence and applications (Q5086426) (← links)
- Stochastic averaging principle for two-time-scale jump-diffusion SDEs under the non-Lipschitz coefficients (Q5086701) (← links)
- A Concentration Bound for Stochastic Approximation via Alekseev’s Formula (Q5113889) (← links)
- Stochastic Gradient Descent in Continuous Time: A Central Limit Theorem (Q5119415) (← links)