scientific article
From MaRDI portal
Publication:3997575
zbMath0752.93073MaRDI QIDQ3997575
Michel Métivier, Pierre Priouret, Albert Benveniste
Publication date: 17 September 1992
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
stochastic approximationfilteringadaptive algorithmsdiffusion approximationsaveraging principlesystems description
Asymptotic distribution theory in statistics (62E20) Filtering in stochastic control theory (93E11) Adaptive control/observation systems (93C40) Identification in stochastic control theory (93E12) Applications of stochastic analysis (to PDEs, etc.) (60H30)
Related Items (only showing first 100 items - show all)
Stochastic approximation Hamiltonian Monte Carlo ⋮ Unbiased estimation of the gradient of the log-likelihood for a class of continuous-time state-space models ⋮ Interacting generalized Friedman's urn systems ⋮ Accelerating mini-batch SARAH by step size rules ⋮ Online adjoint methods for optimization of PDEs ⋮ A partial history of the early development of continuous-time nonlinear stochastic systems theory ⋮ Reference points and learning ⋮ Stochastic approximation to understand simple simulation models ⋮ Quantile estimation with adaptive importance sampling ⋮ Annealing stochastic approximation Monte Carlo algorithm for neural network training ⋮ Learning in perturbed asymmetric games ⋮ Optimal order placement in limit order markets ⋮ Optimal Delaunay and Voronoi Quantization Schemes for Pricing American Style Options ⋮ Approximating the operating characteristics of Bayesian uncertainty directed trial designs ⋮ Convergence of constant step stochastic gradient descent for non-smooth non-convex functions ⋮ An adaptively weighted stochastic gradient MCMC algorithm for Monte Carlo simulation and global optimization ⋮ Averaging analysis of a point process adaptive algorithm ⋮ A universal procedure for parametric frailty models ⋮ Simple and Optimal Methods for Stochastic Variational Inequalities, II: Markovian Noise and Policy Evaluation in Reinforcement Learning ⋮ Network games; adaptations to Nash-Cournot equilibrium ⋮ The method of averaged models for discrete-time adaptive systems ⋮ Constant step stochastic approximations involving differential inclusions: stability, long-run convergence and applications ⋮ Stochastic averaging principle for two-time-scale jump-diffusion SDEs under the non-Lipschitz coefficients ⋮ Weak Convergence Rates of Population Versus Single-Chain Stochastic Approximation MCMC Algorithms ⋮ An algorithm for blind equalization and synchronization ⋮ Stochastic Nelder-Mead simplex method -- a new globally convergent direct search method for simulation optimization ⋮ A generalized urn with multiple drawing and random addition ⋮ An approximation of the distribution of learning estimates in macroeconomic models ⋮ Real time estimation of stochastic volatility processes ⋮ Exchanges and measures of risks ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Bridging the gap between constant step size stochastic gradient descent and Markov chains ⋮ ESTIMATING STRUCTURAL PARAMETERS IN REGRESSION MODELS WITH ADAPTIVE LEARNING ⋮ A Concentration Bound for Stochastic Approximation via Alekseev’s Formula ⋮ Two-timescale stochastic gradient descent in continuous time with applications to joint online parameter estimation and optimal sensor placement ⋮ An adaptive version for the Metropolis adjusted Langevin algorithm with a truncated drift ⋮ Stochastic Gradient Descent in Continuous Time ⋮ Stochastic Gradient Descent in Continuous Time: A Central Limit Theorem ⋮ Uncertainty Quantification for Stochastic Approximation Limits Using Chaos Expansion ⋮ Learning strict Nash equilibria through reinforcement ⋮ Structured prediction by joint kernel support estimation ⋮ Subspace-based fault detection algorithms for vibration monitoring ⋮ Markovian stochastic approximation with expanding projections ⋮ Rates of convergence of adaptive step-size of stochastic approximation algorithms ⋮ Non asymptotic controls on a recursive superquantile approximation ⋮ Transient and asymptotic dynamics of reinforcement learning in games ⋮ Strong averaging principle for two-time-scale stochastic McKean-Vlasov equations ⋮ Adaptive stepsize selection for tracking in a regime-switching environment ⋮ Extensions of stochastic optimization results to problems with system failure probability functions ⋮ Adaptive sampling of large deviations ⋮ Optimal consumption under uncertainty, liquidity constraints, and bounded rationality ⋮ General multilevel adaptations for stochastic approximation algorithms. II: CLTs ⋮ A Resampling-Based Stochastic Approximation Method for Analysis of Large Geostatistical Data ⋮ A behavioral stock market model ⋮ On the stability of some controlled Markov chains and its applications to stochastic approximation with Markovian dynamic ⋮ Algorithms and networks for accelerated convergence of adaptive LDA ⋮ The stochastic approximation method for estimation of a distribution function ⋮ Sufficient and necessary condition for the convergence of stochastic approximation algorithms ⋮ Coupling a stochastic approximation version of EM with an MCMC procedure ⋮ A theory on flat histogram Monte Carlo algorithms ⋮ Convergence and convergence rate of stochastic gradient search in the case of multiple and non-isolated extrema ⋮ Learning in games with unstable equilibria ⋮ Exchange rates and fundamentals under adaptive learning ⋮ Convergence of stochastic proximal gradient algorithm ⋮ Nonlinear randomized urn models: a stochastic approximation viewpoint ⋮ Design and analysis of linear precoders under a mean square error criterion. II: MMSE designs and conclusions ⋮ Avoidance of traps in stochastic approximation ⋮ Linear stochastic approximation driven by slowly varying Markov chains ⋮ An actor-critic algorithm for constrained Markov decision processes ⋮ A dual purpose principal and minor component flow ⋮ Convergence of Markovian Stochastic Approximation with Discontinuous Dynamics ⋮ Streaming changepoint detection for transition matrices ⋮ Gradient free parameter estimation for hidden Markov models with intractable likelihoods ⋮ A latent discrete Markov random field approach to identifying and classifying historical forest communities based on spatial multivariate tree species counts ⋮ Stochastic approximation schemes for economic capital and risk margin computations ⋮ Online linear and quadratic discriminant analysis with adaptive forgetting for streaming classification ⋮ Asymptotically optimal quantization schemes for Gaussian processes on Hilbert spaces ⋮ On Information Distortions in Online Ratings ⋮ Lower error bounds for the stochastic gradient descent optimization algorithm: sharp convergence rates for slowly and fast decaying learning rates ⋮ Unbiased estimation of the gradient of the log-likelihood in inverse problems ⋮ Efficient stochastic optimisation by unadjusted Langevin Monte Carlo. Application to maximum marginal likelihood and empirical Bayesian estimation ⋮ Fast incremental expectation maximization for finite-sum optimization: nonasymptotic convergence ⋮ A Liapounov bound for solutions of the Poisson equation ⋮ Stochastic proximal-gradient algorithms for penalized mixed models ⋮ On the fast convergence of random perturbations of the gradient flow ⋮ Score-Based Parameter Estimation for a Class of Continuous-Time State Space Models ⋮ Invariant measures for multidimensional fractional stochastic volatility models ⋮ Fundamental design principles for reinforcement learning algorithms ⋮ Finite-Time Analysis and Restarting Scheme for Linear Two-Time-Scale Stochastic Approximation ⋮ Attainability of boundary points under reinforcement learning ⋮ Computation for latent variable model estimation: a unified stochastic proximal framework ⋮ Tackling algorithmic bias in neural-network classifiers using Wasserstein-2 regularization ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Convergence of Recursive Stochastic Algorithms Using Wasserstein Divergence ⋮ On the almost sure convergence of adaptive allocation procedures ⋮ Continuous-time reinforcement learning for robust control under worst-case uncertainty ⋮ The Barron space and the flow-induced function spaces for neural network models ⋮ Approximating quasi-stationary distributions with interacting reinforced random walks
This page was built for publication: