scientific article

From MaRDI portal

Publication:3997575

zbMath: 0752.93073, MaRDI QID: Q3997575

Michel Métivier, Pierre Priouret, Albert Benveniste

Publication date: 17 September 1992

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

zbMATH Keywords

stochastic approximation; filtering; adaptive algorithms; diffusion approximations; averaging principle; systems description

Mathematics Subject Classification ID

Asymptotic distribution theory in statistics (62E20); Filtering in stochastic control theory (93E11); Adaptive control/observation systems (93C40); Identification in stochastic control theory (93E12); Applications of stochastic analysis (to PDEs, etc.) (60H30)

Related Items (only showing first 100 items)

Stochastic approximation Hamiltonian Monte Carlo ⋮ Unbiased estimation of the gradient of the log-likelihood for a class of continuous-time state-space models ⋮ Interacting generalized Friedman's urn systems ⋮ Accelerating mini-batch SARAH by step size rules ⋮ Online adjoint methods for optimization of PDEs ⋮ A partial history of the early development of continuous-time nonlinear stochastic systems theory ⋮ Reference points and learning ⋮ Stochastic approximation to understand simple simulation models ⋮ Quantile estimation with adaptive importance sampling ⋮ Annealing stochastic approximation Monte Carlo algorithm for neural network training ⋮ Learning in perturbed asymmetric games ⋮ Optimal order placement in limit order markets ⋮ Optimal Delaunay and Voronoi Quantization Schemes for Pricing American Style Options ⋮ Approximating the operating characteristics of Bayesian uncertainty directed trial designs ⋮ Convergence of constant step stochastic gradient descent for non-smooth non-convex functions ⋮ An adaptively weighted stochastic gradient MCMC algorithm for Monte Carlo simulation and global optimization ⋮ Averaging analysis of a point process adaptive algorithm ⋮ A universal procedure for parametric frailty models ⋮ Simple and Optimal Methods for Stochastic Variational Inequalities, II: Markovian Noise and Policy Evaluation in Reinforcement Learning ⋮ Network games; adaptations to Nash-Cournot equilibrium ⋮ The method of averaged models for discrete-time adaptive systems ⋮ Constant step stochastic approximations involving differential inclusions: stability, long-run convergence and applications ⋮ Stochastic averaging principle for two-time-scale jump-diffusion SDEs under the non-Lipschitz coefficients ⋮ Weak Convergence Rates of Population Versus Single-Chain Stochastic Approximation MCMC Algorithms ⋮ An algorithm for blind equalization and synchronization ⋮ Stochastic Nelder-Mead simplex method -- a new globally convergent direct search method for simulation optimization ⋮ A generalized urn with multiple drawing and random addition ⋮ An approximation of the distribution of learning estimates in macroeconomic models ⋮ Real time estimation of stochastic volatility processes ⋮ Exchanges and measures of risks ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Bridging the gap between constant step size stochastic gradient descent and Markov chains ⋮ ESTIMATING STRUCTURAL PARAMETERS IN REGRESSION MODELS WITH ADAPTIVE LEARNING ⋮ A Concentration Bound for Stochastic Approximation via Alekseev’s Formula ⋮ Two-timescale stochastic gradient descent in continuous time with applications to joint online parameter estimation and optimal sensor placement ⋮ An adaptive version for the Metropolis adjusted Langevin algorithm with a truncated drift ⋮ Stochastic Gradient Descent in Continuous Time ⋮ Stochastic Gradient Descent in Continuous Time: A Central Limit Theorem ⋮ Uncertainty Quantification for Stochastic Approximation Limits Using Chaos Expansion ⋮ Learning strict Nash equilibria through reinforcement ⋮ Structured prediction by joint kernel support estimation ⋮ Subspace-based fault detection algorithms for vibration monitoring ⋮ Markovian stochastic approximation with expanding projections ⋮ Rates of convergence of adaptive step-size of stochastic approximation algorithms ⋮ Non asymptotic controls on a recursive superquantile approximation ⋮ Transient and asymptotic dynamics of reinforcement learning in games ⋮ Strong averaging principle for two-time-scale stochastic McKean-Vlasov equations ⋮ Adaptive stepsize selection for tracking in a regime-switching environment ⋮ Extensions of stochastic optimization results to problems with system failure probability functions ⋮ Adaptive sampling of large deviations ⋮ Optimal consumption under uncertainty, liquidity constraints, and bounded rationality ⋮ General multilevel adaptations for stochastic approximation algorithms. II: CLTs ⋮ A Resampling-Based Stochastic Approximation Method for Analysis of Large Geostatistical Data ⋮ A behavioral stock market model ⋮ On the stability of some controlled Markov chains and its applications to stochastic approximation with Markovian dynamic ⋮ Algorithms and networks for accelerated convergence of adaptive LDA ⋮ The stochastic approximation method for estimation of a distribution function ⋮ Sufficient and necessary condition for the convergence of stochastic approximation algorithms ⋮ Coupling a stochastic approximation version of EM with an MCMC procedure ⋮ A theory on flat histogram Monte Carlo algorithms ⋮ Convergence and convergence rate of stochastic gradient search in the case of multiple and non-isolated extrema ⋮ Learning in games with unstable equilibria ⋮ Exchange rates and fundamentals under adaptive learning ⋮ Convergence of stochastic proximal gradient algorithm ⋮ Nonlinear randomized urn models: a stochastic approximation viewpoint ⋮ Design and analysis of linear precoders under a mean square error criterion. II: MMSE designs and conclusions ⋮ Avoidance of traps in stochastic approximation ⋮ Linear stochastic approximation driven by slowly varying Markov chains ⋮ An actor-critic algorithm for constrained Markov decision processes ⋮ A dual purpose principal and minor component flow ⋮ Convergence of Markovian Stochastic Approximation with Discontinuous Dynamics ⋮ Streaming changepoint detection for transition matrices ⋮ Gradient free parameter estimation for hidden Markov models with intractable likelihoods ⋮ A latent discrete Markov random field approach to identifying and classifying historical forest communities based on spatial multivariate tree species counts ⋮ Stochastic approximation schemes for economic capital and risk margin computations ⋮ Online linear and quadratic discriminant analysis with adaptive forgetting for streaming classification ⋮ Asymptotically optimal quantization schemes for Gaussian processes on Hilbert spaces ⋮ On Information Distortions in Online Ratings ⋮ Lower error bounds for the stochastic gradient descent optimization algorithm: sharp convergence rates for slowly and fast decaying learning rates ⋮ Unbiased estimation of the gradient of the log-likelihood in inverse problems ⋮ Efficient stochastic optimisation by unadjusted Langevin Monte Carlo. Application to maximum marginal likelihood and empirical Bayesian estimation ⋮ Fast incremental expectation maximization for finite-sum optimization: nonasymptotic convergence ⋮ A Liapounov bound for solutions of the Poisson equation ⋮ Stochastic proximal-gradient algorithms for penalized mixed models ⋮ On the fast convergence of random perturbations of the gradient flow ⋮ Score-Based Parameter Estimation for a Class of Continuous-Time State Space Models ⋮ Invariant measures for multidimensional fractional stochastic volatility models ⋮ Fundamental design principles for reinforcement learning algorithms ⋮ Finite-Time Analysis and Restarting Scheme for Linear Two-Time-Scale Stochastic Approximation ⋮ Attainability of boundary points under reinforcement learning ⋮ Computation for latent variable model estimation: a unified stochastic proximal framework ⋮ Tackling algorithmic bias in neural-network classifiers using Wasserstein-2 regularization ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Convergence of Recursive Stochastic Algorithms Using Wasserstein Divergence ⋮ On the almost sure convergence of adaptive allocation procedures ⋮ Continuous-time reinforcement learning for robust control under worst-case uncertainty ⋮ The Barron space and the flow-induced function spaces for neural network models ⋮ Approximating quasi-stationary distributions with interacting reinforced random walks