scientific article - MaRDI portal

From MaRDI portal

Publication:3527701

zbMath: 1181.62119, MaRDI QID: Q3527701

Vivek S. Borkar

Publication date: 29 September 2008

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

zbMATH Keywords

stability; functional central limit theorem; collective phenomena; stochastic fixed point iterations


Related Items (only showing first 100 items)

A Biologically Plausible Neural Network for Multichannel Canonical Correlation Analysis ⋮ Reinforcement learning, sequential Monte Carlo and the EM algorithm ⋮ Some Limit Properties of Markov Chains Induced by Recursive Stochastic Algorithms ⋮ An incremental off-policy search in a model-free Markov decision process using a single sample path ⋮ A constrained optimization perspective on actor-critic algorithms and application to network routing ⋮ An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method ⋮ Broadcast control of multi-agent systems ⋮ Reference points and learning ⋮ Stochastic approximation to understand simple simulation models ⋮ Tuning positive feedback for signal detection in noisy dynamic environments ⋮ Convergence and efficiency of adaptive importance sampling techniques with partial biasing ⋮ On Best-Response Dynamics in Potential Games ⋮ Multiscale Q-learning with linear function approximation ⋮ Estimating the position of a moving object based on test disturbance of camera position ⋮ Learning to control a structured-prediction decoder for detection of HTTP-layer DDoS attackers ⋮ Unified reinforcement Q-learning for mean field game and control problems ⋮ Linear Convergence of Comparison-based Step-size Adaptive Randomized Search via Stability of Markov Chains ⋮ Revisiting SIR in the Age of COVID-19: Explicit Solutions and Control Problems ⋮ Stochastic Multilevel Composition Optimization Algorithms with Level-Independent Convergence Rates ⋮ A Finite Memory Interacting Pólya Contagion Network and Its Approximating Dynamical Systems ⋮ ASTRO-DF: A Class of Adaptive Sampling Trust-Region Algorithms for Derivative-Free Stochastic Optimization ⋮ Stochastic Methods for Composite and Weakly Convex Optimization Problems ⋮ Distributed Stochastic Approximation with Local Projections ⋮ A convergence analysis of the perturbed compositional gradient flow: averaging principle and normal deviations ⋮ Oja's algorithm for graph clustering, Markov spectral decomposition, and risk sensitive control ⋮ Stochastic recursive inclusions with non-additive iterate-dependent Markov noise ⋮ Pseudo-perturbation-based broadcast control of multi-agent systems ⋮ Constant step stochastic approximations involving differential inclusions: stability, long-run convergence and applications ⋮ Distributed caching over heterogeneous mobile networks ⋮ An adaptive learning model with foregone payoff information ⋮ A stability criterion for two timescale stochastic approximation schemes ⋮ Stochastic approximation with long range dependent and heavy tailed noise ⋮ Stability and delay of distributed scheduling algorithms for networks of conflicting queues ⋮ Asymptotic behavior of truncated stochastic approximation procedures ⋮ A Q-Learning Algorithm for Discrete-Time Linear-Quadratic Control with Random Parameters of Unknown Distribution: Convergence and Stabilization ⋮ Robust adaptive dynamic programming for linear and nonlinear systems: an overview ⋮ An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes ⋮ Truncated stochastic approximation with moving bounds: convergence ⋮ Error bounds for constant step-size \(Q\)-learning ⋮ Stochastic fictitious play with continuous action sets ⋮ A stopping rule for stochastic approximation ⋮ On Sampling Rates in Simulation-Based Recursions ⋮ Reinforcement learning algorithms with function approximation: recent advances and applications ⋮ A mean field approach for optimization in discrete time ⋮ Two-timescale stochastic gradient descent in continuous time with applications to joint online parameter estimation and optimal sensor placement ⋮ Weak convergence of dynamical systems in two timescales ⋮ Unnamed Item ⋮ An online actor-critic algorithm with function approximation for constrained Markov decision processes ⋮ Approachability in Stackelberg stochastic games with vector costs ⋮ On stochastic gradient and subgradient methods with adaptive steplength sequences ⋮ Random algorithms for convex minimization problems ⋮ Stabilization of stochastic approximation by step size adaptation ⋮ A stochastic Kaczmarz algorithm for network tomography ⋮ Markovian stochastic approximation with expanding projections ⋮ Reinforcement learning behaviors in sponsored search ⋮ Rejoinder to ‘Reinforcement learning behaviors in sponsored search’ ⋮ Newton-based stochastic optimization using \(q\)-Gaussian smoothed functional algorithms ⋮ Deceptive Reinforcement Learning Under Adversarial Manipulations on Cost Signals ⋮ On learning dynamics underlying the evolution of learning rules ⋮ Negatively reinforced balanced urn schemes ⋮ On Generalized Bellman Equations and Temporal-Difference Learning ⋮ A concentration bound for contractive stochastic approximation ⋮ Prospect-theoretic Q-learning ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Near-optimal stochastic approximation for online principal component estimation ⋮ Q-learning for Markov decision processes with a satisfiability criterion ⋮ Simulation optimization of risk measures with adaptive risk levels ⋮ Rate of convergence of truncated stochastic approximation procedures with moving bounds ⋮ Risk-Constrained Reinforcement Learning with Percentile Risk Criteria ⋮ Simultaneous perturbation Newton algorithms for simulation optimization ⋮ High-dimensional exploratory item factor analysis by a Metropolis-Hastings Robbins-Monro algorithm ⋮ The Borkar-Meyn theorem for asynchronous stochastic approximations ⋮ Robust adaptive Metropolis algorithm with coerced acceptance rate ⋮ Finite dimensional approximation and Newton-based algorithm for stochastic approximation in Hilbert space ⋮ Convergence and convergence rate of stochastic gradient search in the case of multiple and non-isolated extrema ⋮ Stochastic approximation on Riemannian manifolds ⋮ Unnamed Item ⋮ Optimal stochastic extragradient schemes for pseudomonotone stochastic variational inequality problems and their variants ⋮ Stochastic First-Order Methods with Random Constraint Projection ⋮ Convergence of Markovian Stochastic Approximation with Discontinuous Dynamics ⋮ Some Examples of Stochastic Approximation in Communications ⋮ Conservative set valued fields, automatic differentiation, stochastic gradient methods and deep learning ⋮ Stochastic subgradient method converges on tame functions ⋮ Inexact stochastic subgradient projection method for stochastic equilibrium problems with nonmonotone bifunctions: application to expected risk minimization in machine learning ⋮ Incremental without replacement sampling in nonconvex optimization ⋮ Nonlinear Gossip ⋮ Optimal survey schemes for stochastic gradient descent with applications to M-estimation ⋮ Stochastic approximation search algorithms with randomization at the input ⋮ Convergence results on stochastic adaptive learning ⋮ Fast incremental expectation maximization for finite-sum optimization: nonasymptotic convergence ⋮ A Stochastic Subgradient Method for Nonsmooth Nonconvex Multilevel Composition Optimization ⋮ Finite-Time Performance of Distributed Temporal-Difference Learning with Linear Function Approximation ⋮ On the fast convergence of random perturbations of the gradient flow ⋮ Financial replicator dynamics: emergence of systemic-risk-averting strategies ⋮ Non-asymptotic error bounds for constant stepsize stochastic approximation for tracking mobile agents ⋮ Incremental constraint projection methods for variational inequalities ⋮ Convergence of Recursive Stochastic Algorithms Using Wasserstein Divergence ⋮ Distributed Bregman-Distance Algorithms for Min-Max Optimization ⋮ On Gradient-Based Learning in Continuous Games
