scientific article
From MaRDI portal
Publication:3527701
zbMath: 1181.62119, MaRDI QID: Q3527701
Publication date: 29 September 2008
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Related Items (only showing first 100 items)
A Biologically Plausible Neural Network for Multichannel Canonical Correlation Analysis ⋮ Reinforcement learning, sequential Monte Carlo and the EM algorithm ⋮ Some Limit Properties of Markov Chains Induced by Recursive Stochastic Algorithms ⋮ An incremental off-policy search in a model-free Markov decision process using a single sample path ⋮ A constrained optimization perspective on actor-critic algorithms and application to network routing ⋮ An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method ⋮ Broadcast control of multi-agent systems ⋮ Reference points and learning ⋮ Stochastic approximation to understand simple simulation models ⋮ Tuning positive feedback for signal detection in noisy dynamic environments ⋮ Convergence and efficiency of adaptive importance sampling techniques with partial biasing ⋮ On Best-Response Dynamics in Potential Games ⋮ Multiscale Q-learning with linear function approximation ⋮ Estimating the position of a moving object based on test disturbance of camera position ⋮ Learning to control a structured-prediction decoder for detection of HTTP-layer DDoS attackers ⋮ Unified reinforcement Q-learning for mean field game and control problems ⋮ Linear Convergence of Comparison-based Step-size Adaptive Randomized Search via Stability of Markov Chains ⋮ Revisiting SIR in the Age of COVID-19: Explicit Solutions and Control Problems ⋮ Stochastic Multilevel Composition Optimization Algorithms with Level-Independent Convergence Rates ⋮ A Finite Memory Interacting Pólya Contagion Network and Its Approximating Dynamical Systems ⋮ ASTRO-DF: A Class of Adaptive Sampling Trust-Region Algorithms for Derivative-Free Stochastic Optimization ⋮ Stochastic Methods for Composite and Weakly Convex Optimization Problems ⋮ Distributed Stochastic Approximation with Local Projections ⋮ A convergence analysis of the perturbed compositional gradient flow: averaging principle and normal deviations ⋮ 
Oja's algorithm for graph clustering, Markov spectral decomposition, and risk sensitive control ⋮ Stochastic recursive inclusions with non-additive iterate-dependent Markov noise ⋮ Pseudo-perturbation-based broadcast control of multi-agent systems ⋮ Constant step stochastic approximations involving differential inclusions: stability, long-run convergence and applications ⋮ Distributed caching over heterogeneous mobile networks ⋮ An adaptive learning model with foregone payoff information ⋮ A stability criterion for two timescale stochastic approximation schemes ⋮ Stochastic approximation with long range dependent and heavy tailed noise ⋮ Stability and delay of distributed scheduling algorithms for networks of conflicting queues ⋮ Asymptotic behavior of truncated stochastic approximation procedures ⋮ A Q-Learning Algorithm for Discrete-Time Linear-Quadratic Control with Random Parameters of Unknown Distribution: Convergence and Stabilization ⋮ Robust adaptive dynamic programming for linear and nonlinear systems: an overview ⋮ An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes ⋮ Truncated stochastic approximation with moving bounds: convergence ⋮ Error bounds for constant step-size \(Q\)-learning ⋮ Stochastic fictitious play with continuous action sets ⋮ A stopping rule for stochastic approximation ⋮ On Sampling Rates in Simulation-Based Recursions ⋮ Reinforcement learning algorithms with function approximation: recent advances and applications ⋮ A mean field approach for optimization in discrete time ⋮ Two-timescale stochastic gradient descent in continuous time with applications to joint online parameter estimation and optimal sensor placement ⋮ Weak convergence of dynamical systems in two timescales ⋮ Unnamed Item ⋮ An online actor-critic algorithm with function approximation for constrained Markov decision processes ⋮ Approachability in Stackelberg stochastic games with vector costs ⋮
On stochastic gradient and subgradient methods with adaptive steplength sequences ⋮ Random algorithms for convex minimization problems ⋮ Stabilization of stochastic approximation by step size adaptation ⋮ A stochastic Kaczmarz algorithm for network tomography ⋮ Markovian stochastic approximation with expanding projections ⋮ Reinforcement learning behaviors in sponsored search ⋮ Rejoinder to ‘Reinforcement learning behaviors in sponsored search’ ⋮ Newton-based stochastic optimization using \(q\)-Gaussian smoothed functional algorithms ⋮ Deceptive Reinforcement Learning Under Adversarial Manipulations on Cost Signals ⋮ On learning dynamics underlying the evolution of learning rules ⋮ Negatively reinforced balanced urn schemes ⋮ On Generalized Bellman Equations and Temporal-Difference Learning ⋮ A concentration bound for contractive stochastic approximation ⋮ Prospect-theoretic Q-learning ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Near-optimal stochastic approximation for online principal component estimation ⋮ Q-learning for Markov decision processes with a satisfiability criterion ⋮ Simulation optimization of risk measures with adaptive risk levels ⋮ Rate of convergence of truncated stochastic approximation procedures with moving bounds ⋮ Risk-Constrained Reinforcement Learning with Percentile Risk Criteria ⋮ Simultaneous perturbation Newton algorithms for simulation optimization ⋮ High-dimensional exploratory item factor analysis by a Metropolis-Hastings Robbins-Monro algorithm ⋮ The Borkar-Meyn theorem for asynchronous stochastic approximations ⋮ Robust adaptive Metropolis algorithm with coerced acceptance rate ⋮ Finite dimensional approximation and Newton-based algorithm for stochastic approximation in Hilbert space ⋮ Convergence and convergence rate of stochastic gradient search in the case of multiple and non-isolated extrema ⋮ Stochastic approximation on Riemannian manifolds ⋮ Unnamed Item ⋮
Optimal stochastic extragradient schemes for pseudomonotone stochastic variational inequality problems and their variants ⋮ Stochastic First-Order Methods with Random Constraint Projection ⋮ Convergence of Markovian Stochastic Approximation with Discontinuous Dynamics ⋮ Some Examples of Stochastic Approximation in Communications ⋮ Conservative set valued fields, automatic differentiation, stochastic gradient methods and deep learning ⋮ Stochastic subgradient method converges on tame functions ⋮ Inexact stochastic subgradient projection method for stochastic equilibrium problems with nonmonotone bifunctions: application to expected risk minimization in machine learning ⋮ Incremental without replacement sampling in nonconvex optimization ⋮ Nonlinear Gossip ⋮ Optimal survey schemes for stochastic gradient descent with applications to M-estimation ⋮ Stochastic approximation search algorithms with randomization at the input ⋮ Convergence results on stochastic adaptive learning ⋮ Fast incremental expectation maximization for finite-sum optimization: nonasymptotic convergence ⋮ A Stochastic Subgradient Method for Nonsmooth Nonconvex Multilevel Composition Optimization ⋮ Finite-Time Performance of Distributed Temporal-Difference Learning with Linear Function Approximation ⋮ On the fast convergence of random perturbations of the gradient flow ⋮ Financial replicator dynamics: emergence of systemic-risk-averting strategies ⋮ Non-asymptotic error bounds for constant stepsize stochastic approximation for tracking mobile agents ⋮ Incremental constraint projection methods for variational inequalities ⋮ Convergence of Recursive Stochastic Algorithms Using Wasserstein Divergence ⋮ Distributed Bregman-Distance Algorithms for Min-Max Optimization ⋮ On Gradient-Based Learning in Continuous Games