scientific article

From MaRDI portal
Publication:3527701

zbMath: 1181.62119; MaRDI QID: Q3527701

Vivek S. Borkar

Publication date: 29 September 2008


Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.




Related Items (only showing first 100 items)

A Biologically Plausible Neural Network for Multichannel Canonical Correlation Analysis
Reinforcement learning, sequential Monte Carlo and the EM algorithm
Some Limit Properties of Markov Chains Induced by Recursive Stochastic Algorithms
An incremental off-policy search in a model-free Markov decision process using a single sample path
A constrained optimization perspective on actor-critic algorithms and application to network routing
An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method
Broadcast control of multi-agent systems
Reference points and learning
Stochastic approximation to understand simple simulation models
Tuning positive feedback for signal detection in noisy dynamic environments
Convergence and efficiency of adaptive importance sampling techniques with partial biasing
On Best-Response Dynamics in Potential Games
Multiscale Q-learning with linear function approximation
Estimating the position of a moving object based on test disturbance of camera position
Learning to control a structured-prediction decoder for detection of HTTP-layer DDoS attackers
Unified reinforcement Q-learning for mean field game and control problems
Linear Convergence of Comparison-based Step-size Adaptive Randomized Search via Stability of Markov Chains
Revisiting SIR in the Age of COVID-19: Explicit Solutions and Control Problems
Stochastic Multilevel Composition Optimization Algorithms with Level-Independent Convergence Rates
A Finite Memory Interacting Pólya Contagion Network and Its Approximating Dynamical Systems
ASTRO-DF: A Class of Adaptive Sampling Trust-Region Algorithms for Derivative-Free Stochastic Optimization
Stochastic Methods for Composite and Weakly Convex Optimization Problems
Distributed Stochastic Approximation with Local Projections
A convergence analysis of the perturbed compositional gradient flow: averaging principle and normal deviations
Oja's algorithm for graph clustering, Markov spectral decomposition, and risk sensitive control
Stochastic recursive inclusions with non-additive iterate-dependent Markov noise
Pseudo-perturbation-based broadcast control of multi-agent systems
Constant step stochastic approximations involving differential inclusions: stability, long-run convergence and applications
Distributed caching over heterogeneous mobile networks
An adaptive learning model with foregone payoff information
A stability criterion for two timescale stochastic approximation schemes
Stochastic approximation with long range dependent and heavy tailed noise
Stability and delay of distributed scheduling algorithms for networks of conflicting queues
Asymptotic behavior of truncated stochastic approximation procedures
A Q-Learning Algorithm for Discrete-Time Linear-Quadratic Control with Random Parameters of Unknown Distribution: Convergence and Stabilization
Robust adaptive dynamic programming for linear and nonlinear systems: an overview
An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes
Truncated stochastic approximation with moving bounds: convergence
Error bounds for constant step-size \(Q\)-learning
Stochastic fictitious play with continuous action sets
A stopping rule for stochastic approximation
On Sampling Rates in Simulation-Based Recursions
Reinforcement learning algorithms with function approximation: recent advances and applications
A mean field approach for optimization in discrete time
Two-timescale stochastic gradient descent in continuous time with applications to joint online parameter estimation and optimal sensor placement
Weak convergence of dynamical systems in two timescales
Unnamed Item
An online actor-critic algorithm with function approximation for constrained Markov decision processes
Approachability in Stackelberg stochastic games with vector costs
On stochastic gradient and subgradient methods with adaptive steplength sequences
Random algorithms for convex minimization problems
Stabilization of stochastic approximation by step size adaptation
A stochastic Kaczmarz algorithm for network tomography
Markovian stochastic approximation with expanding projections
Reinforcement learning behaviors in sponsored search
Rejoinder to ‘Reinforcement learning behaviors in sponsored search’
Newton-based stochastic optimization using \(q\)-Gaussian smoothed functional algorithms
Deceptive Reinforcement Learning Under Adversarial Manipulations on Cost Signals
On learning dynamics underlying the evolution of learning rules
Negatively reinforced balanced urn schemes
On Generalized Bellman Equations and Temporal-Difference Learning
A concentration bound for contractive stochastic approximation
Prospect-theoretic Q-learning
Unnamed Item
Unnamed Item
Near-optimal stochastic approximation for online principal component estimation
Q-learning for Markov decision processes with a satisfiability criterion
Simulation optimization of risk measures with adaptive risk levels
Rate of convergence of truncated stochastic approximation procedures with moving bounds
Risk-Constrained Reinforcement Learning with Percentile Risk Criteria
Simultaneous perturbation Newton algorithms for simulation optimization
High-dimensional exploratory item factor analysis by a Metropolis-Hastings Robbins-Monro algorithm
The Borkar-Meyn theorem for asynchronous stochastic approximations
Robust adaptive Metropolis algorithm with coerced acceptance rate
Finite dimensional approximation and Newton-based algorithm for stochastic approximation in Hilbert space
Convergence and convergence rate of stochastic gradient search in the case of multiple and non-isolated extrema
Stochastic approximation on Riemannian manifolds
Unnamed Item
Optimal stochastic extragradient schemes for pseudomonotone stochastic variational inequality problems and their variants
Stochastic First-Order Methods with Random Constraint Projection
Convergence of Markovian Stochastic Approximation with Discontinuous Dynamics
Some Examples of Stochastic Approximation in Communications
Conservative set valued fields, automatic differentiation, stochastic gradient methods and deep learning
Stochastic subgradient method converges on tame functions
Inexact stochastic subgradient projection method for stochastic equilibrium problems with nonmonotone bifunctions: application to expected risk minimization in machine learning
Incremental without replacement sampling in nonconvex optimization
Nonlinear Gossip
Optimal survey schemes for stochastic gradient descent with applications to M-estimation
Stochastic approximation search algorithms with randomization at the input
Convergence results on stochastic adaptive learning
Fast incremental expectation maximization for finite-sum optimization: nonasymptotic convergence
A Stochastic Subgradient Method for Nonsmooth Nonconvex Multilevel Composition Optimization
Finite-Time Performance of Distributed Temporal-Difference Learning with Linear Function Approximation
On the fast convergence of random perturbations of the gradient flow
Financial replicator dynamics: emergence of systemic-risk-averting strategies
Non-asymptotic error bounds for constant stepsize stochastic approximation for tracking mobile agents
Incremental constraint projection methods for variational inequalities
Convergence of Recursive Stochastic Algorithms Using Wasserstein Divergence
Distributed Bregman-Distance Algorithms for Min-Max Optimization
On Gradient-Based Learning in Continuous Games



