Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
scientific article; zbMATH DE number 825585 - MaRDI portal

scientific article; zbMATH DE number 825585

From MaRDI portal
Publication:4858374

zbMath0838.60001MaRDI QIDQ4858374

Vivek S. Borkar

Publication date: 12 December 1995


Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items (50)

Tullock contests reward information advantagesAn online prediction algorithm for reinforcement learning with linear function approximation using cross entropy methodMultiscale Q-learning with linear function approximationStrong robustness to incomplete information and the uniqueness of a correlated equilibriumUncorrelatedness and orthogonality for vector-valued processesA generalized Kalman filter for fixed point approximation and efficient temporal-difference learningNonclassical total probability formula and quantum interference of probabilitiesRealLife: the continuum limit of larger than life cellular automataStrategic interaction in trend-driven dynamicsVerification of General Markov Decision Processes by Approximate Similarity Relations and Policy RefinementInvariant measures for frequently hypercyclic operatorsInformation in Tullock contestsConditional Central Limit TheoremAnnealed asymptotics for Brownian motion of renormalized potential in mobile random mediumZero-sum risk-sensitive stochastic games on a countable state spaceScaling limits for continuous opinion dynamics systemsBrownian motion and parabolic Anderson model in a renormalized Poisson potentialStabilization of stochastic approximation by step size adaptationRandomized filtering and Bellman equation in Wasserstein space for partial observation control problemLevy multiplicative chaos and star scale invariant random measuresOptimal portfolio choice for a behavioural investor in continuous-time marketsStatistical estimation of the Shannon entropyFull Gradient DQN Reinforcement Learning: A Provably Convergent SchemeRobust target localization in the absence of signal propagation modelsMild solution to parabolic Anderson model in Gaussian and Poisson potentialIndependence and atomsOn the construction and Malliavin differentiability of solutions of Lévy noise driven SDE's with singular coefficientsA further remark on dynamic programming for partially observed Markov processesSimultaneous perturbation Newton algorithms for simulation optimizationOn the Hamiltonicity Gap and doubly stochastic matricesAvoidance of traps in stochastic approximationStochastic approximation with `controlled Markov' noiseOpinion dynamics with Lotka-Volterra type interactionsStochastic approximation algorithms: overview and recent trends.Pathwise asymptotics for Volterra type stochastic volatility modelsA two Timescale Stochastic Approximation Scheme for Simulation-Based Parametric OptimizationEuclidean Gibbs measures of interacting quantum anharmonic oscillatorsBest predictors in logarithmic distance between positive random variablesSimultaneous small noise limit for singularly perturbed slow-fast coupled diffusionsModeling and estimation of stochastic transition rates in life insurance with regime switching based on generalized Cox processesStatistical estimation of conditional Shannon entropyTwo Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference LearningMultiscale Stochastic Approximation for Parametric Optimization of Hidden Markov ModelsAsymptotics of the Invariant Measure in Mean Field Models with JumpsLarge deviations for conditionally Gaussian processes: estimates of level crossing probabilitySample complexity for Markov chain self-tunerA large deviation principle for empirical measures on Polish spaces: application to singular Gibbs measures on manifoldsDynamic programming for ergodic control with partial observations.Stability of annealing schemes and related processesEmpirical Q-Value Iteration




This page was built for publication: