Adaptive stepsizes for recursive estimation with applications in approximate dynamic programming

Publication:851872

DOI: 10.1007/s10994-006-8365-9
zbMath: 1475.90122
OpenAlex: W2146917784
MaRDI QID: Q851872

Warren B. Powell, Abraham P. George

Publication date: 22 November 2006

Published in: Machine Learning

Full work available at URL: https://doi.org/10.1007/s10994-006-8365-9




Related Items (21)

A stochastic successive minimization method for nonsmooth nonconvex optimization with applications to transceiver design in wireless communication networks
ASD+M: automatic parameter tuning in stochastic optimization and on-line learning
Approximate dynamic programming for lateral transshipment problems in multi-location inventory systems
A stochastic gradient method for a class of nonlinear PDE-constrained optimal control problems under uncertainty
Block-cyclic stochastic coordinate descent for deep neural networks
Cross-docking based factory logistics unitisation process: an approximate dynamic programming approach
Reinforcement learning algorithms with function approximation: recent advances and applications
Minimizing total tardiness in a stochastic single machine scheduling problem using approximate dynamic programming
Integrated condition-based maintenance and multi-item lot-sizing with stochastic demand
Stochastic model predictive control with adaptive constraint tightening for non-conservative chance constraints satisfaction
Benchmarking a Scalable Approximate Dynamic Programming Algorithm for Stochastic Control of Grid-Level Energy Storage
Autonomous reinforcement learning with experience replay
A unified framework for stochastic optimization
Scalable estimation strategies based on stochastic approximations: classical results and new insights
A Stochastic Line Search Method with Expected Complexity Analysis
Risk-Averse Approximate Dynamic Programming with Quantile-Based Risk Measures
Bayesian Exploration for Approximate Dynamic Programming
Projected Stochastic Gradients for Convex Constrained Problems in Hilbert Spaces
Probabilistic Line Searches for Stochastic Optimization
Convergence Rates and Decoupling in Linear Stochastic Approximation Algorithms
An inexact restoration-nonsmooth algorithm with variable accuracy for stochastic nonsmooth convex optimization problems in machine learning and stochastic linear complementarity problems




