Suboptimal coverings for continuous spaces of control tasks
From MaRDI portal
Publication:6366004
arXiv2104.11865MaRDI QIDQ6366004
Gaurav S. Sukhatme, James A. Preiss
Publication date: 23 April 2021
Abstract: We propose the {alpha}-suboptimal covering number to characterize multi-task control problems where the set of dynamical systems and/or cost functions is infinite, analogous to the cardinality of finite task sets. This notion may help quantify the function class expressiveness needed to represent a good multi-task policy, which is important for learning-based control methods that use parameterized function approximation. We study suboptimal covering numbers for linear dynamical systems with quadratic cost (LQR problems) and construct a class of multi-task LQR problems amenable to analysis. For the scalar case, we show logarithmic dependence on the "breadth" of the space. For the matrix case, we present experiments 1) measuring the efficiency of a particular constructive cover, and 2) visualizing the behavior of two candidate systems for the lower bound.
Has companion code repository: https://github.com/jpreiss/suboptimal_coverings
This page was built for publication: Suboptimal coverings for continuous spaces of control tasks
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6366004)