Distributed Bootstrap for Simultaneous Inference Under High Dimensionality
From MaRDI portal
Publication:6361086
arXiv2102.10080MaRDI QIDQ6361086
Author name not available (Why is that?)
Publication date: 19 February 2021
Abstract: We propose a distributed bootstrap method for simultaneous inference on high-dimensional massive data that are stored and processed with many machines. The method produces an -norm confidence region based on a communication-efficient de-biased lasso, and we propose an efficient cross-validation approach to tune the method at every iteration. We theoretically prove a lower bound on the number of communication rounds that warrants the statistical accuracy and efficiency. Furthermore, only increases logarithmically with the number of workers and the intrinsic dimensionality, while nearly invariant to the nominal dimensionality. We test our theory by extensive simulation studies, and a variable screening task on a semi-synthetic dataset based on the US Airline On-Time Performance dataset. The code to reproduce the numerical results is available at GitHub: https://github.com/skchao74/Distributed-bootstrap.
Has companion code repository: https://github.com/skchao74/Distributed-bootstrap
This page was built for publication: Distributed Bootstrap for Simultaneous Inference Under High Dimensionality
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6361086)