A split-and-conquer approach for analysis of
From MaRDI portal
Publication:3195165
DOI10.5705/ss.2013.088zbMath1480.62258OpenAlexW2320803774MaRDI QIDQ3195165
Publication date: 21 October 2015
Published in: Statistica Sinica (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.5705/ss.2013.088
distributed computinggeneralized linear modelsbig datapenalized regressionlarge sample theorycombining results from independent analyses
Ridge regression; shrinkage estimators (Lasso) (62J07) Generalized linear models (logistic models) (62J12) Statistical aspects of big data and data science (62R07)
Related Items
A review of distributed statistical inference, An adaptive lack of fit test for big data, A short note on fitting a single-index model with massive data, Partitioned Approach for High-dimensional Confidence Intervals with Large Split Sizes, Distributed Sufficient Dimension Reduction for Heterogeneous Massive Data, Automatic variable selection in a linear model on massive data, Composite quasi-likelihood for single-index models with massive datasets, Communication-efficient distributed estimator for generalized linear models with a diverging number of covariates, Robust distributed estimation and variable selection for massive datasets via rank regression, Score-matching representative approach for big data analysis with generalized linear models, WONDER: Weighted one-shot distributed ridge regression in high dimensions, A partially linear framework for massive heterogeneous data, Method G: Uncertainty Quantification for Distributed Data Problems Using Generalized Fiducial Inference, Least-Square Approximation for a Distributed System, Unnamed Item, Online Updating of Survival Analysis, Distributed testing and estimation under sparse high dimensional models, Inference for binomial probability based on dependent Bernoulli random variables with applications to meta-analysis and group level studies, Statistical inference in massive datasets by empirical likelihood, Multi-round smoothed composite quantile regression for distributed data, Unnamed Item, Online updating of information based model selection in the big data setting, Central subspaces review: methods and applications, Feasible algorithm for linear mixed model for massive data, On statistics, computation and scalability, Block average quantile regression for massive dataset, Communication-Efficient Distributed Linear Discriminant Analysis for Binary Classification, Composite quantile regression for massive datasets, Sufficiency Revisited: Rethinking Statistical Algorithms in the Big Data Era, Relative efficiency of using summary versus individual data in random‐effects meta‐analysis, Distributed inference for two‐sample U‐statistics in massive data analysis, Distributed estimation with empirical likelihood, Divide and conquer for accelerated failure time model with massive time‐to‐event data, Information-based optimal subdata selection for big data logistic regression, Optimal subsampling for multiplicative regression with massive data, Online updating method to correct for measurement error in big data streams, Split sample empirical likelihood, Global debiased DC estimations for biased estimators via pro forma regression, Distributed penalized modal regression for massive data, Multivariate survival analysis in big data: A divide‐and‐combine approach, Model aggregation for doubly divided data with large size and large dimension, Scalable and efficient inference via CPE, First-Order Newton-Type Estimator for Distributed Estimation and Inference, Nonparametric Fusion Learning for Multiparameters: Synthesize Inferences From Diverse Sources Using Data Depth and Confidence Distribution, Individual Data Protected Integrative Regression Analysis of High-Dimensional Heterogeneous Data, Confidence interval construction in massive data sets, Distributed smoothed rank regression with heterogeneous errors for massive data, Communication-efficient distributed estimation of partially linear additive models for large-scale data, Targeting underrepresented populations in precision medicine: a federated transfer learning approach, Communication-efficient surrogate quantile regression for non-randomly distributed system, Conditional characteristic feature screening for massive imbalanced data, Communication-Efficient Accurate Statistical Estimation, Distributed testing on mutual independence of massive multivariate data, Distributed statistical optimization for non-randomly stored big data with application to penalized learning, A framework for mediation analysis with massive data, Distributed estimation of functional linear regression with functional responses, An Asymptotic Analysis of Random Partition Based Minibatch Momentum Methods for Linear Regression Models, Divide-and-Conquer: A Distributed Hierarchical Factor Approach to Modeling Large-Scale Time Series Data, Estimation and inference in sparse multivariate regression and conditional Gaussian graphical models under an unbalanced distributed setting, Empirical likelihood ratio tests for non-nested model selection based on predictive losses, Single-index composite quantile regression for massive data, A faster algorithm to estimate multiresolution densities, Communication-efficient sparse composite quantile regression for distributed data, Distributed Bayesian posterior voting strategy for massive data, iFusion: Individualized Fusion Learning, Sparse estimation technique for digital pre-distortion of impedance-mismatched power amplifiers, A Large-Scale Constrained Joint Modeling Approach for Predicting User Activity, Engagement, and Churn With Application to Freemium Mobile Games, Testing multivariate quantile by empirical likelihood, Bottom-up estimation and top-down prediction: solar energy prediction combining information from multiple sources, A distributed one-step estimator, Robust distributed modal regression for massive data, Robust communication-efficient distributed composite quantile regression and variable selection for massive data, Estimating population size of heterogeneous populations with large data sets and a large number of parameters, High-dimensional integrative analysis with homogeneity and sparsity recovery, Partially functional linear regression with quadratic regularization, Distributed estimation of principal eigenspaces, Quantile regression under memory constraint, Distributed linear regression by averaging, Quantile regression in big data: a divide and conquer based strategy, Distributed simultaneous inference in generalized linear models via confidence distribution, A MOM-based ensemble method for robustness, subsampling and hyperparameter tuning, Distributed statistical inference for massive data, Distributed Generalized Cross-Validation for Divide-and-Conquer Kernel Ridge Regression and Its Asymptotic Optimality, Adaptive quantile regressions for massive datasets, A scalable nonparametric specification testing for massive data, Divide-and-conquer information-based optimal subdata selection algorithm, Distributed adaptive Huber regression, Projected spline estimation of the nonparametric function in high-dimensional partially linear models for massive data, A Distributed and Integrated Method of Moments for High-Dimensional Correlated Data Analysis, Unnamed Item, A Massive Data Framework for M-Estimators with Cubic-Rate, A split-and-conquer variable selection approach for high-dimensional general semiparametric models with massive data