Pooling multiple imputations when the sample happens to be the population

Publication:6255079

arXiv: 1409.8542; MaRDI QID: Q6255079

Gerko Vink, Stef van Buuren

Publication date: 30 September 2014

Abstract: Current pooling rules for multiply imputed data assume infinite populations. In some situations this assumption is not tenable because every unit in the population has been observed, and applying it can lead to over-covered population estimates. We simplify the existing pooling rules for situations where the sampling variance is not of interest. We compare these rules to the conventional pooling rules and demonstrate their use in a situation where there is no sampling variance. Using the standard pooling rules in situations where sampling variance should not be considered leads to overestimation of the variance of the estimates of interest, especially when the amount of missingness is not very large. As a result, population estimates are over-covered, which may lead to a loss of statistical power. We conclude that the theory of multiple imputation can be extended to the situation where the sample happens to be the population. The simplified pooling rules can be easily implemented to obtain valid inference in cases where we have observed essentially all units and in simulation studies addressing the missingness mechanism only.
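The contrast between the conventional and the simplified pooling rules can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation; the companion repository is the authoritative source. It assumes that the simplification amounts to dropping the average within-imputation (sampling) variance from the conventional total variance, T = Ubar + (1 + 1/m) B, which leaves T = (1 + 1/m) B. The function names and the example numbers are hypothetical.

```python
from statistics import mean, variance


def pool_rubin(estimates, within_variances):
    """Conventional pooling: total variance includes sampling variance."""
    m = len(estimates)
    q_bar = mean(estimates)            # pooled point estimate
    u_bar = mean(within_variances)     # average within-imputation variance
    b = variance(estimates)            # between-imputation variance (m - 1 denominator)
    total = u_bar + (1 + 1 / m) * b
    return q_bar, total


def pool_no_sampling_variance(estimates):
    """Assumed simplified pooling: the sample is the population, so the
    within-imputation (sampling) variance term is dropped."""
    m = len(estimates)
    q_bar = mean(estimates)
    b = variance(estimates)
    total = (1 + 1 / m) * b
    return q_bar, total


# Hypothetical estimates of one quantity from m = 5 imputed data sets.
estimates = [10.0, 10.4, 9.8, 10.2, 10.1]
within_variances = [0.04, 0.04, 0.04, 0.04, 0.04]

q_std, t_std = pool_rubin(estimates, within_variances)
q_pop, t_pop = pool_no_sampling_variance(estimates)

print("conventional:", q_std, t_std)
print("simplified:  ", q_pop, t_pop)
```

Both functions return the same point estimate; only the total variance differs, and the conventional value is larger by exactly the average within-imputation variance. Under this sketch, that extra term is what would widen the intervals when sampling variance should not be considered.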




Has companion code repository: https://github.com/gerkovink/Pooling_MI







