Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Randomly initialized EM algorithm for two-component Gaussian mixture achieves near optimality in \(O(\sqrt{n})\) iterations - MaRDI portal

Randomly initialized EM algorithm for two-component Gaussian mixture achieves near optimality in \(O(\sqrt{n})\) iterations (Q2113265)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Randomly initialized EM algorithm for two-component Gaussian mixture achieves near optimality in \(O(\sqrt{n})\) iterations
scientific article

    Statements

    Randomly initialized EM algorithm for two-component Gaussian mixture achieves near optimality in \(O(\sqrt{n})\) iterations (English)
    0 references
    0 references
    11 March 2022
    0 references
    Summary: We analyze the classical EM algorithm for parameter estimation in the symmetric two-component Gaussian mixtures in \(d\) dimensions. We show that, even in the absence of any separation between components, provided that the sample size satisfies \(n=\Omega (d \log^4 d)\), the randomly initialized EM algorithm converges to an estimate in at most \(O(\sqrt{n})\) iterations with high probability, which is at most \(O((d/n)^{1/4} \log n)\) in Euclidean distance from the true parameter and within logarithmic factors of the minimax rate of \((d/n)^{1/4}\). Both the nonparametric statistical rate and the sublinear convergence rate are direct consequences of the zero Fisher information in the worst case. Refined pointwise guarantees beyond worst-case analysis and convergence to the MLE are also shown under mild conditions. This improves the previous result of \textit{S. Balakrishnan} et al. [Ann. Stat. 45, No. 1, 77--120 (2017; Zbl 1367.62052)], which requires strong conditions on both the separation of the components and the quality of the initialization, and that of \textit{C. Daskalakis}, \textit{C. Tzamos} and \textit{M. Zampetakis} [``Ten steps of EM suffice for mixtures of two Gaussians'', Proc. Mach. Learn. Res. (PMLR) 65, 704--710 (2017)], which requires sample splitting and restarting the EM iteration.
    0 references
    EM algorithm
    0 references
    Gaussian mixture
    0 references
    minimax rates
    0 references
    convergence rate
    0 references
    random initialization
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references

    Identifiers