Optimal clustering on the real line (Q1116592)

From MaRDI portal





scientific article; zbMATH DE number 4090609
Language Label Description Also known as
English
Optimal clustering on the real line
scientific article; zbMATH DE number 4090609

    Statements

    Optimal clustering on the real line (English)
    0 references
    0 references
    1988
    0 references
    A method is proposed for assessing the number of groups (clusters) in a random sample (presented by order statistics) drawn from a continuous population distributed on (a,b)\(\subseteq (-\infty,\infty)\). A sample quantile function is defined to be a piecewise linear interpolation of these order statistics. The method is based on the calculation of an asymptotic nonparametric confidence interval for the fractional reduction of the within-group error due to \((g+1)\)-grouping over g-grouping. The confidence interval theory (as well as the central limit theory) is developed for both universally optimal and bounded locally optimal groupings. The derivation makes use of a Donsker-type theorem for the quantile process that is also proven. The consistency of the estimators used in the proofs of the theorem is shown.
    0 references
    M-estimator
    0 references
    quantization
    0 references
    piecewise linear interpolation of order statistics
    0 references
    universally optimal groupings
    0 references
    number of groups
    0 references
    continuous population
    0 references
    sample quantile function
    0 references
    asymptotic nonparametric confidence interval
    0 references
    within-group error
    0 references
    central limit theory
    0 references
    bounded locally optimal groupings
    0 references
    Donsker-type theorem
    0 references
    quantile process
    0 references
    consistency
    0 references

    Identifiers