Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
SMART: a subspace clustering algorithm that automatically identifies the appropriate number of clusters - MaRDI portal

SMART: a subspace clustering algorithm that automatically identifies the appropriate number of clusters (Q1046581)

From MaRDI portal





scientific article; zbMATH DE number 5651380
Language Label Description Also known as
English
SMART: a subspace clustering algorithm that automatically identifies the appropriate number of clusters
scientific article; zbMATH DE number 5651380

    Statements

    SMART: a subspace clustering algorithm that automatically identifies the appropriate number of clusters (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    22 December 2009
    0 references
    Summary: This paper presents a subspace \(k\)-means clustering algorithm for high-dimensional data with automatic selection of \(k\). A new penalty term is introduced to the objective function of the fuzzy k-means clustering process to enable several clusters to compete for objects, which leads to merging some cluster centres and the identification of the `true' number of clusters. The algorithm determines the number of clusters in a dataset by adjusting the penalty term factor. A subspace cluster validation index is proposed and employed to verify the subspace clustering results generated by the algorithm. The experimental results from both the synthetic and real data have demonstrated that the algorithm is effective in producing consistent clustering results and the correct number of clusters. Some real datasets are used to demonstrate how the proposed algorithm can determine interesting sub-clusters in the datasets.
    0 references
    data mining
    0 references
    subspace clustering
    0 references
    fuzzy \(k\)-means
    0 references
    cluster numbers
    0 references
    weighting
    0 references
    high-dimensional data
    0 references

    Identifiers