Influential features PCA for high dimensional clustering (Q510669)
From MaRDI portal
scientific article
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Influential features PCA for high dimensional clustering |
scientific article |
Statements
Influential features PCA for high dimensional clustering (English)
0 references
13 February 2017
0 references
Starting from the problem of clustering using gene microarray data, the paper approaches the situation when the feature vectors come from different classes the labels of which are unknown. The authors propose as solution to this problem the influential features PCA (IF-PCA) technique as a new spectral clustering method, along with the Kolmogorov-Smirnov (K-S) score. Since the performance of IF-PCA depends on the choice of the corresponding threshold, the Higher Criticism (H-C) technique is used accordingly. The model is applied to ten different microarray medical datasets (brain, breast cancer, leukemia, etc.) and compared with other clustering methods, and the method is proved to be efficient.
0 references
empirical null
0 references
feature selection
0 references
gene microarray
0 references
Hamming distance
0 references
phase transition
0 references
post-selection spectral clustering
0 references
sparsity
0 references