A partition based method for finding highly correlated pairs (Q615683)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: A partition based method for finding highly correlated pairs |
scientific article; zbMATH DE number 5832979
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | A partition based method for finding highly correlated pairs |
scientific article; zbMATH DE number 5832979 |
Statements
A partition based method for finding highly correlated pairs (English)
0 references
6 January 2011
0 references
Summary: The problem of finding highly correlated pairs is to output all item pairs whose (Pearson) correlation coefficients are greater than a user-specified correlation threshold. Effective discovery of such item pairs is of primary importance in many real data mining applications. Algorithm and Taper algorithm are special cases of our new algorithm with respect to the number of segments. Experimental results on real datasets demonstrate the feasibility and superiority of our algorithm. Recently, the Taper algorithm is developed to discover the set of highly correlated item pairs. In this paper, we present a generalised Taper algorithm to find strongly correlated pairs between items by partitioning the collection of transactions into different segments, so as to achieve better pruning effect and less running time. Consequently, it can be proved that both are naive.
0 references
correlation
0 references
association rules
0 references
Pearson correlation coefficients
0 references
transactional databases
0 references
data mining
0 references
partition
0 references
highly correlated pairs
0 references