A partition based method for finding highly correlated pairs (Q615683)

scientific article; zbMATH DE number 5832979
Language: English
Label: A partition based method for finding highly correlated pairs
Description: scientific article; zbMATH DE number 5832979

    Statements

    A partition based method for finding highly correlated pairs (English)
    6 January 2011
    Summary: The problem of finding highly correlated pairs is to output all item pairs whose (Pearson) correlation coefficients exceed a user-specified correlation threshold. Effective discovery of such item pairs is of primary importance in many real data mining applications. The Taper algorithm was recently developed to discover the set of highly correlated item pairs. In this paper, we present a generalised Taper algorithm that finds strongly correlated item pairs by partitioning the collection of transactions into different segments, so as to achieve a better pruning effect and a shorter running time. Consequently, it can be proved that both the naive algorithm and the Taper algorithm are special cases of our new algorithm with respect to the number of segments. Experimental results on real datasets demonstrate the feasibility and superiority of our algorithm.
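
    The problem statement in the summary can be illustrated with a minimal Python sketch. It is not the paper's algorithm: it shows only the naive all-pairs baseline, plus an optional support-based upper-bound filter of the kind associated with Taper. The segment partitioning is not shown, because this record does not describe it. The function names (`phi`, `upper_bound`, `correlated_pairs`) and the representation of transactions as sets of items are assumptions made for illustration.

```python
from itertools import combinations
from math import sqrt


def phi(n, n_a, n_b, n_ab):
    """Pearson correlation (phi coefficient) of two binary items, from counts.

    n: number of transactions; n_a, n_b: transactions containing each item;
    n_ab: transactions containing both.
    """
    denom = sqrt(n_a * n_b * (n - n_a) * (n - n_b))
    if denom == 0:
        # An item that occurs in no or in all transactions has zero variance;
        # this sketch treats such pairs as uncorrelated (an assumption).
        return 0.0
    return (n * n_ab - n_a * n_b) / denom


def upper_bound(n, n_a, n_b):
    """Largest phi possible given only the two support counts.

    Obtained by setting n_ab to its maximum, min(n_a, n_b).
    """
    hi, lo = max(n_a, n_b), min(n_a, n_b)
    if lo == 0 or hi == n:
        return 0.0
    return sqrt(lo / hi) * sqrt((n - hi) / (n - lo))


def correlated_pairs(transactions, threshold, prune=True):
    """Return (item_a, item_b, phi) for all pairs with phi > threshold."""
    n = len(transactions)
    # Map each item to the set of transaction ids that contain it.
    tids = {}
    for tid, transaction in enumerate(transactions):
        for item in transaction:
            tids.setdefault(item, set()).add(tid)

    result = []
    for a, b in combinations(sorted(tids), 2):
        n_a, n_b = len(tids[a]), len(tids[b])
        # Skip the exact computation when the bound already rules the pair out.
        if prune and upper_bound(n, n_a, n_b) <= threshold:
            continue
        r = phi(n, n_a, n_b, len(tids[a] & tids[b]))
        if r > threshold:
            result.append((a, b, r))
    return result


transactions = [
    {"a", "b"}, {"a", "b"}, {"a", "b", "c"}, {"c"}, {"c", "d"}, {"d"},
]
pairs = correlated_pairs(transactions, threshold=0.5)
```

    With `prune=False` the function reduces to the naive baseline, which computes the coefficient for every pair. The filter only skips pairs whose bound is at or below the threshold, so both settings should return the same pairs.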
    correlation
    association rules
    Pearson correlation coefficients
    transactional databases
    data mining
    partition
    highly correlated pairs

    Identifiers