New and efficient algorithms for producing frequent itemsets with the Map-Reduce framework (Q1712055)

From MaRDI portal

scientific article; zbMATH DE number 7003845

    Statements

    New and efficient algorithms for producing frequent itemsets with the Map-Reduce framework (English)
    21 January 2019
    Summary: The Map-Reduce (MR) framework has become a popular choice for developing new parallel algorithms for Big Data. Designing efficient algorithms for mining big data and distributed databases has become an important problem. In this paper we focus on algorithms that produce association rules and frequent itemsets. After reviewing the most recent algorithms that perform this task within the MR framework, we present two new algorithms: one for producing closed frequent itemsets, and one for producing frequent itemsets when the database is updated and new data is added to the old database. Both algorithms include novel optimizations that are suited to the MR framework as well as to other parallel architectures. A detailed experimental evaluation shows the effectiveness and advantages of the algorithms over existing methods on large distributed databases.
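
To make the terms in the summary concrete, the following is a minimal sketch of frequent-itemset counting in a map-reduce style, followed by a filter for closed itemsets. It is not the paper's algorithm: it runs in a single process, enumerates every k-item subset without Apriori-style candidate pruning, and all function names (`map_phase`, `reduce_phase`, `frequent_itemsets`, `closed_itemsets`) and the toy transactions are assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def map_phase(transaction, k):
    """Emit (itemset, 1) for every k-item subset of one transaction."""
    for itemset in combinations(sorted(set(transaction)), k):
        yield itemset, 1


def reduce_phase(pairs, min_support):
    """Sum the counts per itemset and keep those meeting min_support."""
    counts = Counter()
    for itemset, count in pairs:
        counts[itemset] += count
    return {s: c for s, c in counts.items() if c >= min_support}


def frequent_itemsets(transactions, min_support):
    """Run one simulated map-reduce round per itemset size k."""
    result = {}
    k = 1
    while True:
        pairs = (pair for t in transactions for pair in map_phase(t, k))
        level = reduce_phase(pairs, min_support)
        if not level:  # no frequent itemsets of size k, so stop
            return result
        result.update(level)
        k += 1


def closed_itemsets(frequent):
    """Keep itemsets that have no proper superset with the same support."""
    return {
        s: c
        for s, c in frequent.items()
        if not any(set(s) < set(t) and c == frequent[t] for t in frequent)
    }


# Hypothetical toy database of four transactions.
transactions = [["a", "b", "c"], ["a", "b"], ["a", "c"], ["a", "b", "c"]]
frequent = frequent_itemsets(transactions, min_support=2)
closed = closed_itemsets(frequent)
```

In this toy run, `("b",)` is frequent but not closed, because its superset `("a", "b")` has the same support of 3. In a real deployment the map and reduce steps would run on separate workers; here they are plain functions so the data flow is easy to follow.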
    apriori
    MapReduce
    big data
    frequent itemsets
    closed itemsets
    incremental computation