HPAARM: Hybrid Parallel Algorithm for Association Rule Mining
Main Article Content
Abstract
Data mining is one of the vast areas of research and nowadays the research is going on the most important techniques for decision making processing in data mining. Discovering patterns or frequent episodes in transactions is an important problem in data mining for the purpose of inferring rules from them. So, mining association rules is considered as powerful technique in the data mining process. The problem of mining association rules is composed of finding the large itemsets and to generate the association rules from these itemsets. To find the large itemsets, the dataset must be scanned many times. Many algorithms have been developed to increase the performance of mining association rules through reducing the number of scans over the dataset. In this paper, we aims to enhance and optimize the process even further by developing techniques to reduce the number of database scans to just only once. To deal with the huge size of the data, we have designed a parallel algorithm for reducing both the execution time and the number of scans over the database, in order to minify I/O overheads as much as possible. In this paper, we introduce some approaches for the implementation of two basic algorithms for association rules discovery (namely Apriori and Eclat). Our approach combines efficient data structures (Radix Trees) to code different key information (line indexes, candidates). We also introduced different types of efficient data structures and their merits and de-merits of using them in deducting association rules.
Â
Keywords: Data mining; Patterns, Association Rules; Parallel Algorithm Item Sets; Apriori; Eclat; Radix Trees; Line Indexes;
Downloads
Article Details
COPYRIGHT
Submission of a manuscript implies: that the work described has not been published before, that it is not under consideration for publication elsewhere; that if and when the manuscript is accepted for publication, the authors agree to automatic transfer of the copyright to the publisher.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work
- The journal allows the author(s) to retain publishing rights without restrictions.
- The journal allows the author(s) to hold the copyright without restrictions.