Transaction dataset.

Source publication

An Improved Frequent Pattern-growth Approach to Discover Rare Association Rules.

Conference Paper

Full-text available

Jan 2009

In this paper we have proposed an improved approach to extract rare association rules. The association rules which involve rare items are called rare association rules. Mining rare association rules is difficult with single minimum support (minsup) based approaches like Apriori and FP-growth as they suffer from "rare item problem" dilemma. At high...

Context 1

... the dataset shown in Table 1, the extraction of frequent patterns using CFP-growth algorithm is illustrated using Example 1. For ease of explaining this example we refer the support and MIS values of the items in terms of support counts and MIS counts. ...

View in full-text

Context 2

... 1: For the transaction dataset shown in Table 1, the itemset I = {bread, ball, jam, bat, pil- low, bed, pencil, pen}. Let the MIS values (in count) for bread, ball, jam, bat, pillow, bed, pen- cil and pen be 4, 4, 3, 3, 2, 2, 2, 2. Now, using the MIS values for the items, the CFP-growth ap- proach sorts the items in descending order of their MIS values and assigns the frequency value of zero to every item. ...

View in full-text

Context 3

... L 1 contain {{bread:0}, {ball:0}, {jam:0}, {bat:0}, {pillow:0}, {bed:0}, {pencil:0}, {pen:0}}. In the first scan of the dataset shown in Table 1, the first transaction "1: bread, jam" containing two items is scanned in L 1 order i.e., {bread, jam} and the frequencies of items "bread" and "jam" are updated by 1 in L 1 . Next, a first branch of tree is constructed with two nodes, bread: 1 and jam: 1, where "bread" is linked as a child of the root and "jam" is linked as a child of "bread". ...

View in full-text

An Improved Algorithm for Mining Association Rules Using Multiple Support Values.

Conference Paper

Jan 2003

Almost all the approaches in association rule mining suggested the use of a single minimum support, technique that either rules out all infrequent itemsets or suffers from the bottleneck of generating and examining too many candidate large itemsets. In this paper we consider the combination of two well-known algorithms, namely algorithm DIC and MSA...

Mining Temporal Association Rules in Network Traffic Data

Article

Full-text available

Jan 2014

Guojun Mao

Mining association rules is one of the most important and popular task in data mining. Current researches focus on discovering frequent itemsets that is an important step to it. Many algorithms for discovering frequent itemsets have been proposed. However, for a large database, an efficient mining algorithm must be a better balance in I/O cost and...

ParallelCharMax: An Effective Maximal Frequent Itemset Mining Algorithm Based on MapReduce Framework

Conference Paper

Oct 2017

Nowadays, the explosive growth in data collection in business and scientific areas has required the need to analyze and mine useful knowledge residing in these data. The recourse to data mining techniques seems to be inescapable in order to extract useful and novel patterns/models from large datasets. In this context, frequent itemsets (patterns) play an essential role in many data mining tasks that try to find interesting patterns from datasets. However, conventional approaches for mining frequent itemsets in Big Data era encounter significant challenges when computing power and memory space are limited. This paper proposes an efficient distributed frequent itemset mining algorithm, called ParallelCharMax, that is based on a powerful sequential algorithm, called Charm, and computes the maximal frequent itemsets that are considered perfect summaries of the frequent ones. The proposed algorithm has been implemented using MapReduce framework. The experimental component of the study shows the efficiency and the performance of the proposed algorithm compared with well known algorithms such as MineWithRounds and HMBA.

An Extractive Approach for Uyghur Text Summarization

Article

Full-text available

Apr 2016

This paper studies Uyghur single text summarization and proposes some of new or improved approaches in the aspects of keyword extraction and evaluation, sentence selection and redundancy removal, also in readability improvement and so on. Proposes an improved frequent pattern-growth approach to extract the semantic strings which perfect both on its semantics and structural integrity, to evaluate this strings uses multi-feature fusion approach and select most important ones as keywords to describe the text theme effectively. In the aspect of sentence similarity and redundancy removal, proposes the idea of theme including degree, so as to effectively remove the redundant sentences and improves the summary quality significantly. Also introduces sentence alignment between the texts that after being stemming and original text, so as to solve the problems that summary naturalness, coherence and comprehensibility decline and other issues caused by stemming process.

A Novel Approach for Finding Rare Items Based on Multiple Minimum Support Framework

Article

Full-text available

Dec 2015

Pattern mining methods describe valuable and advantageous items from a large amount of records stored in the corporate datasets and repositories. While mining, literature has almost singularly focused on frequent itemset but in many applications rare ones are of higher interest. For Example medical dataset can be considered, where rare combination of prodrome plays a vital role for the physicians. As rare items contain worthwhile information, researchers are making efforts to examine effective methodologies to extract the same. In this paper, an effort is made to analyze the complete set of rare items for finding almost all possible rare association rules from the dataset. The Proposed approach makes use of Maximum constraint model for extracting the rare items. A new approach is efficient to mine rare association rules which can be defined as rules containing the rare items. Based on the study of relevant data structures of the mining space, this approach utilizes a tree structure to ascertain the rare items. Finally, it is demonstrated that this new approach is more virtuous and robust than the existing algorithms.

Mining Interesting Rare Items with Maximum Constraint Model Based on Tree Structure

Conference Paper

Full-text available

Apr 2015

Rare association rule mining provides useful information from large database. Traditional association mining techniques generate frequent rules based on frequent item sets with reference to user defined: minimum support threshold and minimum confidence threshold. It is known as support-confidence framework. As many of generated rules are of no use, further analysis is essential to find interesting Rules. Rare association rule contains Rare Items. Rare Association Rules represents unpredictable or unknown associations, so that it becomes more interesting than frequent association rule mining. The main goal of rare association rule mining is to discover relationships among set of items in a database that occurs uncommonly. We have proposed a Maximum Constraint based method for generating rare association rule with tree structure. Tentative results show that MCRP-Tree takes less time for rule generation compared to the existing algorithm as well as it finds more interesting rare items.

An effective approach to mine rare items using Maximum Constraint

Conference Paper

Full-text available

Jan 2015

Rare association rule mining provides useful information from large database. Traditional association mining techniques generate frequent rules based on frequent itemsets with reference to user defined: minimum support threshold and minimum confidence threshold. It is known as support-confidence framework. As many of generated rules are of no use, further analysis is essential to find interesting Rules. Rare association rule contains Rare Items. Rare Association Rules represents unpredictable or unknown associations, so that it becomes more interesting than frequent association rule mining. The main goal of rare association rule mining is to discover relationships among set of items in a database that occurs uncommonly. We have proposed a Maximum Constraint based method for generating rare association rule with tree structure. Tentative results show that MCRP-Tree takes less time for rule generation compared to the existing algorithm as well as it finds more interesting rare items.

A Recent Overview: Rare Association Rule Mining

Article

Dec 2014

association rules are mine useful information form large dataset. Traditional association mining methods generate frequent rules based on frequent itemsets with reference of minimum support and minimum confidence threshold which specified by user. It called as support-confidence framework. As many of generated rules are of no use, further analysis is essential to find interesting Rules. A rule that contains rare items can consider as rare association rule. Rare Association Rules Represent unpredictable or unknown association, so it is more interesting than frequent association rule. Rare association rule mining provides relationship between items which occurs uncommonly. This paper presents brief survey in the area of rare association rule mining. Keywordspattern, support, confidence, Rare Items

Frequent Pattern Mining based on Multiple Minimum Support using Uncertain Dataset

Article

Full-text available

Aug 2014

Association rule mining plays a major role in decision making in the production and sales business area. It uses minimum support (minsup) and support confidence (supconf) as a base to generate the frequent patterns and strong association rules. Setting a single value of minsup for a transaction set doesn't seem feasible for some real life applications. Similarly the probabilistic value of items in the transaction set may be acceptable. So generating the frequent pattern from the uncertain dataset becomes a concern factor. This research work details the aforesaid problem and proposes a solution for the same.

Novel techniques to reduce search space in multiple minimum supports-based frequent pattern mining algorithms

Conference Paper

Full-text available

Mar 2011

Frequent patterns are an important class of regularities that exist in a transaction database. Certain frequent patterns with low minimum support (minsup) value can provide useful information in many real-world applications. However, extraction of these frequent patterns with single minsup-based frequent pattern mining algorithms such as Apriori and FP-growth leads to "rare item problem." That is, at high minsup value, the frequent patterns with low minsup are missed, and at low minsup value, the number of frequent patterns explodes. In the literature, "multiple minsups framework" was proposed to discover frequent patterns. Furthermore, frequent pattern mining techniques such as Multiple Support Apriori and Conditional Frequent Pattern-growth (CFP-growth) algorithms have been proposed. As the frequent patterns mined with this framework do not satisfy downward closure property, the algorithms follow different types of pruning techniques to reduce the search space. In this paper, we propose an efficient CFP-growth algorithm by proposing new pruning techniques. Experimental results show that the proposed pruning techniques are effective.

Improved Multiple Minimum Support Based Approaches to Mine Frequent Patterns

Thesis

Full-text available

Jul 2010

Currently, extracting knowledge pertaining to rare cases that are hidden in the large datasets has become an important research problem. Frequent patterns are an important class of regularities that exist in a transactional database. The frequent patterns containing rare items can provide useful knowledge. It is difficult to mine frequent patterns containing both frequent and relatively infrequent (or rare) items, because, single minimum support (minsup) based frequent pattern mining approaches such as Apriori and FP-growth suffer from rare item problem. That is, at high minsup, we miss the frequent patterns containing rare items, and at low minsup, combinatorial explosion can occur, producing too many frequent patterns. To address rare item problem, efforts have been made in the literature to find frequent patterns by using "multiple minsups framework." Even though this framework address rare item problem, but still suffers from performance problems. In this thesis, we have made an effort to propose efficient approaches to extract frequent patterns containing both frequent and rare items. The contribution of thesis is as follows: (i) We have proposed the notion "support difference" and proposed an efficient methodology to extract frequent patterns containing both frequent and rare items (ii) An efficient multiple minsups-based FP-growth-like algorithm has been proposed by introducing several heuristics to minimize the search space. (iii) An improved "multiple minsups framework" has been proposed by introducing the notion called "item-to-pattern difference." (iv) We have also proposed an improved periodic-frequent pattern mining algorithm by extending the notion of "multiple constraints." ii

Mining Rare Association Rules in the Datasets with Widely Varying Items’ Frequencies

Conference Paper

Apr 2010

Rare association rule is an association rule consisting of rare items. It is difficult to mine rare association rules with a single minimum support (minsup) constraint because low minsup can result in generating too many rules in which some of them can be uninteresting. In the literature, minimum constraint model using “multiple minsup framework” was proposed to efficiently discover rare association rules. However, that model still extracts uninteresting rules if the items’ frequencies in a dataset vary widely. In this paper, we exploit the notion of “item-to-pattern difference” and propose multiple minsup based FP-growth-like approach to efficiently discover rare association rules. Experimental results show that the proposed approach is efficient.

Transaction dataset.

Contexts in source publication

Similar publications

Citations