SlideShare una empresa de Scribd logo
1 de 5
Descargar para leer sin conexión
IOSR Journal of Computer Engineering (IOSRJCE)
ISSN: 2278-0661 Volume 3, Issue 5 (July-Aug. 2012), PP 26-30
www.iosrjournals.org
www.iosrjournals.org 26 | Page
Efficient Parallel Pruning of Associative Rules with Optimized
Search
1
K.Sangeetha, 2
Dr.P.S.Periasamy, 3
S.Prakash
1
Assistant Professor (SG), S.N.S.College of Technology, Coimbatore
,2
Professor, K.S.R.College of Engineering, Tiruchengode
3
Assistant Professor (SG,) Sri Shakthi Institute of Engineering and Technology, Coimbatore
Abstract: The main focus of this research work is to propose an improved association rule mining algorithm to
minimize the number of candidate sets while generating association rules with efficient pruning time and search
space optimization. The relative association with reduced candidate item set reduces the overall execution time.
The scalability of this work is measured with number of item sets used in the transaction and size of the data set.
Further Fuzzy based rule mining principle is adapted in this work to obtain more informative associative rules
and frequent items with increased sensitive. The requirement for sensitive items is to have a semantic connection
between the components of the item-value pairs. The effectiveness of item-value pairs minimizes the search
space to its optimality. Optimality of the search space indicates the trade off between pruning time and size of
the data set.
I. Introduction
A rapid growth of information extraction from large transactional data sets fueled the demand of
knowledge discovery and associative relation between the items. To identify the most frequent transacted items
and generates associative rules between various items, Apriori algorithm is one such most sought solution for
association rule mining, in which scanning of transaction item was done efficiently without missing any items.
So the generation of candidate item set consumes more time to generate associative rules. To overcome the
slowness in associative rule pruning various strategies were discussed in literature to improve the speed of rule
formation. The approaches presented in the literatures adapted multifold iteration of the transaction data sets
which affected the time and search space for pruning the more sensitive item and its relative association. Some
of the sampling methods are available, but these processes will again affect the performance of the items which
leads to missed out transaction.
In this framework, many algorithms have been proposed for proficient creation of normal item sets in
the literature because the problem was first introduced. To reduce the size of the candidate item sets, the Direct
Hashing and Pruning (DHP) algorithm uses a hash table that results in efficient pruning of item sets. The
Partition algorithm decreases Input / Output (I/O) by examining the database only twice.
II. Literature Review
Ken Sun et al (2008) introduced new focus on Association Rule Mining (ARM) algorithms. This
proposal uses w-support, which does not require pre assigned weights, but this method is constraint-based in the
sense that all rules must fulfill a predefined set of conditions, such as support and confidence. However, the
main goal of this algorithm is to reduce the number of generated rules.
Jens Teubner et al (2011) explore how to accelerate the computation of frequent itemset using field-
programmable gateway. The pipeline solution was introduced to improve the performance. It uses the minimum
count as a threshold, so it is a constraint based algorithm.
Zhaonian Zou et al (2010) investigates the problem of mining uncertain graph data and especially
focuses on mining frequent sub graph patterns on an uncertain graph database. based on minimum support value
the frequent sub graph pattern is minined. Claudia Marinica et al (2010) proposed a new interactive approach to
prune and filter discovered rules to use ontologies in order to improve the integration of user knowledge in the
post processing task
Alok Sharma et al (2008), proposed a new method to reduce the search space. In that, prior to
dimensionality-reduction transformation an additional rotational transform that rotates the feature vectors in the
original feature space around their respective class centroids in such a way that the overlap between the classes
in the reduced feature space is further minimized
Elena Baralis et al (2009) proposed a method called IMine index, a general and compact structure
which provides tight integration of item set extraction in a Relational Data Base Management System. E.
Hüllermeier et al (2007) proposed an algorithm which was an adaptation of the Apriori algorithm for number of
items in the attributes. It is not easy to extend the algorithm to higher dimensional cases.
Efficient Parallel Pruning of Associative Rules with Optimized Search
www.iosrjournals.org 27 | Page
III. Problem Definition
Normally, the principle of association rule mining is to mine a set of shared highly correlated
attributes/features amongst a huge number of records in a given database for knowledge discovery. The fuzzy
ARM algorithms are used under large datasets for a fast and efficient performance.
A fuzzy ARM algorithm for generating fuzzy association rules is not a simple one. The first process is
conversion of crisp dataset which consists of crisp binary and numerical attributes, into a fuzzy dataset,
containing crisp binary and fuzzy binary attributes. The second process is to calculate the frequency of an item
set using the presence or absence in a transaction of the dataset, but fuzzy ARM algorithms must taken into
account in a particular transaction of the dataset, in addition to its presence or absence. This becomes tedious
process.
The problem of scalability and higher memory requirements are addressed in this research work by
deploying parallel pruning technique at different levels of items sets (one item set, two item set, etc.,). From the
recent literature we came to know that, only Apriori and its adaptations are used for generating association rules.
Thus, the Fuzzy based Optimal Search Space Pruning (FOSSP) is compared with existing fuzzy Apriori and the
execution time is recorded as in Fig.1.
IV. Objective
The objective is to minimize the number of candidate sets and enhancing the association rule mining
algorithm while creating an association rules by evaluating maximal information associated with each item that
occurs in given set of transaction. Initial work starts with the evaluation of weighted association rule mining in
terms of item-value relational metrics. Then the number of item metrics is taken into account of the association
rule mining with reduced candidate item set. This may decrease not only the number of item sets generated but
also the overall execution time of the algorithm. Any valued attribute will be treated as item-value relational
metrics and will be used to derive the minimal number of association rules which increased the rules
information content.
The research work evaluates the scalability of the FOSSP (say for car purchase dataset and bank
transaction data set) by considering transaction time, number of item sets used in the transaction and memory
utilization. In addition, further the work moves in the direction of fuzzy based item value of the rule mining
principle on associative rules of the complete item sets. To evaluate the item-value relativity metric of the
scalable association mining, optimal search on parallel pruning is planned for deployment as it can hold more
number of associative information.
V. Scalable Association Rule Mining Using Parallel Pruning
FOSSP presented in this work, first analyze the scalability issues of association rule mining in large
data sets. Parallel pruning technique is deployed in FOSSP to mine the large transactional items simultaneously
at different levels of items sets to improve the execution speed for generating frequent items and association
rules. The enhancement of Apriori algorithm is done by increasing the efficiency of candidate pruning phase by
reducing the number of candidates that are generated for further verification. The FOSSP pruning technique use
information associated to the number of items to estimate overlap items in the transactions. The basic elements
considered in the development of the FOSSP are number of transactions, average size of transaction, average
size of the maximal large item sets, number of items, and distribution of occurrences of large item sets.
The parallel pruning in FOSSP provides improvement over Apriori by generating frequent items and rules for
transaction data. It generates all candidates based on n-level frequent item sets on sorted database, and all
frequent item sets that can no longer be supported by transactions that still have to be processed. Thus the
FOSSP has no longer to maintain the covers of all past item sets sequentially. The algorithm for parallel pruning
technique to generate informative rules and strong frequent items is presented as below:
5.1 Framework of FOSSP Algorithm
Input: Number of Transactions and items, larger data sets
Output: Candidate items, number of informative rules, frequent items, execution time
Steps of Procedures
a. Initialize number of items and transactions from large data sets
b. Generate candidate item sets with information requirement
c. Reduce the candidate item with relative item values
d. With probability ratio, generate frequent item sets (i.e., satisfy minimum support)
e. Parallel prune the frequent items at different levels of the item set
f. With conditional probability on parallel pruned item levels, generate strong association rules.
g. Calculate execution time of frequent item set and informative association rules
h. Sort the item sets based on the frequency and information association
Efficient Parallel Pruning of Associative Rules with Optimized Search
www.iosrjournals.org 28 | Page
i. Merge the more associated rules of item pairs
j. Discard the infrequent item value pairs
k. Perform Fuzzy Parallel Pruning (PP)
l. Iterate the steps c to f till the required scalability mining results are achieved
Fuzzy PP algorithm:
For each t Є T
Search the whole Transaction and return all the items
Membership Function (mF) = {a Є A | 0 ≤ a ≤ 1}mF = 1 ; 0 ≤ a ≤ 1; mF = 0; otherwise
Perform mapping function
End
Where T - total transaction, t - transaction instance, A - complete item set, a - items of transaction instance.
For B = (y1, y2,…yn)
fuzzy set (B, n) = {n(y1)/y1,…n(yn)/ yn}
Scan the transformed database
Evaluate the support with the predefined Min Support value.
End
Where B – candidate item set, y1,y1..yn – frequent item set of transaction instances, n – number of instances.
In FOSSP the candidate item reduction object is updated in the iteration to determine the processing
items. In the Apriori association mining algorithm, the data item read, needs to be matched against all
candidates to determine the set of candidates whose counts will be incremented. It is not possible to statically
partition the reduction object so that different process update disjoint portions of the collection which made
parallel pruning in FOSSP more efficient. However as the pruning transaction item is more concerned in
parallel, the search space for frequent item generation and item-value pair based maximal information sensitive
association rules becomes complex. To overcome these facts, in the next chapter, the optimization of search
space using fuzzy rule set, is described.
VI. Optimization Of Search Space Using Fuzzy Rule Set
The traditional fuzzy ARM exploits a data-driven pre-processing approach which makes routine to the
formation of fuzzy partitions for numerical attributes. Therefore, it converts the given data set to fuzzy data set
that desires a lesser amount of human communication for even very large datasets. Numerical attributes in the
real data sets are converted to fuzzy sets which comprises of split data sets with boundary limits. The item
values in the split boundaries can have the uncertainty factor which affects the quality and accuracy of fuzzy
association rule mining. In addition the search space using fuzzy modeled association rule mining needs larger
memory to accommodate larger transactions data sets. The FOSSP presented in this work, which improves
parallel pruning technique is described in chapter 5. FOSSP utilize fuzzy rule controlled feedback scheme to
optimize the search space for more effective association rule generation. The following section describes about
various techniques to evaluate the scalability of association rule mining and the resultant optimal search space
for efficient item pruning
6.1 Partitioning Fuzzy Domain Set
In presenting the optimal search space approach for fuzzy association rule mining process, fuzzy
partition domains are made based on the user defined item-value attribute on the original dataset. To evaluate
the fuzzy data set for informative association rule mining, support and confidence metrics are redefined based
on the fuzzy binary attributes. The generation of fuzzy association rules is directly impacted by the fuzzy
measures adapted in the parallel pruning approach. The dataset is logically divided into „p‟ disjoint horizontal
partitions P1, P2… Pp. Each partition is as large as can fit in available optimal memory space. The partitions are
equal-sized, though each partition could be of any arbitrary size as well.
6.2 Optimal Search Fuzzy Feedback scheme for Informative Rule Generation
The optimal search space with fuzzy for association rule mining deployed iterative feedback on the rule
set generation. The parallel pruning of multi-level item set is split with fuzzy data set to obtain the rules from
respective partitioned domain, whereas the feedback scheme gets into each partitioned domain. Within the
partitioned domain, the initial rules generated for item value attributes that are governed by the optimal search
based feedback scheme to identify the sensitivity of fuzzy binary value in one domain to other. The optimal
fuzzy feedback scheme minimizes the number of rules being generated in each and every partitioned domain of
multiple outliers which are divided into groups.
Efficient Parallel Pruning of Associative Rules with Optimized Search
www.iosrjournals.org 29 | Page
VII. Experimental Results And Discussions on FOSSP
The experimental evaluation of FOSSP on identifying the results of performance metrics such as
scalability, search space optimality, informative associative rules sets, and candidate set reduction. The
scalability evaluation is made on the size of the data set used and its pruning time for generating frequent items
and association rule sets with deployment of parallel pruning of multi-level item sets simultaneously. The
optimality of search space for parallel pruning is measured by varying large items using fuzzy rule
appropriation.
For experimental purpose on the scalability issue, the samples for banking data set obtained from the
local governmental banking streams with size of transaction data with Giga Bytes (GBs) is used . The total
number of distinct items was 1000 and the average number of items in a transaction was 15.
Fig. 1. Comparision of execution time with FOSSP and Fuzzy Apriori
The confidence value of 90% and support value of 50% is given as an input. Normally, when the
number of iterations for item pruning increases then execution time gradually increases. The execution time for
parallel pruning is illustrated to evaluate the performance of the proposed technique, compared with the existing
Apriori rule generation as shown in Fig.1.
In General, when the data size for item pruning increases then execution time gradually increases. The
scalability performance of FOSSP shows 2 times faster execution time compared to that of fuzzy Apriori
models. Though the performance of scalability is considerably higher for parallel pruning, the execution time
requirement increases with the growth in the size of unique items as shown in Fig.2.
Fig. 2. Scalability evaluation with FOSSP and Fuzzy Apriori
Usually, when the item set for pruning increases, the search space also gradually increase. Further
datasets from machine learning repository (Car Purchase Data set, Bank transaction data set) are extracted and
enhanced with data size to GBs with more number of unique items. The performance of FOSSP in terms of
scalability as well as the search space requirements at each of these data sets is depicted as in Fig.3. The optimal
value of memory for search space and the maximum size of the data set, minimal number of rule generation
covering most possible information of the data set, and candidate set reduction are evaluated.
The car dataset with 20 distinct items, where the average number of items per transaction is 6 to 8 are
used for the experimental evaluation of FOSSP. The total size of the dataset is 2 GBs and a confidence level(C)
of 90% is used. The support counts testified with the transaction for frequent item pruning are 70%, 85%, 93%,
and 62%. The execution time is improved for FOSSP with reduction of 2 to 4 times as compared to fuzzy
Apriori and the memory utilization reduced nearly 2 to 3 times for the data size of 2 GB Car purchase data set.
With experimental result on the car purchase data set, the performance of FOSSP is improved when compared
to Fuzzy Apriori.
No of Iteration Vs Execution time
0
20
40
60
80
100
120
140
8 -16 16 - 32 32 - 48 48 - 60
Number of Iterations
ExecutionTime
Execution time -
FOSSP (Proposed)
Execution time -
Fuzzy Apriori
(Existing)
Data size Vs Time
0
5000
10000
15000
20000
25000
30000
35000
40000
1 2 3 4 5 6 7 8 9
Data size (MB)
Time(sec)
Time for fuzzy based
apriori (Existing)
Time for FOSSP
(Proposed)
Efficient Parallel Pruning of Associative Rules with Optimized Search
www.iosrjournals.org 30 | Page
Fig. 3. Comparision of search space with FOSSP and Fuzzy Apriori
The performance results of FOSSP approach are evaluated with various values of support(S) ranging
from 25% to 40%. It is concluded from the observation of the results that the proposed FOSSP approach derives
effective item-value pair based strong association rule with optimal search space performs 25% faster than
fuzzy adapted variants of Apriori(Fig.3), based on the user defined support value. With other dataset samples,
the support value is approximated for 34%, in which optimal number of item sets is generated.
From these experiments, it is observed that the FOSSP approach performs most efficiently (more
accurate rules) and speedily at the optimal support value, which occurs in the range of 15% - 20% for car
dataset. Another purpose was to reduce the number of parallel pruning to the data transaction partitions in
FOSSP with just one partition for support values of 20% – 40% on car data sets and 10% – 40% on bank data
set, keeping in mind that the main memory is utilized in the best manner possible, without any thrashing.
Furthermore, with the fuzzy based optimal search feedback scheme, it was observed that more informative rules
for all the attributes with more sensitive frequent item have been occurred.
VIII. Conclusion
The fuzzy based optimal search pruning technique presented in this research work evaluated frequent
items with more sensitive item-value pairs. The rule obtained with FOSSP generated appropriate candidate item
set that contributes to the improvement of extracting maximal informative association rules from the large
transactional data sets. Parallel pruning of item sets at multiple levels of the complete items (one item set, two
item sets, … n item sets) decreased the execution time of the FOSSP rule mining, as the frequent items for all
the levels obtained simultaneously. Fuzzy rule is modeled to function parallel pruning with optimal search space
and reduced the trade off between scalability of data sets and the search space for larger items.
In fuzzy Apriori, search space for pruning gets increased as for larger data set which affected the
performance of association rule mining; however, FOSSP provided optimal search size for larger data sets. The
experimental results shows that FOSSP works better in terms of time reduction when contrast to fuzzy Apriori
model.
References
[1] Ken Sun and Fengshan Bai “Mining Weighted Association Rules without Pre assigned Weights “. IEEE Transactions on
Knowledge and Data Engineering, Vol. 20, No. 4, pp. 489-495, April 2008.
[2] Jens Teubner . Rene Mueller, and Gustavo Alonso “Frequent Item Computation on a Chip”, IEEE Transactions on Knowledge
and Data Engineering, Vol. 23, No. 8, pp 1169-1181, August 2011.
[3] Zhaonian Zou, Jianzhong Li, Hong Gao, and Shuo Zhang, “Mining Frequent Subgraph Patterns from Uncertain Graph Data”, IEEE
Transactions on Knowledge and Data Engineering, Vol. 22, No. 9,pp 1203-1218, September 2010.
[4] Claudia Marinica and Fabrice Guillet “Knowledge-Based Interactive Postmining of Association Rules Using Ontologies”, IEEE
Transactions on Knowledge and Data Engineering, Vol. 22, No. 6, pp. 784-797, June 2010.
[5] Alok Sharma, and K. Kuldip Paliwal, “Rotational Linear Discriminant Analysis Technique for Dimensionality Reduction”, IEEE
Transactions on Knowledge and Data Engineering, Vol. 20, No. 10, pp 1336-1347, October 2008.
[6] Elena Baralis, Tania Cerquitelli, and Silvia Chiusano, “IMine: Index Support for Item Set Mining”, IEEE Transactions on
Knowledge and Data Engineering, Vol. 21, No. 4, pp. 493-506, April 2009.
[7] Hüllermeier E, Y. Yi, “Defense of Fuzzy Association Analysis”, IEEE Transactions on Systems, Man, and Cybernetics - Part B:
Cybernetics, Vol. 37, No.4, pp.1039- 1043, July 2007.
[8] Verlinde H, M. De Cock, R. Boute, “Fuzzy Versus Quantitative Association Rules: A Fair Data-Driven Comparison”, IEEE
Transactions on Systems, Man, and Cybernetics - Part B: Cybernetics, Vol. 36, No. 3, pp. 679-683, June 2006.
Number of Itemsets Vs Search Space
0
2000
4000
6000
8000
10000
12000
24 32 36 42
Number of Itemsets
SearchSpace(Bytes)
Search Space
(FOSSP)
Search Space
(Fuzzy Apriori)

Más contenido relacionado

La actualidad más candente

Mining High Utility Patterns in Large Databases using Mapreduce Framework
Mining High Utility Patterns in Large Databases using Mapreduce FrameworkMining High Utility Patterns in Large Databases using Mapreduce Framework
Mining High Utility Patterns in Large Databases using Mapreduce FrameworkIRJET Journal
 
An Efficient and Scalable UP-Growth Algorithm with Optimized Threshold (min_u...
An Efficient and Scalable UP-Growth Algorithm with Optimized Threshold (min_u...An Efficient and Scalable UP-Growth Algorithm with Optimized Threshold (min_u...
An Efficient and Scalable UP-Growth Algorithm with Optimized Threshold (min_u...IRJET Journal
 
Frequent Item Set Mining - A Review
Frequent Item Set Mining - A ReviewFrequent Item Set Mining - A Review
Frequent Item Set Mining - A Reviewijsrd.com
 
Re-Mining Association Mining Results Through Visualization, Data Envelopment ...
Re-Mining Association Mining Results Through Visualization, Data Envelopment ...Re-Mining Association Mining Results Through Visualization, Data Envelopment ...
Re-Mining Association Mining Results Through Visualization, Data Envelopment ...ertekg
 
3.[18 22]hybrid association rule mining using ac tree
3.[18 22]hybrid association rule mining using ac tree3.[18 22]hybrid association rule mining using ac tree
3.[18 22]hybrid association rule mining using ac treeAlexander Decker
 
A classification of methods for frequent pattern mining
A classification of methods for frequent pattern miningA classification of methods for frequent pattern mining
A classification of methods for frequent pattern miningIOSR Journals
 
An incremental mining algorithm for maintaining sequential patterns using pre...
An incremental mining algorithm for maintaining sequential patterns using pre...An incremental mining algorithm for maintaining sequential patterns using pre...
An incremental mining algorithm for maintaining sequential patterns using pre...Editor IJMTER
 
An Ontological Approach for Mining Association Rules from Transactional Dataset
An Ontological Approach for Mining Association Rules from Transactional DatasetAn Ontological Approach for Mining Association Rules from Transactional Dataset
An Ontological Approach for Mining Association Rules from Transactional DatasetIJERA Editor
 
Mining Query Log to Suggest Competitive Keyphrases for Sponsored Search Via I...
Mining Query Log to Suggest Competitive Keyphrases for Sponsored Search Via I...Mining Query Log to Suggest Competitive Keyphrases for Sponsored Search Via I...
Mining Query Log to Suggest Competitive Keyphrases for Sponsored Search Via I...IRJET Journal
 
A New Extraction Optimization Approach to Frequent 2 Item sets
A New Extraction Optimization Approach to Frequent 2 Item setsA New Extraction Optimization Approach to Frequent 2 Item sets
A New Extraction Optimization Approach to Frequent 2 Item setsijcsa
 
A comprehensive study of major techniques of multi level frequent pattern min...
A comprehensive study of major techniques of multi level frequent pattern min...A comprehensive study of major techniques of multi level frequent pattern min...
A comprehensive study of major techniques of multi level frequent pattern min...eSAT Publishing House
 
A Fuzzy Algorithm for Mining High Utility Rare Itemsets – FHURI
A Fuzzy Algorithm for Mining High Utility Rare Itemsets – FHURIA Fuzzy Algorithm for Mining High Utility Rare Itemsets – FHURI
A Fuzzy Algorithm for Mining High Utility Rare Itemsets – FHURIidescitation
 

La actualidad más candente (15)

Mining High Utility Patterns in Large Databases using Mapreduce Framework
Mining High Utility Patterns in Large Databases using Mapreduce FrameworkMining High Utility Patterns in Large Databases using Mapreduce Framework
Mining High Utility Patterns in Large Databases using Mapreduce Framework
 
An Efficient and Scalable UP-Growth Algorithm with Optimized Threshold (min_u...
An Efficient and Scalable UP-Growth Algorithm with Optimized Threshold (min_u...An Efficient and Scalable UP-Growth Algorithm with Optimized Threshold (min_u...
An Efficient and Scalable UP-Growth Algorithm with Optimized Threshold (min_u...
 
Frequent Item Set Mining - A Review
Frequent Item Set Mining - A ReviewFrequent Item Set Mining - A Review
Frequent Item Set Mining - A Review
 
Re-Mining Association Mining Results Through Visualization, Data Envelopment ...
Re-Mining Association Mining Results Through Visualization, Data Envelopment ...Re-Mining Association Mining Results Through Visualization, Data Envelopment ...
Re-Mining Association Mining Results Through Visualization, Data Envelopment ...
 
K355662
K355662K355662
K355662
 
3.[18 22]hybrid association rule mining using ac tree
3.[18 22]hybrid association rule mining using ac tree3.[18 22]hybrid association rule mining using ac tree
3.[18 22]hybrid association rule mining using ac tree
 
A classification of methods for frequent pattern mining
A classification of methods for frequent pattern miningA classification of methods for frequent pattern mining
A classification of methods for frequent pattern mining
 
An incremental mining algorithm for maintaining sequential patterns using pre...
An incremental mining algorithm for maintaining sequential patterns using pre...An incremental mining algorithm for maintaining sequential patterns using pre...
An incremental mining algorithm for maintaining sequential patterns using pre...
 
Ijcatr04051004
Ijcatr04051004Ijcatr04051004
Ijcatr04051004
 
An Ontological Approach for Mining Association Rules from Transactional Dataset
An Ontological Approach for Mining Association Rules from Transactional DatasetAn Ontological Approach for Mining Association Rules from Transactional Dataset
An Ontological Approach for Mining Association Rules from Transactional Dataset
 
50120140501018
5012014050101850120140501018
50120140501018
 
Mining Query Log to Suggest Competitive Keyphrases for Sponsored Search Via I...
Mining Query Log to Suggest Competitive Keyphrases for Sponsored Search Via I...Mining Query Log to Suggest Competitive Keyphrases for Sponsored Search Via I...
Mining Query Log to Suggest Competitive Keyphrases for Sponsored Search Via I...
 
A New Extraction Optimization Approach to Frequent 2 Item sets
A New Extraction Optimization Approach to Frequent 2 Item setsA New Extraction Optimization Approach to Frequent 2 Item sets
A New Extraction Optimization Approach to Frequent 2 Item sets
 
A comprehensive study of major techniques of multi level frequent pattern min...
A comprehensive study of major techniques of multi level frequent pattern min...A comprehensive study of major techniques of multi level frequent pattern min...
A comprehensive study of major techniques of multi level frequent pattern min...
 
A Fuzzy Algorithm for Mining High Utility Rare Itemsets – FHURI
A Fuzzy Algorithm for Mining High Utility Rare Itemsets – FHURIA Fuzzy Algorithm for Mining High Utility Rare Itemsets – FHURI
A Fuzzy Algorithm for Mining High Utility Rare Itemsets – FHURI
 

Destacado

Congestion Control in Manets Using Hybrid Routing Protocol
Congestion Control in Manets Using Hybrid Routing ProtocolCongestion Control in Manets Using Hybrid Routing Protocol
Congestion Control in Manets Using Hybrid Routing ProtocolIOSR Journals
 
Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...
Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...
Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...IOSR Journals
 
Effects of Addition of Sprint, Strength and Agility Training On Cardiovascula...
Effects of Addition of Sprint, Strength and Agility Training On Cardiovascula...Effects of Addition of Sprint, Strength and Agility Training On Cardiovascula...
Effects of Addition of Sprint, Strength and Agility Training On Cardiovascula...IOSR Journals
 
Effect of Seismic Joint in the Performance of Multi-Storeyed L-Shaped Building
Effect of Seismic Joint in the Performance of Multi-Storeyed L-Shaped BuildingEffect of Seismic Joint in the Performance of Multi-Storeyed L-Shaped Building
Effect of Seismic Joint in the Performance of Multi-Storeyed L-Shaped BuildingIOSR Journals
 
Postural Comparison of Dextrous and Ambidextrous Players
Postural Comparison of Dextrous and Ambidextrous PlayersPostural Comparison of Dextrous and Ambidextrous Players
Postural Comparison of Dextrous and Ambidextrous PlayersIOSR Journals
 
Investigation of Tribological Behavior of Stainless Steel 304 and Grey Cast I...
Investigation of Tribological Behavior of Stainless Steel 304 and Grey Cast I...Investigation of Tribological Behavior of Stainless Steel 304 and Grey Cast I...
Investigation of Tribological Behavior of Stainless Steel 304 and Grey Cast I...IOSR Journals
 
Comparative Performance of Crude Pongamia Oil in A Low Heat Rejection Diesel ...
Comparative Performance of Crude Pongamia Oil in A Low Heat Rejection Diesel ...Comparative Performance of Crude Pongamia Oil in A Low Heat Rejection Diesel ...
Comparative Performance of Crude Pongamia Oil in A Low Heat Rejection Diesel ...IOSR Journals
 
Common Sports Injuries amongst the Elite Women Basketball Players of India
Common Sports Injuries amongst the Elite Women Basketball Players of IndiaCommon Sports Injuries amongst the Elite Women Basketball Players of India
Common Sports Injuries amongst the Elite Women Basketball Players of IndiaIOSR Journals
 
Modified Pure Radix Sort for Large Heterogeneous Data Set
Modified Pure Radix Sort for Large Heterogeneous Data Set Modified Pure Radix Sort for Large Heterogeneous Data Set
Modified Pure Radix Sort for Large Heterogeneous Data Set IOSR Journals
 
Optimal Estimating Sequence for a Hilbert Space Valued Parameter
Optimal Estimating Sequence for a Hilbert Space Valued ParameterOptimal Estimating Sequence for a Hilbert Space Valued Parameter
Optimal Estimating Sequence for a Hilbert Space Valued ParameterIOSR Journals
 
Motor Fitness of Rural Primary School Girls In Comparison To Boys
Motor Fitness of Rural Primary School Girls In Comparison To Boys Motor Fitness of Rural Primary School Girls In Comparison To Boys
Motor Fitness of Rural Primary School Girls In Comparison To Boys IOSR Journals
 
Effects of Harness Running, Sand Running, Weight - Jacket Running and Weight ...
Effects of Harness Running, Sand Running, Weight - Jacket Running and Weight ...Effects of Harness Running, Sand Running, Weight - Jacket Running and Weight ...
Effects of Harness Running, Sand Running, Weight - Jacket Running and Weight ...IOSR Journals
 
On Stability Equilibrium Analysis of Endemic Malaria
On Stability Equilibrium Analysis of Endemic MalariaOn Stability Equilibrium Analysis of Endemic Malaria
On Stability Equilibrium Analysis of Endemic MalariaIOSR Journals
 
Critical barriers impeding the delivery of Physical Education in Zimbabwean p...
Critical barriers impeding the delivery of Physical Education in Zimbabwean p...Critical barriers impeding the delivery of Physical Education in Zimbabwean p...
Critical barriers impeding the delivery of Physical Education in Zimbabwean p...IOSR Journals
 
L Inequalities Concerning Polynomials Having Zeros in Closed Interior of A Ci...
L Inequalities Concerning Polynomials Having Zeros in Closed Interior of A Ci...L Inequalities Concerning Polynomials Having Zeros in Closed Interior of A Ci...
L Inequalities Concerning Polynomials Having Zeros in Closed Interior of A Ci...IOSR Journals
 

Destacado (20)

F0733038
F0733038F0733038
F0733038
 
Congestion Control in Manets Using Hybrid Routing Protocol
Congestion Control in Manets Using Hybrid Routing ProtocolCongestion Control in Manets Using Hybrid Routing Protocol
Congestion Control in Manets Using Hybrid Routing Protocol
 
Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...
Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...
Feature Extraction Based Estimation of Rain Fall By Cross Correlating Cloud R...
 
Effects of Addition of Sprint, Strength and Agility Training On Cardiovascula...
Effects of Addition of Sprint, Strength and Agility Training On Cardiovascula...Effects of Addition of Sprint, Strength and Agility Training On Cardiovascula...
Effects of Addition of Sprint, Strength and Agility Training On Cardiovascula...
 
Effect of Seismic Joint in the Performance of Multi-Storeyed L-Shaped Building
Effect of Seismic Joint in the Performance of Multi-Storeyed L-Shaped BuildingEffect of Seismic Joint in the Performance of Multi-Storeyed L-Shaped Building
Effect of Seismic Joint in the Performance of Multi-Storeyed L-Shaped Building
 
K0336268
K0336268K0336268
K0336268
 
D0452428
D0452428D0452428
D0452428
 
Postural Comparison of Dextrous and Ambidextrous Players
Postural Comparison of Dextrous and Ambidextrous PlayersPostural Comparison of Dextrous and Ambidextrous Players
Postural Comparison of Dextrous and Ambidextrous Players
 
Investigation of Tribological Behavior of Stainless Steel 304 and Grey Cast I...
Investigation of Tribological Behavior of Stainless Steel 304 and Grey Cast I...Investigation of Tribological Behavior of Stainless Steel 304 and Grey Cast I...
Investigation of Tribological Behavior of Stainless Steel 304 and Grey Cast I...
 
Comparative Performance of Crude Pongamia Oil in A Low Heat Rejection Diesel ...
Comparative Performance of Crude Pongamia Oil in A Low Heat Rejection Diesel ...Comparative Performance of Crude Pongamia Oil in A Low Heat Rejection Diesel ...
Comparative Performance of Crude Pongamia Oil in A Low Heat Rejection Diesel ...
 
Common Sports Injuries amongst the Elite Women Basketball Players of India
Common Sports Injuries amongst the Elite Women Basketball Players of IndiaCommon Sports Injuries amongst the Elite Women Basketball Players of India
Common Sports Injuries amongst the Elite Women Basketball Players of India
 
Modified Pure Radix Sort for Large Heterogeneous Data Set
Modified Pure Radix Sort for Large Heterogeneous Data Set Modified Pure Radix Sort for Large Heterogeneous Data Set
Modified Pure Radix Sort for Large Heterogeneous Data Set
 
L0745761
L0745761L0745761
L0745761
 
Optimal Estimating Sequence for a Hilbert Space Valued Parameter
Optimal Estimating Sequence for a Hilbert Space Valued ParameterOptimal Estimating Sequence for a Hilbert Space Valued Parameter
Optimal Estimating Sequence for a Hilbert Space Valued Parameter
 
Motor Fitness of Rural Primary School Girls In Comparison To Boys
Motor Fitness of Rural Primary School Girls In Comparison To Boys Motor Fitness of Rural Primary School Girls In Comparison To Boys
Motor Fitness of Rural Primary School Girls In Comparison To Boys
 
Effects of Harness Running, Sand Running, Weight - Jacket Running and Weight ...
Effects of Harness Running, Sand Running, Weight - Jacket Running and Weight ...Effects of Harness Running, Sand Running, Weight - Jacket Running and Weight ...
Effects of Harness Running, Sand Running, Weight - Jacket Running and Weight ...
 
H0534248
H0534248H0534248
H0534248
 
On Stability Equilibrium Analysis of Endemic Malaria
On Stability Equilibrium Analysis of Endemic MalariaOn Stability Equilibrium Analysis of Endemic Malaria
On Stability Equilibrium Analysis of Endemic Malaria
 
Critical barriers impeding the delivery of Physical Education in Zimbabwean p...
Critical barriers impeding the delivery of Physical Education in Zimbabwean p...Critical barriers impeding the delivery of Physical Education in Zimbabwean p...
Critical barriers impeding the delivery of Physical Education in Zimbabwean p...
 
L Inequalities Concerning Polynomials Having Zeros in Closed Interior of A Ci...
L Inequalities Concerning Polynomials Having Zeros in Closed Interior of A Ci...L Inequalities Concerning Polynomials Having Zeros in Closed Interior of A Ci...
L Inequalities Concerning Polynomials Having Zeros in Closed Interior of A Ci...
 

Similar a Efficient Parallel Pruning of Associative Rules with Optimized Search

Hadoop Map-Reduce To Generate Frequent Item Set on Large Datasets Using Impro...
Hadoop Map-Reduce To Generate Frequent Item Set on Large Datasets Using Impro...Hadoop Map-Reduce To Generate Frequent Item Set on Large Datasets Using Impro...
Hadoop Map-Reduce To Generate Frequent Item Set on Large Datasets Using Impro...BRNSSPublicationHubI
 
Mining Frequent Item set Using Genetic Algorithm
Mining Frequent Item set Using Genetic AlgorithmMining Frequent Item set Using Genetic Algorithm
Mining Frequent Item set Using Genetic Algorithmijsrd.com
 
Improved Frequent Pattern Mining Algorithm using Divide and Conquer Technique...
Improved Frequent Pattern Mining Algorithm using Divide and Conquer Technique...Improved Frequent Pattern Mining Algorithm using Divide and Conquer Technique...
Improved Frequent Pattern Mining Algorithm using Divide and Conquer Technique...ijsrd.com
 
Review on: Techniques for Predicting Frequent Items
Review on: Techniques for Predicting Frequent ItemsReview on: Techniques for Predicting Frequent Items
Review on: Techniques for Predicting Frequent Itemsvivatechijri
 
Review Over Sequential Rule Mining
Review Over Sequential Rule MiningReview Over Sequential Rule Mining
Review Over Sequential Rule Miningijsrd.com
 
Hardware enhanced association rule mining
Hardware enhanced association rule miningHardware enhanced association rule mining
Hardware enhanced association rule miningStudsPlanet.com
 
COMPARATIVE STUDY OF DISTRIBUTED FREQUENT PATTERN MINING ALGORITHMS FOR BIG S...
COMPARATIVE STUDY OF DISTRIBUTED FREQUENT PATTERN MINING ALGORITHMS FOR BIG S...COMPARATIVE STUDY OF DISTRIBUTED FREQUENT PATTERN MINING ALGORITHMS FOR BIG S...
COMPARATIVE STUDY OF DISTRIBUTED FREQUENT PATTERN MINING ALGORITHMS FOR BIG S...IAEME Publication
 
Hardware enhanced association rule mining
Hardware enhanced association rule miningHardware enhanced association rule mining
Hardware enhanced association rule miningStudsPlanet.com
 
Hybrid approach for generating non overlapped substring using genetic algorithm
Hybrid approach for generating non overlapped substring using genetic algorithmHybrid approach for generating non overlapped substring using genetic algorithm
Hybrid approach for generating non overlapped substring using genetic algorithmeSAT Publishing House
 
A genetic algorithm coupled with tree-based pruning for mining closed associa...
A genetic algorithm coupled with tree-based pruning for mining closed associa...A genetic algorithm coupled with tree-based pruning for mining closed associa...
A genetic algorithm coupled with tree-based pruning for mining closed associa...IJECEIAES
 
Improved Text Mining for Bulk Data Using Deep Learning Approach
Improved Text Mining for Bulk Data Using Deep Learning Approach Improved Text Mining for Bulk Data Using Deep Learning Approach
Improved Text Mining for Bulk Data Using Deep Learning Approach IJCSIS Research Publications
 
An Efficient Compressed Data Structure Based Method for Frequent Item Set Mining
An Efficient Compressed Data Structure Based Method for Frequent Item Set MiningAn Efficient Compressed Data Structure Based Method for Frequent Item Set Mining
An Efficient Compressed Data Structure Based Method for Frequent Item Set Miningijsrd.com
 
A signature based indexing method for efficient content-based retrieval of re...
A signature based indexing method for efficient content-based retrieval of re...A signature based indexing method for efficient content-based retrieval of re...
A signature based indexing method for efficient content-based retrieval of re...Mumbai Academisc
 
REVIEW: Frequent Pattern Mining Techniques
REVIEW: Frequent Pattern Mining TechniquesREVIEW: Frequent Pattern Mining Techniques
REVIEW: Frequent Pattern Mining TechniquesEditor IJMTER
 
A Survey on Fuzzy Association Rule Mining Methodologies
A Survey on Fuzzy Association Rule Mining MethodologiesA Survey on Fuzzy Association Rule Mining Methodologies
A Survey on Fuzzy Association Rule Mining MethodologiesIOSR Journals
 
A Quantified Approach for large Dataset Compression in Association Mining
A Quantified Approach for large Dataset Compression in Association MiningA Quantified Approach for large Dataset Compression in Association Mining
A Quantified Approach for large Dataset Compression in Association MiningIOSR Journals
 

Similar a Efficient Parallel Pruning of Associative Rules with Optimized Search (20)

Hadoop Map-Reduce To Generate Frequent Item Set on Large Datasets Using Impro...
Hadoop Map-Reduce To Generate Frequent Item Set on Large Datasets Using Impro...Hadoop Map-Reduce To Generate Frequent Item Set on Large Datasets Using Impro...
Hadoop Map-Reduce To Generate Frequent Item Set on Large Datasets Using Impro...
 
Mining Frequent Item set Using Genetic Algorithm
Mining Frequent Item set Using Genetic AlgorithmMining Frequent Item set Using Genetic Algorithm
Mining Frequent Item set Using Genetic Algorithm
 
J017114852
J017114852J017114852
J017114852
 
Improved Frequent Pattern Mining Algorithm using Divide and Conquer Technique...
Improved Frequent Pattern Mining Algorithm using Divide and Conquer Technique...Improved Frequent Pattern Mining Algorithm using Divide and Conquer Technique...
Improved Frequent Pattern Mining Algorithm using Divide and Conquer Technique...
 
Review on: Techniques for Predicting Frequent Items
Review on: Techniques for Predicting Frequent ItemsReview on: Techniques for Predicting Frequent Items
Review on: Techniques for Predicting Frequent Items
 
IMPROVED APRIORI ALGORITHM FOR ASSOCIATION RULES
IMPROVED APRIORI ALGORITHM FOR ASSOCIATION RULESIMPROVED APRIORI ALGORITHM FOR ASSOCIATION RULES
IMPROVED APRIORI ALGORITHM FOR ASSOCIATION RULES
 
Review Over Sequential Rule Mining
Review Over Sequential Rule MiningReview Over Sequential Rule Mining
Review Over Sequential Rule Mining
 
Hardware enhanced association rule mining
Hardware enhanced association rule miningHardware enhanced association rule mining
Hardware enhanced association rule mining
 
COMPARATIVE STUDY OF DISTRIBUTED FREQUENT PATTERN MINING ALGORITHMS FOR BIG S...
COMPARATIVE STUDY OF DISTRIBUTED FREQUENT PATTERN MINING ALGORITHMS FOR BIG S...COMPARATIVE STUDY OF DISTRIBUTED FREQUENT PATTERN MINING ALGORITHMS FOR BIG S...
COMPARATIVE STUDY OF DISTRIBUTED FREQUENT PATTERN MINING ALGORITHMS FOR BIG S...
 
Hardware enhanced association rule mining
Hardware enhanced association rule miningHardware enhanced association rule mining
Hardware enhanced association rule mining
 
Ijetcas14 316
Ijetcas14 316Ijetcas14 316
Ijetcas14 316
 
Hybrid approach for generating non overlapped substring using genetic algorithm
Hybrid approach for generating non overlapped substring using genetic algorithmHybrid approach for generating non overlapped substring using genetic algorithm
Hybrid approach for generating non overlapped substring using genetic algorithm
 
A genetic algorithm coupled with tree-based pruning for mining closed associa...
A genetic algorithm coupled with tree-based pruning for mining closed associa...A genetic algorithm coupled with tree-based pruning for mining closed associa...
A genetic algorithm coupled with tree-based pruning for mining closed associa...
 
K355662
K355662K355662
K355662
 
Improved Text Mining for Bulk Data Using Deep Learning Approach
Improved Text Mining for Bulk Data Using Deep Learning Approach Improved Text Mining for Bulk Data Using Deep Learning Approach
Improved Text Mining for Bulk Data Using Deep Learning Approach
 
An Efficient Compressed Data Structure Based Method for Frequent Item Set Mining
An Efficient Compressed Data Structure Based Method for Frequent Item Set MiningAn Efficient Compressed Data Structure Based Method for Frequent Item Set Mining
An Efficient Compressed Data Structure Based Method for Frequent Item Set Mining
 
A signature based indexing method for efficient content-based retrieval of re...
A signature based indexing method for efficient content-based retrieval of re...A signature based indexing method for efficient content-based retrieval of re...
A signature based indexing method for efficient content-based retrieval of re...
 
REVIEW: Frequent Pattern Mining Techniques
REVIEW: Frequent Pattern Mining TechniquesREVIEW: Frequent Pattern Mining Techniques
REVIEW: Frequent Pattern Mining Techniques
 
A Survey on Fuzzy Association Rule Mining Methodologies
A Survey on Fuzzy Association Rule Mining MethodologiesA Survey on Fuzzy Association Rule Mining Methodologies
A Survey on Fuzzy Association Rule Mining Methodologies
 
A Quantified Approach for large Dataset Compression in Association Mining
A Quantified Approach for large Dataset Compression in Association MiningA Quantified Approach for large Dataset Compression in Association Mining
A Quantified Approach for large Dataset Compression in Association Mining
 

Más de IOSR Journals (20)

A011140104
A011140104A011140104
A011140104
 
M0111397100
M0111397100M0111397100
M0111397100
 
L011138596
L011138596L011138596
L011138596
 
K011138084
K011138084K011138084
K011138084
 
J011137479
J011137479J011137479
J011137479
 
I011136673
I011136673I011136673
I011136673
 
G011134454
G011134454G011134454
G011134454
 
H011135565
H011135565H011135565
H011135565
 
F011134043
F011134043F011134043
F011134043
 
E011133639
E011133639E011133639
E011133639
 
D011132635
D011132635D011132635
D011132635
 
C011131925
C011131925C011131925
C011131925
 
B011130918
B011130918B011130918
B011130918
 
A011130108
A011130108A011130108
A011130108
 
I011125160
I011125160I011125160
I011125160
 
H011124050
H011124050H011124050
H011124050
 
G011123539
G011123539G011123539
G011123539
 
F011123134
F011123134F011123134
F011123134
 
E011122530
E011122530E011122530
E011122530
 
D011121524
D011121524D011121524
D011121524
 

Último

The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 

Último (20)

DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 

Efficient Parallel Pruning of Associative Rules with Optimized Search

  • 1. IOSR Journal of Computer Engineering (IOSRJCE) ISSN: 2278-0661 Volume 3, Issue 5 (July-Aug. 2012), PP 26-30 www.iosrjournals.org www.iosrjournals.org 26 | Page Efficient Parallel Pruning of Associative Rules with Optimized Search 1 K.Sangeetha, 2 Dr.P.S.Periasamy, 3 S.Prakash 1 Assistant Professor (SG), S.N.S.College of Technology, Coimbatore ,2 Professor, K.S.R.College of Engineering, Tiruchengode 3 Assistant Professor (SG,) Sri Shakthi Institute of Engineering and Technology, Coimbatore Abstract: The main focus of this research work is to propose an improved association rule mining algorithm to minimize the number of candidate sets while generating association rules with efficient pruning time and search space optimization. The relative association with reduced candidate item set reduces the overall execution time. The scalability of this work is measured with number of item sets used in the transaction and size of the data set. Further Fuzzy based rule mining principle is adapted in this work to obtain more informative associative rules and frequent items with increased sensitive. The requirement for sensitive items is to have a semantic connection between the components of the item-value pairs. The effectiveness of item-value pairs minimizes the search space to its optimality. Optimality of the search space indicates the trade off between pruning time and size of the data set. I. Introduction A rapid growth of information extraction from large transactional data sets fueled the demand of knowledge discovery and associative relation between the items. To identify the most frequent transacted items and generates associative rules between various items, Apriori algorithm is one such most sought solution for association rule mining, in which scanning of transaction item was done efficiently without missing any items. So the generation of candidate item set consumes more time to generate associative rules. To overcome the slowness in associative rule pruning various strategies were discussed in literature to improve the speed of rule formation. The approaches presented in the literatures adapted multifold iteration of the transaction data sets which affected the time and search space for pruning the more sensitive item and its relative association. Some of the sampling methods are available, but these processes will again affect the performance of the items which leads to missed out transaction. In this framework, many algorithms have been proposed for proficient creation of normal item sets in the literature because the problem was first introduced. To reduce the size of the candidate item sets, the Direct Hashing and Pruning (DHP) algorithm uses a hash table that results in efficient pruning of item sets. The Partition algorithm decreases Input / Output (I/O) by examining the database only twice. II. Literature Review Ken Sun et al (2008) introduced new focus on Association Rule Mining (ARM) algorithms. This proposal uses w-support, which does not require pre assigned weights, but this method is constraint-based in the sense that all rules must fulfill a predefined set of conditions, such as support and confidence. However, the main goal of this algorithm is to reduce the number of generated rules. Jens Teubner et al (2011) explore how to accelerate the computation of frequent itemset using field- programmable gateway. The pipeline solution was introduced to improve the performance. It uses the minimum count as a threshold, so it is a constraint based algorithm. Zhaonian Zou et al (2010) investigates the problem of mining uncertain graph data and especially focuses on mining frequent sub graph patterns on an uncertain graph database. based on minimum support value the frequent sub graph pattern is minined. Claudia Marinica et al (2010) proposed a new interactive approach to prune and filter discovered rules to use ontologies in order to improve the integration of user knowledge in the post processing task Alok Sharma et al (2008), proposed a new method to reduce the search space. In that, prior to dimensionality-reduction transformation an additional rotational transform that rotates the feature vectors in the original feature space around their respective class centroids in such a way that the overlap between the classes in the reduced feature space is further minimized Elena Baralis et al (2009) proposed a method called IMine index, a general and compact structure which provides tight integration of item set extraction in a Relational Data Base Management System. E. Hüllermeier et al (2007) proposed an algorithm which was an adaptation of the Apriori algorithm for number of items in the attributes. It is not easy to extend the algorithm to higher dimensional cases.
  • 2. Efficient Parallel Pruning of Associative Rules with Optimized Search www.iosrjournals.org 27 | Page III. Problem Definition Normally, the principle of association rule mining is to mine a set of shared highly correlated attributes/features amongst a huge number of records in a given database for knowledge discovery. The fuzzy ARM algorithms are used under large datasets for a fast and efficient performance. A fuzzy ARM algorithm for generating fuzzy association rules is not a simple one. The first process is conversion of crisp dataset which consists of crisp binary and numerical attributes, into a fuzzy dataset, containing crisp binary and fuzzy binary attributes. The second process is to calculate the frequency of an item set using the presence or absence in a transaction of the dataset, but fuzzy ARM algorithms must taken into account in a particular transaction of the dataset, in addition to its presence or absence. This becomes tedious process. The problem of scalability and higher memory requirements are addressed in this research work by deploying parallel pruning technique at different levels of items sets (one item set, two item set, etc.,). From the recent literature we came to know that, only Apriori and its adaptations are used for generating association rules. Thus, the Fuzzy based Optimal Search Space Pruning (FOSSP) is compared with existing fuzzy Apriori and the execution time is recorded as in Fig.1. IV. Objective The objective is to minimize the number of candidate sets and enhancing the association rule mining algorithm while creating an association rules by evaluating maximal information associated with each item that occurs in given set of transaction. Initial work starts with the evaluation of weighted association rule mining in terms of item-value relational metrics. Then the number of item metrics is taken into account of the association rule mining with reduced candidate item set. This may decrease not only the number of item sets generated but also the overall execution time of the algorithm. Any valued attribute will be treated as item-value relational metrics and will be used to derive the minimal number of association rules which increased the rules information content. The research work evaluates the scalability of the FOSSP (say for car purchase dataset and bank transaction data set) by considering transaction time, number of item sets used in the transaction and memory utilization. In addition, further the work moves in the direction of fuzzy based item value of the rule mining principle on associative rules of the complete item sets. To evaluate the item-value relativity metric of the scalable association mining, optimal search on parallel pruning is planned for deployment as it can hold more number of associative information. V. Scalable Association Rule Mining Using Parallel Pruning FOSSP presented in this work, first analyze the scalability issues of association rule mining in large data sets. Parallel pruning technique is deployed in FOSSP to mine the large transactional items simultaneously at different levels of items sets to improve the execution speed for generating frequent items and association rules. The enhancement of Apriori algorithm is done by increasing the efficiency of candidate pruning phase by reducing the number of candidates that are generated for further verification. The FOSSP pruning technique use information associated to the number of items to estimate overlap items in the transactions. The basic elements considered in the development of the FOSSP are number of transactions, average size of transaction, average size of the maximal large item sets, number of items, and distribution of occurrences of large item sets. The parallel pruning in FOSSP provides improvement over Apriori by generating frequent items and rules for transaction data. It generates all candidates based on n-level frequent item sets on sorted database, and all frequent item sets that can no longer be supported by transactions that still have to be processed. Thus the FOSSP has no longer to maintain the covers of all past item sets sequentially. The algorithm for parallel pruning technique to generate informative rules and strong frequent items is presented as below: 5.1 Framework of FOSSP Algorithm Input: Number of Transactions and items, larger data sets Output: Candidate items, number of informative rules, frequent items, execution time Steps of Procedures a. Initialize number of items and transactions from large data sets b. Generate candidate item sets with information requirement c. Reduce the candidate item with relative item values d. With probability ratio, generate frequent item sets (i.e., satisfy minimum support) e. Parallel prune the frequent items at different levels of the item set f. With conditional probability on parallel pruned item levels, generate strong association rules. g. Calculate execution time of frequent item set and informative association rules h. Sort the item sets based on the frequency and information association
  • 3. Efficient Parallel Pruning of Associative Rules with Optimized Search www.iosrjournals.org 28 | Page i. Merge the more associated rules of item pairs j. Discard the infrequent item value pairs k. Perform Fuzzy Parallel Pruning (PP) l. Iterate the steps c to f till the required scalability mining results are achieved Fuzzy PP algorithm: For each t Є T Search the whole Transaction and return all the items Membership Function (mF) = {a Є A | 0 ≤ a ≤ 1}mF = 1 ; 0 ≤ a ≤ 1; mF = 0; otherwise Perform mapping function End Where T - total transaction, t - transaction instance, A - complete item set, a - items of transaction instance. For B = (y1, y2,…yn) fuzzy set (B, n) = {n(y1)/y1,…n(yn)/ yn} Scan the transformed database Evaluate the support with the predefined Min Support value. End Where B – candidate item set, y1,y1..yn – frequent item set of transaction instances, n – number of instances. In FOSSP the candidate item reduction object is updated in the iteration to determine the processing items. In the Apriori association mining algorithm, the data item read, needs to be matched against all candidates to determine the set of candidates whose counts will be incremented. It is not possible to statically partition the reduction object so that different process update disjoint portions of the collection which made parallel pruning in FOSSP more efficient. However as the pruning transaction item is more concerned in parallel, the search space for frequent item generation and item-value pair based maximal information sensitive association rules becomes complex. To overcome these facts, in the next chapter, the optimization of search space using fuzzy rule set, is described. VI. Optimization Of Search Space Using Fuzzy Rule Set The traditional fuzzy ARM exploits a data-driven pre-processing approach which makes routine to the formation of fuzzy partitions for numerical attributes. Therefore, it converts the given data set to fuzzy data set that desires a lesser amount of human communication for even very large datasets. Numerical attributes in the real data sets are converted to fuzzy sets which comprises of split data sets with boundary limits. The item values in the split boundaries can have the uncertainty factor which affects the quality and accuracy of fuzzy association rule mining. In addition the search space using fuzzy modeled association rule mining needs larger memory to accommodate larger transactions data sets. The FOSSP presented in this work, which improves parallel pruning technique is described in chapter 5. FOSSP utilize fuzzy rule controlled feedback scheme to optimize the search space for more effective association rule generation. The following section describes about various techniques to evaluate the scalability of association rule mining and the resultant optimal search space for efficient item pruning 6.1 Partitioning Fuzzy Domain Set In presenting the optimal search space approach for fuzzy association rule mining process, fuzzy partition domains are made based on the user defined item-value attribute on the original dataset. To evaluate the fuzzy data set for informative association rule mining, support and confidence metrics are redefined based on the fuzzy binary attributes. The generation of fuzzy association rules is directly impacted by the fuzzy measures adapted in the parallel pruning approach. The dataset is logically divided into „p‟ disjoint horizontal partitions P1, P2… Pp. Each partition is as large as can fit in available optimal memory space. The partitions are equal-sized, though each partition could be of any arbitrary size as well. 6.2 Optimal Search Fuzzy Feedback scheme for Informative Rule Generation The optimal search space with fuzzy for association rule mining deployed iterative feedback on the rule set generation. The parallel pruning of multi-level item set is split with fuzzy data set to obtain the rules from respective partitioned domain, whereas the feedback scheme gets into each partitioned domain. Within the partitioned domain, the initial rules generated for item value attributes that are governed by the optimal search based feedback scheme to identify the sensitivity of fuzzy binary value in one domain to other. The optimal fuzzy feedback scheme minimizes the number of rules being generated in each and every partitioned domain of multiple outliers which are divided into groups.
  • 4. Efficient Parallel Pruning of Associative Rules with Optimized Search www.iosrjournals.org 29 | Page VII. Experimental Results And Discussions on FOSSP The experimental evaluation of FOSSP on identifying the results of performance metrics such as scalability, search space optimality, informative associative rules sets, and candidate set reduction. The scalability evaluation is made on the size of the data set used and its pruning time for generating frequent items and association rule sets with deployment of parallel pruning of multi-level item sets simultaneously. The optimality of search space for parallel pruning is measured by varying large items using fuzzy rule appropriation. For experimental purpose on the scalability issue, the samples for banking data set obtained from the local governmental banking streams with size of transaction data with Giga Bytes (GBs) is used . The total number of distinct items was 1000 and the average number of items in a transaction was 15. Fig. 1. Comparision of execution time with FOSSP and Fuzzy Apriori The confidence value of 90% and support value of 50% is given as an input. Normally, when the number of iterations for item pruning increases then execution time gradually increases. The execution time for parallel pruning is illustrated to evaluate the performance of the proposed technique, compared with the existing Apriori rule generation as shown in Fig.1. In General, when the data size for item pruning increases then execution time gradually increases. The scalability performance of FOSSP shows 2 times faster execution time compared to that of fuzzy Apriori models. Though the performance of scalability is considerably higher for parallel pruning, the execution time requirement increases with the growth in the size of unique items as shown in Fig.2. Fig. 2. Scalability evaluation with FOSSP and Fuzzy Apriori Usually, when the item set for pruning increases, the search space also gradually increase. Further datasets from machine learning repository (Car Purchase Data set, Bank transaction data set) are extracted and enhanced with data size to GBs with more number of unique items. The performance of FOSSP in terms of scalability as well as the search space requirements at each of these data sets is depicted as in Fig.3. The optimal value of memory for search space and the maximum size of the data set, minimal number of rule generation covering most possible information of the data set, and candidate set reduction are evaluated. The car dataset with 20 distinct items, where the average number of items per transaction is 6 to 8 are used for the experimental evaluation of FOSSP. The total size of the dataset is 2 GBs and a confidence level(C) of 90% is used. The support counts testified with the transaction for frequent item pruning are 70%, 85%, 93%, and 62%. The execution time is improved for FOSSP with reduction of 2 to 4 times as compared to fuzzy Apriori and the memory utilization reduced nearly 2 to 3 times for the data size of 2 GB Car purchase data set. With experimental result on the car purchase data set, the performance of FOSSP is improved when compared to Fuzzy Apriori. No of Iteration Vs Execution time 0 20 40 60 80 100 120 140 8 -16 16 - 32 32 - 48 48 - 60 Number of Iterations ExecutionTime Execution time - FOSSP (Proposed) Execution time - Fuzzy Apriori (Existing) Data size Vs Time 0 5000 10000 15000 20000 25000 30000 35000 40000 1 2 3 4 5 6 7 8 9 Data size (MB) Time(sec) Time for fuzzy based apriori (Existing) Time for FOSSP (Proposed)
  • 5. Efficient Parallel Pruning of Associative Rules with Optimized Search www.iosrjournals.org 30 | Page Fig. 3. Comparision of search space with FOSSP and Fuzzy Apriori The performance results of FOSSP approach are evaluated with various values of support(S) ranging from 25% to 40%. It is concluded from the observation of the results that the proposed FOSSP approach derives effective item-value pair based strong association rule with optimal search space performs 25% faster than fuzzy adapted variants of Apriori(Fig.3), based on the user defined support value. With other dataset samples, the support value is approximated for 34%, in which optimal number of item sets is generated. From these experiments, it is observed that the FOSSP approach performs most efficiently (more accurate rules) and speedily at the optimal support value, which occurs in the range of 15% - 20% for car dataset. Another purpose was to reduce the number of parallel pruning to the data transaction partitions in FOSSP with just one partition for support values of 20% – 40% on car data sets and 10% – 40% on bank data set, keeping in mind that the main memory is utilized in the best manner possible, without any thrashing. Furthermore, with the fuzzy based optimal search feedback scheme, it was observed that more informative rules for all the attributes with more sensitive frequent item have been occurred. VIII. Conclusion The fuzzy based optimal search pruning technique presented in this research work evaluated frequent items with more sensitive item-value pairs. The rule obtained with FOSSP generated appropriate candidate item set that contributes to the improvement of extracting maximal informative association rules from the large transactional data sets. Parallel pruning of item sets at multiple levels of the complete items (one item set, two item sets, … n item sets) decreased the execution time of the FOSSP rule mining, as the frequent items for all the levels obtained simultaneously. Fuzzy rule is modeled to function parallel pruning with optimal search space and reduced the trade off between scalability of data sets and the search space for larger items. In fuzzy Apriori, search space for pruning gets increased as for larger data set which affected the performance of association rule mining; however, FOSSP provided optimal search size for larger data sets. The experimental results shows that FOSSP works better in terms of time reduction when contrast to fuzzy Apriori model. References [1] Ken Sun and Fengshan Bai “Mining Weighted Association Rules without Pre assigned Weights “. IEEE Transactions on Knowledge and Data Engineering, Vol. 20, No. 4, pp. 489-495, April 2008. [2] Jens Teubner . Rene Mueller, and Gustavo Alonso “Frequent Item Computation on a Chip”, IEEE Transactions on Knowledge and Data Engineering, Vol. 23, No. 8, pp 1169-1181, August 2011. [3] Zhaonian Zou, Jianzhong Li, Hong Gao, and Shuo Zhang, “Mining Frequent Subgraph Patterns from Uncertain Graph Data”, IEEE Transactions on Knowledge and Data Engineering, Vol. 22, No. 9,pp 1203-1218, September 2010. [4] Claudia Marinica and Fabrice Guillet “Knowledge-Based Interactive Postmining of Association Rules Using Ontologies”, IEEE Transactions on Knowledge and Data Engineering, Vol. 22, No. 6, pp. 784-797, June 2010. [5] Alok Sharma, and K. Kuldip Paliwal, “Rotational Linear Discriminant Analysis Technique for Dimensionality Reduction”, IEEE Transactions on Knowledge and Data Engineering, Vol. 20, No. 10, pp 1336-1347, October 2008. [6] Elena Baralis, Tania Cerquitelli, and Silvia Chiusano, “IMine: Index Support for Item Set Mining”, IEEE Transactions on Knowledge and Data Engineering, Vol. 21, No. 4, pp. 493-506, April 2009. [7] Hüllermeier E, Y. Yi, “Defense of Fuzzy Association Analysis”, IEEE Transactions on Systems, Man, and Cybernetics - Part B: Cybernetics, Vol. 37, No.4, pp.1039- 1043, July 2007. [8] Verlinde H, M. De Cock, R. Boute, “Fuzzy Versus Quantitative Association Rules: A Fair Data-Driven Comparison”, IEEE Transactions on Systems, Man, and Cybernetics - Part B: Cybernetics, Vol. 36, No. 3, pp. 679-683, June 2006. Number of Itemsets Vs Search Space 0 2000 4000 6000 8000 10000 12000 24 32 36 42 Number of Itemsets SearchSpace(Bytes) Search Space (FOSSP) Search Space (Fuzzy Apriori)