Fast Association discovery in Derivative transaction collections
File version
Author(s)
Shen, Hong
Cheng, Ling
Pritchard, Paul
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
Size
File type(s)
Location
License
Abstract
Association discovery from a transaction collection is an important data-mining task. We study a new problem in this area whose solution can provide users with valuable association rules in some relevant collections: association discovery in derivative transaction collections. In this problem, we are given association rules in two transaction collections D 1 and D 2, and aim to find new association rules in derivative transaction collections D 1/D 2, D 1nD 2, D 2/D 1 and D 1?D 2. Direct application of existing algorithms can solve this problem, but in an expensive way. We propose an efficient solution through making full use of already discovered information, taking advantage of the relationships existing among relevant collections, and avoiding unnecessary but expensive support-counting operations by scanning databases. Experiments on well-known synthetic data show that our solution consistently outperforms the naive solution by factors from 2 to 3 in most cases. We also propose an efficient parallelization of our approach, as parallel algorithms are often interesting and necessary in the area of data mining.
Journal Title
Knowledge and Information Systems
Conference Title
Book Title
Edition
Volume
2
Issue
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement
Item Access Status
Note
Access the data
Related item(s)
Subject
History and Archaeology
Artificial Intelligence and Image Processing
Information Systems