An efficient dynamic superset bit-vector approach for mining frequent closed itemsets and their lattice structure. (January 2017)
- Record Type:
- Journal Article
- Title:
- An efficient dynamic superset bit-vector approach for mining frequent closed itemsets and their lattice structure. (January 2017)
- Main Title:
- An efficient dynamic superset bit-vector approach for mining frequent closed itemsets and their lattice structure
- Authors:
- Hashem, Tahrima
Rezaul Karim, Md.
Samiullah, Md.
Farhan Ahmed, Chowdhury - Abstract:
- Highlights: Efficient approach to mine frequent closed itemsets and their lattice structure. We propose a new memory efficient dynamic superset bit-vector (DSBV) structure. New candidate pruning techniques are developed. DSBV establishes hierarchical subset-superset relationship of itemset lattice. Extensive experiments to show scalability and supremacy of our proposed approach. Abstract: Fast discovery of association rules from millions of transactions in a variety of large databases has now become a major challenge in data mining domain. Frequent itemsets and frequent closed itemsets are key sources of mining association rules. Association rules can be mined efficiently from lattice of itemsets. Non-redundancy, accuracy, time efficiency and memory usage are the factors those need to be considered while developing algorithms in order to extract meaningful association rules. In this paper, we propose an efficient bit-vector approach that exploits dual properties, tidset and superset information of an itemset in order to mine frequent closed itemsets with their lattice structure. We introduce a new memory efficient data structure called dynamic superset bit-vector to establish the relationship among frequent closed itemsets in a lattice. The novelty of our approach is that it effectively uses dual data structures called a dynamic bit-vector and a dynamic superset bit-vector jointly in order to reduce the search space and eliminate the generators of non-closed itemsets. OurHighlights: Efficient approach to mine frequent closed itemsets and their lattice structure. We propose a new memory efficient dynamic superset bit-vector (DSBV) structure. New candidate pruning techniques are developed. DSBV establishes hierarchical subset-superset relationship of itemset lattice. Extensive experiments to show scalability and supremacy of our proposed approach. Abstract: Fast discovery of association rules from millions of transactions in a variety of large databases has now become a major challenge in data mining domain. Frequent itemsets and frequent closed itemsets are key sources of mining association rules. Association rules can be mined efficiently from lattice of itemsets. Non-redundancy, accuracy, time efficiency and memory usage are the factors those need to be considered while developing algorithms in order to extract meaningful association rules. In this paper, we propose an efficient bit-vector approach that exploits dual properties, tidset and superset information of an itemset in order to mine frequent closed itemsets with their lattice structure. We introduce a new memory efficient data structure called dynamic superset bit-vector to establish the relationship among frequent closed itemsets in a lattice. The novelty of our approach is that it effectively uses dual data structures called a dynamic bit-vector and a dynamic superset bit-vector jointly in order to reduce the search space and eliminate the generators of non-closed itemsets. Our proposed approach efficiently builds up the subset-superset relationship among the closed itemsets of the lattice structure in a bottom-up manner. Extensive experiments using real-life datasets and pervasive performance comparison with the existing works prove the efficiency and scalability of our proposed approach. … (more)
- Is Part Of:
- Expert systems with applications. Volume 67(2017)
- Journal:
- Expert systems with applications
- Issue:
- Volume 67(2017)
- Issue Display:
- Volume 67, Issue 2017 (2017)
- Year:
- 2017
- Volume:
- 67
- Issue:
- 2017
- Issue Sort Value:
- 2017-0067-2017-0000
- Page Start:
- 252
- Page End:
- 271
- Publication Date:
- 2017-01
- Subjects:
- Data mining -- Association rules -- Frequent closed itemset -- Frequent closed itemset lattice
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.eswa.2016.09.023 ↗
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 7541.xml