Mining top-k frequent-regular closed patterns. Issue 21 (30th November 2015)
- Record Type:
- Journal Article
- Title:
- Mining top-k frequent-regular closed patterns. Issue 21 (30th November 2015)
- Main Title:
- Mining top-k frequent-regular closed patterns
- Authors:
- Amphawan, Komate
Lenca, Philippe - Abstract:
- Highlights: Mining top-k frequent-regular closed patterns with minimal length is proposed. A new compact bit-vector representation is designed. An efficient single-pass algorithm is proposed. Abstract: Frequent-regular pattern mining has attracted recently many works. Most of the approaches focus on discovering a complete set of patterns under the user-given support and regularity threshold constraints. This leads to several quantitative and qualitative drawbacks. First, it is often difficult to set appropriate support threshold. Second, algorithms produce a huge number of patterns, many of them being redundant. Third, most of the patterns are of very small size and it is arduous to extract interesting relationship among items. To reduce the number of patterns a common solution is to consider the desired number k of outputs and to mine the top-k patterns. In addition, this approach does not require to set a support threshold. To cope with redundancy and interestingness relationship among items, we suggest to focus on closed patterns and introduce a minimal length constraint. We thus propose to mine the top-k frequent-regular closed patterns with minimal length . An efficient single-pass algorithm, called TFRC-Mine, and a new compact bit-vector representation which allows to prune uninteresting candidate, are designed. Experiments show that the proposed algorithm is efficient to produce longer – non redundant – patterns, and that the new data representation is efficient forHighlights: Mining top-k frequent-regular closed patterns with minimal length is proposed. A new compact bit-vector representation is designed. An efficient single-pass algorithm is proposed. Abstract: Frequent-regular pattern mining has attracted recently many works. Most of the approaches focus on discovering a complete set of patterns under the user-given support and regularity threshold constraints. This leads to several quantitative and qualitative drawbacks. First, it is often difficult to set appropriate support threshold. Second, algorithms produce a huge number of patterns, many of them being redundant. Third, most of the patterns are of very small size and it is arduous to extract interesting relationship among items. To reduce the number of patterns a common solution is to consider the desired number k of outputs and to mine the top-k patterns. In addition, this approach does not require to set a support threshold. To cope with redundancy and interestingness relationship among items, we suggest to focus on closed patterns and introduce a minimal length constraint. We thus propose to mine the top-k frequent-regular closed patterns with minimal length . An efficient single-pass algorithm, called TFRC-Mine, and a new compact bit-vector representation which allows to prune uninteresting candidate, are designed. Experiments show that the proposed algorithm is efficient to produce longer – non redundant – patterns, and that the new data representation is efficient for both computational time and memory usage. … (more)
- Is Part Of:
- Expert systems with applications. Volume 42:Issue 21(2015)
- Journal:
- Expert systems with applications
- Issue:
- Volume 42:Issue 21(2015)
- Issue Display:
- Volume 42, Issue 21 (2015)
- Year:
- 2015
- Volume:
- 42
- Issue:
- 21
- Issue Sort Value:
- 2015-0042-0021-0000
- Page Start:
- 7882
- Page End:
- 7894
- Publication Date:
- 2015-11-30
- Subjects:
- Frequent pattern -- Regular pattern -- Closed pattern -- Bit-vector
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.eswa.2015.06.021 ↗
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 12853.xml