Efficient Mining of Discriminating Relationships Among Attributes Involving Arithmetic Operations. (31st July 2014)
- Record Type:
- Journal Article
- Title:
- Efficient Mining of Discriminating Relationships Among Attributes Involving Arithmetic Operations. (31st July 2014)
- Main Title:
- Efficient Mining of Discriminating Relationships Among Attributes Involving Arithmetic Operations
- Authors:
- Duan, Lei
Dong, Guozhu
Wang, Xianming
Tang, Changjie
- Abstract:
- Contrast patterns describe differences between two or more data sets or data classes; they have been proven to be useful for solving many kinds of problems, such as building accurate classifiers, defining clustering quality measures, and analyzing disease subtypes. This article investigates the mining of a new kind of contrast pattern, namely discriminating inter‐attribute functions (DIFs), which represent arithmetic‐expression‐based inter‐attribute relationships that distinguish classes of data. DIFs are an expressive and practical alternative to item‐based contrast patterns and can express discriminating relationships such as "weight/(height)^2 is more likely to be ≤25 in one class than in another class." Besides introducing the DIF mining problem, this article makes theoretical and algorithmic contributions on the problem. We prove that DIF mining is MAX SNP‐hard. Regarding how to efficiently mine DIFs, we present a set of rules to prune the search space of arithmetic expressions by eliminating redundant ones (equivalent to some others). We give two algorithms: one for finding all DIFs satisfying given thresholds and another for finding certain optimal DIFs using genetic computation techniques. The former is useful when the number of attributes is small, whereas the latter is useful when that number is large; both use the redundant arithmetic‐expression pruning rules. A performance study shows that our techniques are effective and efficient for finding DIFs.
- Is Part Of:
- Computational intelligence. Volume 32:Number 1(2016)
- Journal:
- Computational intelligence
- Issue:
- Volume 32:Number 1(2016)
- Issue Display:
- Volume 32, Issue 1 (2016)
- Year:
- 2016
- Volume:
- 32
- Issue:
- 1
- Issue Sort Value:
- 2016-0032-0001-0000
- Page Start:
- 102
- Page End:
- 126
- Publication Date:
- 2014-07-31
- Subjects:
- contrast mining -- contrast pattern -- discriminating arithmetic function -- gene expression programming
Artificial intelligence -- Periodicals
Computational linguistics -- Periodicals
006.3
- Journal URLs:
- http://www.blackwellpublishing.com/journal.asp?ref=0824-7935&site=1
http://onlinelibrary.wiley.com/
- DOI:
- 10.1111/coin.12052
- Languages:
- English
- ISSNs:
- 0824-7935
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 3390.595000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
- Ingest File:
- 2097.xml
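
Illustrative note on the abstract's example: the relationship "weight/(height)^2 is more likely to be ≤25 in one class than in another class" can be checked on toy data with a few lines of Python. This is only a minimal sketch, not the article's algorithm: the function names (`support`, `bmi`), the use of a simple difference in per-class frequencies, and all data values are assumptions made for illustration.

```python
# Minimal sketch (hypothetical names and made-up data, not the authors' method):
# measure how often an arithmetic expression over attributes stays at or below
# a threshold in each of two classes.

def support(rows, expr, threshold):
    """Fraction of rows whose expression value is <= threshold."""
    if not rows:
        return 0.0
    hits = sum(1 for row in rows if expr(row) <= threshold)
    return hits / len(rows)


def bmi(row):
    """The abstract's example expression: weight / (height)^2."""
    return row["weight"] / row["height"] ** 2


# Toy records (weight in kg, height in m); values are invented.
class_a = [
    {"weight": 60.0, "height": 1.70},
    {"weight": 70.0, "height": 1.80},
    {"weight": 95.0, "height": 1.75},
    {"weight": 55.0, "height": 1.60},
]
class_b = [
    {"weight": 90.0, "height": 1.70},
    {"weight": 100.0, "height": 1.80},
    {"weight": 68.0, "height": 1.75},
    {"weight": 85.0, "height": 1.65},
]

support_a = support(class_a, bmi, 25)
support_b = support(class_b, bmi, 25)

# A large gap suggests the expression/threshold pair separates the classes.
print(f"class A: {support_a:.2f}, class B: {support_b:.2f}, "
      f"gap: {support_a - support_b:.2f}")
```

On this toy data the condition holds for three of four records in class A and one of four in class B. How the article itself scores and thresholds such expressions is not stated in this record, so the gap used here should be read as a stand-in.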