A graph theoretic approach for unsupervised feature selection. (September 2015)
- Record Type:
- Journal Article
- Title:
- A graph theoretic approach for unsupervised feature selection. (September 2015)
- Main Title:
- A graph theoretic approach for unsupervised feature selection
- Authors:
- Moradi, Parham
Rostami, Mehrdad - Abstract:
- Abstract: Feature subset selection is a major problem in data mining which can help to reduce computation time, improve prediction performance, and build understandable models. Specifically, feature selection realized in the absence of class labels, namely unsupervised feature selection, is challenging and interesting. In this paper a novel graph-theoretic approach for unsupervised feature selection has been proposed. The proposed method works in three steps. In the first step, the entire feature set is represented as a weighted graph. In the second step, the features are divided into several clusters using a community detection algorithm and finally in the third step, a novel iterative search strategy based on node centrality is developed to select the final subset of features. The proposed feature selection method offers two major advantages: first, our method groups features into different clusters based on their similarities, in which the features in the same cluster are similar to each other, and to obtain the reduced redundancy set, the final subset of features is selected from different clusters. Second, the node centrality measure and term variance are used to identify the most representative and informative feature subset; hence, the optimal size of the feature subset can be automatically determined. The performance of the proposed method has been compared to those of the state-of-the-art unsupervised and supervised feature selection methods on eight benchmarkAbstract: Feature subset selection is a major problem in data mining which can help to reduce computation time, improve prediction performance, and build understandable models. Specifically, feature selection realized in the absence of class labels, namely unsupervised feature selection, is challenging and interesting. In this paper a novel graph-theoretic approach for unsupervised feature selection has been proposed. The proposed method works in three steps. In the first step, the entire feature set is represented as a weighted graph. In the second step, the features are divided into several clusters using a community detection algorithm and finally in the third step, a novel iterative search strategy based on node centrality is developed to select the final subset of features. The proposed feature selection method offers two major advantages: first, our method groups features into different clusters based on their similarities, in which the features in the same cluster are similar to each other, and to obtain the reduced redundancy set, the final subset of features is selected from different clusters. Second, the node centrality measure and term variance are used to identify the most representative and informative feature subset; hence, the optimal size of the feature subset can be automatically determined. The performance of the proposed method has been compared to those of the state-of-the-art unsupervised and supervised feature selection methods on eight benchmark classification problems. The results show that our method has produced consistently better classification accuracies. Highlights: A novel graph-theoretic approach for unsupervised feature selection is proposed. Our method integrates graph clustering with a novel iterative search strategy. A node centrality measure is used to identify representative and informative features. The size of final feature set is determined automatically. Our method has been compared to the well-known and state-of-the-art methods. … (more)
- Is Part Of:
- Engineering applications of artificial intelligence. Volume 44(2015:Aug.)
- Journal:
- Engineering applications of artificial intelligence
- Issue:
- Volume 44(2015:Aug.)
- Issue Display:
- Volume 44 (2015)
- Year:
- 2015
- Volume:
- 44
- Issue Sort Value:
- 2015-0044-0000-0000
- Page Start:
- 33
- Page End:
- 45
- Publication Date:
- 2015-09
- Subjects:
- Unsupervised feature selection -- Filter method -- Graph clustering -- Node centrality
Engineering -- Data processing -- Periodicals
Artificial intelligence -- Periodicals
Expert systems (Computer science) -- Periodicals
Ingénierie -- Informatique -- Périodiques
Intelligence artificielle -- Périodiques
Systèmes experts (Informatique) -- Périodiques
Artificial intelligence
Engineering -- Data processing
Expert systems (Computer science)
Periodicals
620.00285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09521976 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.engappai.2015.05.005 ↗
- Languages:
- English
- ISSNs:
- 0952-1976
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3755.704500
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 7823.xml