Feature selection for high dimensional imbalanced class data using harmony search. (January 2017)
- Record Type:
- Journal Article
- Title:
- Feature selection for high dimensional imbalanced class data using harmony search. (January 2017)
- Main Title:
- Feature selection for high dimensional imbalanced class data using harmony search
- Authors:
- Moayedikia, Alireza
Ong, Kok-Leong
Boo, Yee Ling
Yeoh, William GS
Jensen, Richard - Abstract:
- Abstract: Misclassification costs of minority class data in real-world applications can be very high. This is a challenging problem especially when the data is also high in dimensionality because of the increase in overfitting and lower model interpretability. Feature selection is recently a popular way to address this problem by identifying features that best predict a minority class. This paper introduces a novel feature selection method call SYMON which uses symmetrical uncertainty and harmony search. Unlike existing methods, SYMON uses symmetrical uncertainty to weigh features with respect to their dependency to class labels. This helps to identify powerful features in retrieving the least frequent class labels. SYMON also uses harmony search to formulate the feature selection phase as an optimisation problem to select the best possible combination of features. The proposed algorithm is able to deal with situations where a set of features have the same weight, by incorporating two vector tuning operations embedded in the harmony search process. In this paper, SYMON is compared against various benchmark feature selection algorithms that were developed to address the same issue. Our empirical evaluation on different micro-array data sets using G-Mean and AUC measures confirm that SYMON is a comparable or a better solution to current benchmarks.
- Is Part Of:
- Engineering applications of artificial intelligence. Volume 57(2016:Sep.)
- Journal:
- Engineering applications of artificial intelligence
- Issue:
- Volume 57(2016:Sep.)
- Issue Display:
- Volume 57 (2016)
- Year:
- 2016
- Volume:
- 57
- Issue Sort Value:
- 2016-0057-0000-0000
- Page Start:
- 38
- Page End:
- 49
- Publication Date:
- 2017-01
- Subjects:
- Feature selection -- Harmony search -- High-dimensionality -- Imbalanced class -- Symmetrical uncertainty
Engineering -- Data processing -- Periodicals
Artificial intelligence -- Periodicals
Expert systems (Computer science) -- Periodicals
Ingénierie -- Informatique -- Périodiques
Intelligence artificielle -- Périodiques
Systèmes experts (Informatique) -- Périodiques
Artificial intelligence
Engineering -- Data processing
Expert systems (Computer science)
Periodicals
620.00285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09521976 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.engappai.2016.10.008 ↗
- Languages:
- English
- ISSNs:
- 0952-1976
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3755.704500
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 2033.xml