A hybrid feature selection scheme for high-dimensional data. (August 2022)
- Record Type:
- Journal Article
- Title:
- A hybrid feature selection scheme for high-dimensional data. (August 2022)
- Main Title:
- A hybrid feature selection scheme for high-dimensional data
- Authors:
- Ganjei, Mohammad Ahmadi
Boostani, Reza - Abstract:
- Abstract: There is a growing interest in developing feature subset selection schemes for high-dimensional datasets by filter, wrapper, embedded, and hybrid manners. In this paper, we propose a new hybrid (filter-wrapper) feature selection approach. At first, in the filter step, we rank input features according to their relevance with the class label. Afterwards, we apply different clustering methods for the classification of the selected features. We perform an inner and outer cluster ranking based on the primary feature ranking in the next step. Then, different search strategies are performed on the best cluster of features in the wrapper phase. Moreover, we add some of them to the feature set based on the classifiers (nearest neighbor, decision tree, support vector machine, and random forests) feedback. Then, the algorithm goes to the next cluster, and this process is continued till all clusters are met. Finally, we compare the results of the proposed method to the state-of-the-art schemes. Comparison results imply the superiority of the proposed method to the counterparts on eight high-dimensional datasets in terms of accuracy and computational complexity. Highlights: A new hybrid feature selection framework named the HyCluster is proposed. Our algorithm significantly improves the time complexity using feature clustering. It selects relevant and irredundant features and significantly improves accuracy rates. Comparison results imply the superiority of the proposed methodAbstract: There is a growing interest in developing feature subset selection schemes for high-dimensional datasets by filter, wrapper, embedded, and hybrid manners. In this paper, we propose a new hybrid (filter-wrapper) feature selection approach. At first, in the filter step, we rank input features according to their relevance with the class label. Afterwards, we apply different clustering methods for the classification of the selected features. We perform an inner and outer cluster ranking based on the primary feature ranking in the next step. Then, different search strategies are performed on the best cluster of features in the wrapper phase. Moreover, we add some of them to the feature set based on the classifiers (nearest neighbor, decision tree, support vector machine, and random forests) feedback. Then, the algorithm goes to the next cluster, and this process is continued till all clusters are met. Finally, we compare the results of the proposed method to the state-of-the-art schemes. Comparison results imply the superiority of the proposed method to the counterparts on eight high-dimensional datasets in terms of accuracy and computational complexity. Highlights: A new hybrid feature selection framework named the HyCluster is proposed. Our algorithm significantly improves the time complexity using feature clustering. It selects relevant and irredundant features and significantly improves accuracy rates. Comparison results imply the superiority of the proposed method to the counterparts. … (more)
- Is Part Of:
- Engineering applications of artificial intelligence. Volume 113(2022)
- Journal:
- Engineering applications of artificial intelligence
- Issue:
- Volume 113(2022)
- Issue Display:
- Volume 113, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 113
- Issue:
- 2022
- Issue Sort Value:
- 2022-0113-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-08
- Subjects:
- Hybrid feature selection -- High-dimensional datasets -- Wrapper -- Classification -- Clustering -- Microarray
Engineering -- Data processing -- Periodicals
Artificial intelligence -- Periodicals
Expert systems (Computer science) -- Periodicals
Ingénierie -- Informatique -- Périodiques
Intelligence artificielle -- Périodiques
Systèmes experts (Informatique) -- Périodiques
Artificial intelligence
Engineering -- Data processing
Expert systems (Computer science)
Periodicals
620.00285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09521976 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.engappai.2022.104894 ↗
- Languages:
- English
- ISSNs:
- 0952-1976
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3755.704500
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 21819.xml