Imbalanced Learning with Oversampling based on Classification Contribution Degree. Issue 5 (26th March 2021)
- Record Type:
- Journal Article
- Title:
- Imbalanced Learning with Oversampling based on Classification Contribution Degree. Issue 5 (26th March 2021)
- Main Title:
- Imbalanced Learning with Oversampling based on Classification Contribution Degree
- Authors:
- Jiang, Zhenhao
Yang, Jie
Liu, Yan - Abstract:
- Abstract: Imbalanced datasets exist commonly in the real world, which leads to poor performance of general machine learning models because of skewed class distribution. To address the data‐imbalance problem, a novel oversampling method based on classification contribution degree, called OS‐CCD is presented. First a new concept, classification contribution degree, is established based on micro and macro information extracted from raw datasets. With the classification contribution degree, OS‐CCD enables positive samples near the class boundary and located in an area with high density of positive samples to generate more synthetic samples than others. Furthermore, the neighbor selection for oversampling is no longer random but in the light of a selected probability. Experimental results on 12 benchmark datasets substantiate that four commonly used classifiers with the oversampling method outperform those with six popular oversampling methods in terms of accuracy, F1‐score and AUC. Abstract : This paper presents a novel oversampling method based on classification contribution degree which can reflect the importance of minority samples when oversampling. This approach extracts local and global information of distribution of samples and can be used to address the problems of noise interference, sample overlapping, uninformative sampling, and random neighbor selection of SMOTE.
- Is Part Of:
- Advanced theory and simulations. Volume 4:Issue 5(2021)
- Journal:
- Advanced theory and simulations
- Issue:
- Volume 4:Issue 5(2021)
- Issue Display:
- Volume 4, Issue 5 (2021)
- Year:
- 2021
- Volume:
- 4
- Issue:
- 5
- Issue Sort Value:
- 2021-0004-0005-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2021-03-26
- Subjects:
- imbalanced learning -- oversampling -- classification contribution degree -- synthetic minority oversampling technique
Science -- Simulation methods -- Periodicals
Science -- Methodology -- Periodicals
Engineering -- Simulation methods -- Periodicals
Engineering -- Methodology -- Periodicals
507.21 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/adts.202100031 ↗
- Languages:
- English
- ISSNs:
- 2513-0390
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 0696.935575
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 16740.xml