F-measure curves: A tool to visualize classifier performance under imbalance. (April 2020)
- Record Type:
- Journal Article
- Title:
- F-measure curves: A tool to visualize classifier performance under imbalance. (April 2020)
- Main Title:
- F-measure curves: A tool to visualize classifier performance under imbalance
- Authors:
- Soleymani, Roghayeh
Granger, Eric
Fumera, Giorgio - Abstract:
- Highlights: The F-measure space is proposed to visualize classifier performance. The preference between precision and recall is controlled by a factor. A crisp classifier is represented as a curve for a range of P(+). A soft classifier is shown as the upper envelope of curves of its decision thresholds The proposed Iterative Boolean Combination(IBC) algorithm is optimized in this space. The proposed IBC selects and combines classifiers in this space. Abstract: Learning from imbalanced data is a challenging problem in many real-world machine learning applications due in part to the bias of performance in most classification systems. This bias may exist due to three reasons: (1) Classification systems are often optimized and compared using performance measurements that are unsuitable for imbalance problems; (2) most learning algorithms are designed and tested on a fixed imbalance level of data, which may differ from operational scenarios; (3) the preference of correct classification of classes is different from one application to another. This paper investigates specialized performance evaluation metrics and tools for imbalance problem, including scalar metrics that assume a given operating condition (skew level and relative preference of classes), and global evaluation curves or metrics that consider a range of operating conditions. We focus on the case in which the scalar metric F-measure is preferred over other scalar metrics, and propose a new global evaluation space forHighlights: The F-measure space is proposed to visualize classifier performance. The preference between precision and recall is controlled by a factor. A crisp classifier is represented as a curve for a range of P(+). A soft classifier is shown as the upper envelope of curves of its decision thresholds The proposed Iterative Boolean Combination(IBC) algorithm is optimized in this space. The proposed IBC selects and combines classifiers in this space. Abstract: Learning from imbalanced data is a challenging problem in many real-world machine learning applications due in part to the bias of performance in most classification systems. This bias may exist due to three reasons: (1) Classification systems are often optimized and compared using performance measurements that are unsuitable for imbalance problems; (2) most learning algorithms are designed and tested on a fixed imbalance level of data, which may differ from operational scenarios; (3) the preference of correct classification of classes is different from one application to another. This paper investigates specialized performance evaluation metrics and tools for imbalance problem, including scalar metrics that assume a given operating condition (skew level and relative preference of classes), and global evaluation curves or metrics that consider a range of operating conditions. We focus on the case in which the scalar metric F-measure is preferred over other scalar metrics, and propose a new global evaluation space for the F-measure that is analogous to the cost curves for expected cost. In this space, a classifier is represented as a curve that shows its performance over all of its decision thresholds and a range of possible imbalance levels for the desired preference of true positive rate to precision. Curves obtained in the F-measure space are compared to those of existing spaces (ROC, precision-recall and cost) and analogously to cost curves. The proposed F-measure space allows to visualize and compare classifiers' performance under different operating conditions more easily than in ROC and precision-recall spaces. This space allows us to set the optimal decision threshold of a soft classifier and to select the best classifier among a group. This space also allows to empirically improve the performance obtained with ensemble learning methods specialized for class imbalance, by selecting and combining the base classifiers for ensembles using a modified version of the iterative Boolean combination algorithm that is optimized using the F-measure instead of AUC. Experiments on a real-world dataset for video face recognition show the advantages of evaluating and comparing different classifiers in the F-measure space versus ROC, precision-recall, and cost spaces. In addition, it is shown that the performance evaluated using the F-measure of Bagging ensemble method can improve considerably by using the modified iterative Boolean combination algorithm. … (more)
- Is Part Of:
- Pattern recognition. Volume 100(2020:Apr.)
- Journal:
- Pattern recognition
- Issue:
- Volume 100(2020:Apr.)
- Issue Display:
- Volume 100 (2020)
- Year:
- 2020
- Volume:
- 100
- Issue Sort Value:
- 2020-0100-0000-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-04
- Subjects:
- Pattern classification -- Class imbalance -- Performance metrics -- F-measure -- Visualization tools -- Video face recognition
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2019.107146 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23137.xml