Heterogeneous fault prediction with cost‐sensitive domain adaptation. (26th January 2018)
- Record Type:
- Journal Article
- Title:
- Heterogeneous fault prediction with cost‐sensitive domain adaptation. (26th January 2018)
- Main Title:
- Heterogeneous fault prediction with cost‐sensitive domain adaptation
- Authors:
- Li, Zhiqiang
Jing, Xiao‐Yuan
Zhu, Xiaoke - Abstract:
- Summary: In the early phases of software testing, projects may have only limited historical defect data. Learning prediction model with such insufficient training data will limit the efficacy of learned predictor. In practice, there are usually many publicly available fault prediction datasets. Recently, heterogeneous fault prediction (HFP) has been proposed. However, existing HFP models do not investigate how to use mixed project data to predict target. Furthermore, defect data are often imbalanced. The imbalanced data distribution of source usually leads to serious misclassification of fault‐prone instances, which will degrade the predictor's performance. Existing HFP methods do not consider the class imbalance problem in the training stages. In this paper, we propose a novel Cost‐sensitive Label and Structure‐consistent Unilateral Projection (CLSUP) approach for HFP. CLSUP can not only make better use of the within‐project and cross‐project data but also alleviate the class imbalance problem by setting different misclassification costs for fault‐prone and non–fault‐prone instances. Extensive experiments on 30 projects demonstrate the effectiveness of CLSUP. Abstract : This paper proposes a novel heterogeneous fault prediction (HFP) approach, named Cost‐sensitive Label‐and‐Structure‐consistent Unilateral Projection (CLSUP). CLSUP is among the first to exploit the mixed‐project (within‐project and cross‐project) data in HFP. It can not only make better use of theSummary: In the early phases of software testing, projects may have only limited historical defect data. Learning prediction model with such insufficient training data will limit the efficacy of learned predictor. In practice, there are usually many publicly available fault prediction datasets. Recently, heterogeneous fault prediction (HFP) has been proposed. However, existing HFP models do not investigate how to use mixed project data to predict target. Furthermore, defect data are often imbalanced. The imbalanced data distribution of source usually leads to serious misclassification of fault‐prone instances, which will degrade the predictor's performance. Existing HFP methods do not consider the class imbalance problem in the training stages. In this paper, we propose a novel Cost‐sensitive Label and Structure‐consistent Unilateral Projection (CLSUP) approach for HFP. CLSUP can not only make better use of the within‐project and cross‐project data but also alleviate the class imbalance problem by setting different misclassification costs for fault‐prone and non–fault‐prone instances. Extensive experiments on 30 projects demonstrate the effectiveness of CLSUP. Abstract : This paper proposes a novel heterogeneous fault prediction (HFP) approach, named Cost‐sensitive Label‐and‐Structure‐consistent Unilateral Projection (CLSUP). CLSUP is among the first to exploit the mixed‐project (within‐project and cross‐project) data in HFP. It can not only make better use of the within‐project and cross‐project data but also alleviate the class imbalance problem by setting different misclassification costs for fault‐prone and non‐fault‐prone instances. Extensive experiments on 30 projects demonstrate the effectiveness of CLSUP. … (more)
- Is Part Of:
- Software testing, verification & reliability. Volume 28:Number 2(2018)
- Journal:
- Software testing, verification & reliability
- Issue:
- Volume 28:Number 2(2018)
- Issue Display:
- Volume 28, Issue 2 (2018)
- Year:
- 2018
- Volume:
- 28
- Issue:
- 2
- Issue Sort Value:
- 2018-0028-0002-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2018-01-26
- Subjects:
- cost‐sensitive learning -- class imbalance -- heterogeneous domain adaptation -- heterogeneous fault prediction -- mixed project -- software quality assurance
Computer software -- Testing -- Periodicals
Computer software -- Verification -- Periodicals
Computer software -- Reliability -- Periodicals
005.14 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/stvr.1658 ↗
- Languages:
- English
- ISSNs:
- 0960-0833
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 8321.457500
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 5848.xml