Deployment of a change‐level software defect prediction solution into an industrial setting. Issue 11 (24th September 2021)
- Record Type:
- Journal Article
- Title:
- Deployment of a change‐level software defect prediction solution into an industrial setting. Issue 11 (24th September 2021)
- Main Title:
- Deployment of a change‐level software defect prediction solution into an industrial setting
- Authors:
- Eken, Beyza
Tufan, Selda
Tunaboylu, Alper
Guler, Tevfik
Atar, Rifat
Tosun, Ayse - Abstract:
- Abstract: Applying change‐level software defect prediction (SDP) in practice has several challenges regarding model validation techniques, data accuracy, and prediction performance consistency. A few studies report on these challenges in an industrial context. We share our experience in integrating an SDP into an industrial context. We investigate whether an "offline" SDP could reflect its "online" (real‐life) performance, and other deployment decisions: the model re‐training process and update period. We employ an online prediction strategy by considering the actual labels of training commits at the time of prediction and compare its performance against an offline prediction. We empirically assess the online SDP's performance with various lengths of the time gap between the train and test set and model update periods. Our online SDP's performance could successfully reach its offline performance. The time gap between the train and test commits, and model update period significantly impacts the online performance by 37% and 18% in terms of probability of detection (pd), respectively. We deploy the best SDP solution (73% pd) with an 8‐month time gap and a 3‐day update period. Contextual factors may determine the model performance in practice, its consistency, and trustworthiness. As future work, we plan to investigate the reasons for fluctuations in model performance over time. Abstract : We share our experience in integrating a change‐level software defect prediction (SDP)Abstract: Applying change‐level software defect prediction (SDP) in practice has several challenges regarding model validation techniques, data accuracy, and prediction performance consistency. A few studies report on these challenges in an industrial context. We share our experience in integrating an SDP into an industrial context. We investigate whether an "offline" SDP could reflect its "online" (real‐life) performance, and other deployment decisions: the model re‐training process and update period. We employ an online prediction strategy by considering the actual labels of training commits at the time of prediction and compare its performance against an offline prediction. We empirically assess the online SDP's performance with various lengths of the time gap between the train and test set and model update periods. Our online SDP's performance could successfully reach its offline performance. The time gap between the train and test commits, and model update period significantly impacts the online performance by 37% and 18% in terms of probability of detection (pd), respectively. We deploy the best SDP solution (73% pd) with an 8‐month time gap and a 3‐day update period. Contextual factors may determine the model performance in practice, its consistency, and trustworthiness. As future work, we plan to investigate the reasons for fluctuations in model performance over time. Abstract : We share our experience in integrating a change‐level software defect prediction (SDP) model into an industrial context. We empirically investigate whether an 'offline' SDP could reflect its 'online' (real‐life) performance, and other deployment decisions: The model re‐training process and update period. Our online SDP's performance could successfully reach its offline performance. The time gap between the train and test commits, and model update period significantly impact the online performance by 37% and 18% in terms of probability of detection, respectively. … (more)
- Is Part Of:
- Journal of software. Volume 33:Issue 11(2021)
- Journal:
- Journal of software
- Issue:
- Volume 33:Issue 11(2021)
- Issue Display:
- Volume 33, Issue 11 (2021)
- Year:
- 2021
- Volume:
- 33
- Issue:
- 11
- Issue Sort Value:
- 2021-0033-0011-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2021-09-24
- Subjects:
- change‐level defect prediction -- deployment -- industrial case study -- online prediction
Software engineering -- Periodicals
Computer software -- Development -- Periodicals
Software maintenance -- Periodicals
005.1 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)2047-7481 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/smr.2381 ↗
- Languages:
- English
- ISSNs:
- 2047-7473
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 20593.xml