Balancing sequential data to predict students at-risk using adversarial networks. (July 2021)
- Record Type:
- Journal Article
- Title:
- Balancing sequential data to predict students at-risk using adversarial networks. (July 2021)
- Main Title:
- Balancing sequential data to predict students at-risk using adversarial networks
- Authors:
- Waheed, Hajra
Anas, Muhammad
Hassan, Saeed-Ul
Aljohani, Naif Radi
Alelyani, Salem
Edifor, Ernest Edem
Nawaz, Raheel - Abstract:
- Highlights: This study presents a novel adversarial-based approach to up-sample the sequential data in an educational setting. The proposed method generates new student sequences such that the past behavior of each student is encapsulated in its next sequence. The data from the Open University (UK) is transformed into a sequential format and used as a case study to eliminate the imbalance in students' academic performances. The proposed approach outperforms the conventional state-of-the-art Random Over-sampling and Synthetic Minority Over-sampling techniques with an improved AUC of 7.07% and 6.53%, respectively. Abstract: Class imbalance is a challenging problem especially in a supervised learning setup, as most classification algorithms are designed for balanced class distributions. Although various up-sampling approaches exist for eliminating the class imbalance, however, they do not handle the complexities of sequential data. In this study, using the data of over 30, 000 students from the Open University (UK), we implement a deep-learning-based approach using adversarial networks, Sequential Conditional Generative Adversarial Network (SC-GAN) that encapsulates the past behavior of each student for its previous sequences and generates synthetic student records for the next timestamp. The proposed approach is devised to generate instances, which are augmented with the actual data to eliminate class imbalance. A performance comparison of the proposed SC-GAN with the standardHighlights: This study presents a novel adversarial-based approach to up-sample the sequential data in an educational setting. The proposed method generates new student sequences such that the past behavior of each student is encapsulated in its next sequence. The data from the Open University (UK) is transformed into a sequential format and used as a case study to eliminate the imbalance in students' academic performances. The proposed approach outperforms the conventional state-of-the-art Random Over-sampling and Synthetic Minority Over-sampling techniques with an improved AUC of 7.07% and 6.53%, respectively. Abstract: Class imbalance is a challenging problem especially in a supervised learning setup, as most classification algorithms are designed for balanced class distributions. Although various up-sampling approaches exist for eliminating the class imbalance, however, they do not handle the complexities of sequential data. In this study, using the data of over 30, 000 students from the Open University (UK), we implement a deep-learning-based approach using adversarial networks, Sequential Conditional Generative Adversarial Network (SC-GAN) that encapsulates the past behavior of each student for its previous sequences and generates synthetic student records for the next timestamp. The proposed approach is devised to generate instances, which are augmented with the actual data to eliminate class imbalance. A performance comparison of the proposed SC-GAN with the standard up-sampling methods is also presented and the results validate the proposed method with an improved AUC of 7.07% and 6.53%, respectively, when compared with conventional Random Over-sampling and Sythetic Minority Oversampling techniques. Graphical abstract: Image, graphical abstract … (more)
- Is Part Of:
- Computers & electrical engineering. Volume 93(2021)
- Journal:
- Computers & electrical engineering
- Issue:
- Volume 93(2021)
- Issue Display:
- Volume 93, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 93
- Issue:
- 2021
- Issue Sort Value:
- 2021-0093-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-07
- Subjects:
- Students At-Risk -- CGAN -- Class Imbalance -- Sequential Data -- Time-Series -- Sythetic Minority Oversampling technique
Computer engineering -- Periodicals
Electrical engineering -- Periodicals
Electrical engineering -- Data processing -- Periodicals
Ordinateurs -- Conception et construction -- Périodiques
Électrotechnique -- Périodiques
Électrotechnique -- Informatique -- Périodiques
Computer engineering
Electrical engineering
Electrical engineering -- Data processing
Periodicals
Electronic journals
621.302854 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00457906/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compeleceng.2021.107274 ↗
- Languages:
- English
- ISSNs:
- 0045-7906
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.680000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 18882.xml