Accuracy and diversity-aware multi-objective approach for random forest construction. (1st September 2023)
- Record Type:
- Journal Article
- Title:
- Accuracy and diversity-aware multi-objective approach for random forest construction. (1st September 2023)
- Main Title:
- Accuracy and diversity-aware multi-objective approach for random forest construction
- Authors:
- Karabadji, Nour El Islem
Amara Korba, Abdelaziz
Assi, Ali
Seridi, Hassina
Aridhi, Sabeur
Dhifli, Wajdi - Abstract:
- Abstract: Random Forest is an ensemble classification approach. It aims to design a discrete finite group of decision trees constructed based on bootstrap samples and random attribute selection. Random Forests have strong generalization capacities due to the variance in the training and attribute couple subsets used for constructing different decision trees in the forest. However, to construct a robust and effective random forest, two main issues need to be taken into account namely: (1) increasing the accuracy and diversity of decision trees; (2) decreasing the number of decision trees. In this paper, a genetic algorithm-based approach to tackle the aforementioned challenges related to random forest construction is proposed. Three objectives are taken into consideration. First, strengthening the classification accuracy of individual decision trees as well as that of the forest. Second, making use of diversity measures among the decision trees to improve the generalization of the constructed model. Third, minimizing the number of trees in the forest and finding an optimal subset of the random forest. An experimental evaluation on several datasets from the UCI Machine Learning Repository is conducted. The obtained results show that the proposed approach outperforms state-of-the-art classical as well as evolutionary random forest construction methods. Finally, the proposed approach is used to build a reliable random forest model for detecting Botnet traffic in Internet ofAbstract: Random Forest is an ensemble classification approach. It aims to design a discrete finite group of decision trees constructed based on bootstrap samples and random attribute selection. Random Forests have strong generalization capacities due to the variance in the training and attribute couple subsets used for constructing different decision trees in the forest. However, to construct a robust and effective random forest, two main issues need to be taken into account namely: (1) increasing the accuracy and diversity of decision trees; (2) decreasing the number of decision trees. In this paper, a genetic algorithm-based approach to tackle the aforementioned challenges related to random forest construction is proposed. Three objectives are taken into consideration. First, strengthening the classification accuracy of individual decision trees as well as that of the forest. Second, making use of diversity measures among the decision trees to improve the generalization of the constructed model. Third, minimizing the number of trees in the forest and finding an optimal subset of the random forest. An experimental evaluation on several datasets from the UCI Machine Learning Repository is conducted. The obtained results show that the proposed approach outperforms state-of-the-art classical as well as evolutionary random forest construction methods. Finally, the proposed approach is used to build a reliable random forest model for detecting Botnet traffic in Internet of Things environment. Highlights: We propose a genetic algorithm to construct an improved random forest. We design a theoretic solution allowing a smart convergence to the best solutions. We propose a Powerset system to optimize the encoding of the forests in chromosomes. Our objective function optimizes the accuracy and diversity of the trees of models. Application to the detection of Botnet traffic in Internet of Things networks. … (more)
- Is Part Of:
- Expert systems with applications. Volume 225(2023)
- Journal:
- Expert systems with applications
- Issue:
- Volume 225(2023)
- Issue Display:
- Volume 225, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 225
- Issue:
- 2023
- Issue Sort Value:
- 2023-0225-2023-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-09-01
- Subjects:
- Random forest -- Decision tree -- Classification -- Genetic algorithm -- Diversity -- Internet of Things
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.eswa.2023.120138 ↗
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 27066.xml