Bayes classifiers for imbalanced traffic accidents datasets. (March 2016)
- Record Type:
- Journal Article
- Title:
- Bayes classifiers for imbalanced traffic accidents datasets. (March 2016)
- Main Title:
- Bayes classifiers for imbalanced traffic accidents datasets
- Authors:
- Mujalli, Randa Oqab
López, Griselda
Garach, Laura - Abstract:
- Highlights: Different Bayes classifiers and balancing techniques were used. Traffic accidents severity in urban and suburban areas was analyzed. Variables associated with killed and severe injury were determined. Classifiers performance improved when using oversampling technique with BNs. Abstract: Traffic accidents data sets are usually imbalanced, where the number of instances classified under the killed or severe injuries class (minority) is much lower than those classified under the slight injuries class (majority). This, however, supposes a challenging problem for classification algorithms and may cause obtaining a model that well cover the slight injuries instances whereas the killed or severe injuries instances are misclassified frequently. Based on traffic accidents data collected on urban and suburban roads in Jordan for three years (2009–2011); three different data balancing techniques were used: under-sampling which removes some instances of the majority class, oversampling which creates new instances of the minority class and a mix technique that combines both. In addition, different Bayes classifiers were compared for the different imbalanced and balanced data sets: Averaged One-Dependence Estimators, Weightily Average One-Dependence Estimators, and Bayesian networks in order to identify factors that affect the severity of an accident. The results indicated that using the balanced data sets, especially those created using oversampling techniques, with BayesianHighlights: Different Bayes classifiers and balancing techniques were used. Traffic accidents severity in urban and suburban areas was analyzed. Variables associated with killed and severe injury were determined. Classifiers performance improved when using oversampling technique with BNs. Abstract: Traffic accidents data sets are usually imbalanced, where the number of instances classified under the killed or severe injuries class (minority) is much lower than those classified under the slight injuries class (majority). This, however, supposes a challenging problem for classification algorithms and may cause obtaining a model that well cover the slight injuries instances whereas the killed or severe injuries instances are misclassified frequently. Based on traffic accidents data collected on urban and suburban roads in Jordan for three years (2009–2011); three different data balancing techniques were used: under-sampling which removes some instances of the majority class, oversampling which creates new instances of the minority class and a mix technique that combines both. In addition, different Bayes classifiers were compared for the different imbalanced and balanced data sets: Averaged One-Dependence Estimators, Weightily Average One-Dependence Estimators, and Bayesian networks in order to identify factors that affect the severity of an accident. The results indicated that using the balanced data sets, especially those created using oversampling techniques, with Bayesian networks improved classifying a traffic accident according to its severity and reduced the misclassification of killed and severe injuries instances. On the other hand, the following variables were found to contribute to the occurrence of a killed causality or a severe injury in a traffic accident: number of vehicles involved, accident pattern, number of directions, accident type, lighting, surface condition, and speed limit. This work, to the knowledge of the authors, is the first that aims at analyzing historical data records for traffic accidents occurring in Jordan and the first to apply balancing techniques to analyze injury severity of traffic accidents. … (more)
- Is Part Of:
- Accident analysis and prevention. Volume 88(2016)
- Journal:
- Accident analysis and prevention
- Issue:
- Volume 88(2016)
- Issue Display:
- Volume 88, Issue 2016 (2016)
- Year:
- 2016
- Volume:
- 88
- Issue:
- 2016
- Issue Sort Value:
- 2016-0088-2016-0000
- Page Start:
- 37
- Page End:
- 51
- Publication Date:
- 2016-03
- Subjects:
- Bayesian networks -- Traffic accidents -- Urban area -- Imbalanced data set -- SMOTE
Accidents -- Prevention -- Periodicals
Accident Prevention -- Periodicals
Accidents -- Prévention -- Périodiques
363.106 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00014575 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.aap.2015.12.003 ↗
- Languages:
- English
- ISSNs:
- 0001-4575
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 0573.130000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 2031.xml