Intelligent detection of hate speech in Arabic social network: A machine learning approach. (August 2021)

Record Type:: Journal Article
Title:: Intelligent detection of hate speech in Arabic social network: A machine learning approach. (August 2021)
Main Title:: Intelligent detection of hate speech in Arabic social network: A machine learning approach
Authors:: Aljarah, Ibrahim
Habib, Maria
Hijazi, Neveen
Faris, Hossam
Qaddoura, Raneem
Hammo, Bassam
Abushariah, Mohammad
Alfawareh, Mohammad
Abstract:: Nowadays, cyber hate speech is increasingly growing, which forms a serious problem worldwide by threatening the cohesion of civil societies. Hate speech relates to using expressions or phrases that are violent, offensive or insulting for a person or a minority of people. In particular, in the Arab region, the number of Arab social media users is growing rapidly, which is accompanied with high increasing rate of cyber hate speech. This drew our attention to aspire healthy online environments that are free of hatred and discrimination. Therefore, this article aims to detect cyber hate speech based on Arabic context over Twitter platform, by applying Natural Language Processing (NLP) techniques, and machine learning methods. The article considers a set of tweets related to racism, journalism, sports orientation, terrorism and Islam. Several types of features and emotions are extracted and arranged in 15 different combinations of data. The processed dataset is experimented using Support Vector Machine (SVM), Naive Bayes (NB), Decision Tree (DT) and Random Forest (RF), in which RF with the feature set of Term Frequency-Inverse Document Frequency (TF-IDF) and profile-related features achieves the best results. Furthermore, a feature importance analysis is conducted based on RF classifier in order to quantify the predictive ability of features in regard to the hate class.
Is Part Of:: Journal of information science. Volume 47:Number 4(2021)
Journal:: Journal of information science
Issue:: Volume 47:Number 4(2021)
Issue Display:: Volume 47, Issue 4 (2021)
Year:: 2021
Volume:: 47
Issue:: 4
Issue Sort Value:: 2021-0047-0004-0000
Page Start:: 483
Page End:: 501
Publication Date:: 2021-08
Subjects:: Hate speech -- machine learning -- text vectorization -- Twitter
Information science -- Periodicals
Information science
Periodicals
020.5
Journal URLs:: http://jis.sagepub.com/archive/ ↗
http://www.ingenta.com/journals/browse/bks/jis?mode=direct ↗
http://www.uk.sagepub.com/home.nav ↗
http://firstsearch.oclc.org ↗
http://firstsearch.oclc.org/journal=0165-5515;screen=info;ECOIP ↗
DOI:: 10.1177/0165551520917651 ↗
Languages:: English
ISSNs:: 0165-5515
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 16695.xml