Multi-label Arabic text classification in Online Social Networks. Issue 100 (September 2021)
- Record Type:
- Journal Article
- Title:
- Multi-label Arabic text classification in Online Social Networks. Issue 100 (September 2021)
- Main Title:
- Multi-label Arabic text classification in Online Social Networks
- Authors:
- Omar, Ahmed
Mahmoud, Tarek M.
Abd-El-Hafeez, Tarek
Mahfouz, Ahmed - Abstract:
- Abstract: Online Social Networks (OSNs) are the most popular interactive media for communicating, posting, and sharing indefinite amounts of personal information. However, along with interesting and attractive topics and contents, some users neither like the fact that certain topics that are not among their interests can fill their personal pages nor do they wish to see disappointing negative posts that may appear repeatedly. Also, people sometimes post inappropriate or abusive content on these networks, such as insults or pornography. Most of the efforts in the field of text classification have focused on the English language, while research on the Arabic language, which has numerous challenges is scarce. In this paper, we constructed a standard multi-label Arabic dataset using manual annotation and a semi-supervised annotation technique that can be used for short text classification, sentiment analysis, and multilabel classification. Then, we evaluated the topics classification, sentiment analysis, and multilabel classification. Based on that evaluation we found a relationship between topics published in OSNs and hate speech. The experimental results validate the effectiveness of the proposed technique. Highlights: We construct a standard multi-label Arabic dataset using manual annotation and semi-supervised annotation techniques. We train machine-learning models for topic classification, sentiment analysis, and multilabel classification in OSNs. We examine theAbstract: Online Social Networks (OSNs) are the most popular interactive media for communicating, posting, and sharing indefinite amounts of personal information. However, along with interesting and attractive topics and contents, some users neither like the fact that certain topics that are not among their interests can fill their personal pages nor do they wish to see disappointing negative posts that may appear repeatedly. Also, people sometimes post inappropriate or abusive content on these networks, such as insults or pornography. Most of the efforts in the field of text classification have focused on the English language, while research on the Arabic language, which has numerous challenges is scarce. In this paper, we constructed a standard multi-label Arabic dataset using manual annotation and a semi-supervised annotation technique that can be used for short text classification, sentiment analysis, and multilabel classification. Then, we evaluated the topics classification, sentiment analysis, and multilabel classification. Based on that evaluation we found a relationship between topics published in OSNs and hate speech. The experimental results validate the effectiveness of the proposed technique. Highlights: We construct a standard multi-label Arabic dataset using manual annotation and semi-supervised annotation techniques. We train machine-learning models for topic classification, sentiment analysis, and multilabel classification in OSNs. We examine the relationship between topics published in OSNs and hate speech. We propose a technique to filter social networks contents. … (more)
- Is Part Of:
- Information systems. Issue 100(2021)
- Journal:
- Information systems
- Issue:
- Issue 100(2021)
- Issue Display:
- Volume 100, Issue 100 (2021)
- Year:
- 2021
- Volume:
- 100
- Issue:
- 100
- Issue Sort Value:
- 2021-0100-0100-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-09
- Subjects:
- Arabic natural language processing -- Arabic text classification -- Arabic sentiment analysis
Database management -- Periodicals
Electronic data processing -- Periodicals
Bases de données -- Gestion -- Périodiques
Informatique -- Périodiques
Database management
Electronic data processing
Periodicals
005.7 - Journal URLs:
- http://www.sciencedirect.com/science/journal/03064379 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.is.2021.101785 ↗
- Languages:
- English
- ISSNs:
- 0306-4379
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4496.367300
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 17090.xml