A comparison of text classification methods using different stemming techniques. (22nd July 2019)
- Record Type:
- Journal Article
- Title:
- A comparison of text classification methods using different stemming techniques. (22nd July 2019)
- Main Title:
- A comparison of text classification methods using different stemming techniques
- Authors:
- Bounabi, Mariem
Moutaouakil, Karim El
Satori, Khalid - Abstract:
- In the retrieval of information, two factors have an important impact on the performance of systems: the extract features and the matching process. In this work, we compare three well-known stemming techniques: Lovins stemmer, iterated Lovins and snowball stemmer. Concerning the classification phase, we compare, experimentally, six methods: BNET, NBMU, CNB, RF, SLogicF, and SVM. Basing on this comparison, we propose a new retrieval system by calling the voting method, as a matching tool, to improve the performance of the classical systems. In this paper, we use the TF-IDF algorithm to extract features. The envisaged systems are tested on two databases: BBCNEWS and BBCSPORT. The systems based on Lovins stemmers and on the voting technique give the best results. In fact, for the first databases, the best accuracy observed is for the system Lovins + Vote with a recognition rate of 97%. Concerning the second database, the system snowball +Vote gives us 99% as recognition rate.
- Is Part Of:
- International journal of computer applications technology. Volume 60:Number 4(2019)
- Journal:
- International journal of computer applications technology
- Issue:
- Volume 60:Number 4(2019)
- Issue Display:
- Volume 60, Issue 4 (2019)
- Year:
- 2019
- Volume:
- 60
- Issue:
- 4
- Issue Sort Value:
- 2019-0060-0004-0000
- Page Start:
- 298
- Page End:
- 306
- Publication Date:
- 2019-07-22
- Subjects:
- NBMU -- SVM -- RF -- NB -- SLogiF -- CNB -- voting technique -- classification -- stemmer -- term-weighting
Technology -- Data processing -- Periodicals
620.00285 - Journal URLs:
- http://www.inderscience.com/jhome.php?jcode=ijcat ↗
http://www.inderscience.com/ ↗ - Languages:
- English
- ISSNs:
- 0952-8091
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 10844.xml