An Efficient Text Classification Scheme Using Clustering. (2016)
- Record Type:
- Journal Article
- Title:
- An Efficient Text Classification Scheme Using Clustering. (2016)
- Main Title:
- An Efficient Text Classification Scheme Using Clustering
- Authors:
- Thomas, Anisha Mariam
Resmipriya, M.G. - Abstract:
- Abstract: Text classification method that uses efficient similarity measures to achieve better performance is being proposed in this paper. Semi-supervised clustering is used as a complementary step to text classification and is used to identify the components in text collection. Clustering makes use of labeled texts to capture silhouettes of text clusters and unlabeled texts to adapt its centroids. The category of each text cluster is labeled by the label of texts in it. Thus here the text clustering is used to generate the classification model for the next text classification step. When a new unlabeled text is incoming, measure its similarity with the centroids of the text clusters and give its label with that of the nearest text cluster. The similarity is calculated using different similarity measures. Results and evaluations are summarized and it is found that the system provides better accuracy when a Similarity Measure for Text Processing (SMTP) used for the distance calculation.
- Is Part Of:
- Procedia technology. Volume 24(2016)
- Journal:
- Procedia technology
- Issue:
- Volume 24(2016)
- Issue Display:
- Volume 24, Issue 2016 (2016)
- Year:
- 2016
- Volume:
- 24
- Issue:
- 2016
- Issue Sort Value:
- 2016-0024-2016-0000
- Page Start:
- 1220
- Page End:
- 1225
- Publication Date:
- 2016
- Subjects:
- Data Mining -- Classification -- Semi-supervised Clustering -- Similarity Measures
Technology -- Congresses
Technology -- Periodicals
Engineering -- Congresses
Engineering -- Periodicals
Engineering
Technology
Conference proceedings
Periodicals
605 - Journal URLs:
- http://www.sciencedirect.com/science/journal/22120173 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.protcy.2016.05.095 ↗
- Languages:
- English
- ISSNs:
- 2212-0173
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 2229.xml