Parallel noise eliminate: A parallel noise elimination algorithm for massive text categorization. Issue 4 (December 2018)
- Record Type:
- Journal Article
- Title:
- Parallel noise eliminate: A parallel noise elimination algorithm for massive text categorization. Issue 4 (December 2018)
- Main Title:
- Parallel noise eliminate: A parallel noise elimination algorithm for massive text categorization
- Authors:
- Hu, Xiaojuan
Liu, Lei
Qiu, Ningjia
Li, Meng
- Abstract:
- Noise data in text is one of the main factors affecting the quality of text categorization. This paper proposes a parallel noise elimination algorithm for massive text categorization, based on principal component analysis and term frequency-inverse document frequency. Five types of noise data that may occur during text categorization are analyzed and summarized. Before categorization, a redundant noise elimination algorithm based on key feature selection handles redundant noise features. During categorization, an error noise detection algorithm handles inaccurate noise features. The proposed method is compared with four other typical noise processing methods at different noise ratios on two common corpora. The results show that the proposed method is feasible and maintains more stable, high classification performance with a lower running time.
- Is Part Of:
- Journal of algorithms & computational technology. Volume 12:Issue 4(2018)
- Journal:
- Journal of algorithms & computational technology
- Issue:
- Volume 12:Issue 4(2018)
- Issue Display:
- Volume 12, Issue 4 (2018)
- Year:
- 2018
- Volume:
- 12
- Issue:
- 4
- Issue Sort Value:
- 2018-0012-0004-0000
- Page Start:
- 342
- Page End:
- 350
- Publication Date:
- 2018-12
- Subjects:
- Massive data -- text categorization -- noise feature reduction -- error feature -- key feature selection -- parallelization
Computer algorithms -- Periodicals
Numerical calculations -- Periodicals
Computer algorithms
Numerical calculations
Periodicals
518.1
- Journal URLs:
- http://act.sagepub.com/
http://www.ingentaconnect.com/content/mscp/jact
http://www.multi-science.co.uk/
- DOI:
- 10.1177/1748301818779047
- Languages:
- English
- ISSNs:
- 1748-3018
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 8934.xml
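
The abstract in this record mentions eliminating redundant noise features through key feature selection based on term frequency-inverse document frequency. As a rough illustration of that general idea only, the sketch below ranks terms by summed TF-IDF weight and discards everything outside the top k. It is not the authors' algorithm: it leaves out principal component analysis and error noise detection, the thread pool merely hints at parallel processing, and the names `term_frequencies`, `select_key_features`, and `drop_redundant_terms` and the sample documents are hypothetical.

```python
import math
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def term_frequencies(tokens):
    """Relative frequency of each term within one tokenized document."""
    counts = Counter(tokens)
    total = len(tokens)
    return {term: n / total for term, n in counts.items()}


def select_key_features(docs, k):
    """Return the k terms with the highest summed TF-IDF weight (hypothetical helper)."""
    # Per-document term frequencies are independent, so they can be mapped in parallel.
    with ThreadPoolExecutor() as pool:
        tfs = list(pool.map(term_frequencies, docs))

    # Document frequency: number of documents containing each term.
    df = Counter(term for tf in tfs for term in tf)
    n_docs = len(docs)

    scores = Counter()
    for tf in tfs:
        for term, freq in tf.items():
            # A term present in every document gets log(1) = 0 and contributes nothing.
            scores[term] += freq * math.log(n_docs / df[term])

    # Sort by descending score, breaking ties alphabetically for determinism.
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    return ranked[:k]


def drop_redundant_terms(docs, keep):
    """Remove every term that is not in the selected key-feature set."""
    keep = set(keep)
    return [[t for t in doc if t in keep] for doc in docs]


# Toy corpus, invented for illustration.
docs = [
    "the match ended with a late goal".split(),
    "the election ended with a recount".split(),
    "the goal of the election reform".split(),
]
key_terms = select_key_features(docs, k=4)
filtered = drop_redundant_terms(docs, key_terms)
```

In this toy corpus the word "the" occurs in every document, so its inverse document frequency is zero and it is never selected. A real pipeline would choose k, or a score threshold, by validation against classification accuracy.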