A novel approach for effective web page classification. (1st January 2013)
- Record Type:
- Journal Article
- Title:
- A novel approach for effective web page classification. (1st January 2013)
- Main Title:
- A novel approach for effective web page classification
- Authors:
- Mangai, J. Alamelu
Kumar, V. Santhosh
Balamurugan, S. Appavu - Abstract:
- With the exponential increase in volume of the WWW every day, web page classification has become tedious. Since with no quality data there is no quality mining results, it is worth to emphasise on fine tuning the data for classification, rather than improving the classifiers themselves. This paper investigates the methods for improving web page classification by feature extraction, selection and data tuning. This paper also proposes a new classification model for web page classification called a probabilistic web page classifier (PWPC). It is based on a probabilistic framework and attribute-value similarity measure (AVS). The proposed method is tested on a benchmarking dataset, WebKB and the performance of PWPC on the fine tuned web pages has exhibited significant accuracy over the traditional machine learning classifiers.
- Is Part Of:
- International journal of data mining, modelling and management. Volume 5:Number 3(2013)
- Journal:
- International journal of data mining, modelling and management
- Issue:
- Volume 5:Number 3(2013)
- Issue Display:
- Volume 5, Issue 3 (2013)
- Year:
- 2013
- Volume:
- 5
- Issue:
- 3
- Issue Sort Value:
- 2013-0005-0003-0000
- Page Start:
- 233
- Page End:
- 245
- Publication Date:
- 2013-01-01
- Subjects:
- feature selection -- data tuning -- web page classification -- machine learning -- WebKB
Data mining -- Periodicals
Information science -- Periodicals
Databases -- Periodicals
005.7 - Journal URLs:
- http://www.inderscience.com/jhome.php?jcode=ijdmmm ↗
http://www.inderscience.com/ ↗ - Languages:
- English
- ISSNs:
- 1759-1163
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 8532.xml