Automatic annotation of online articles based on visual feature classification. (18th July 2011)
- Record Type:
- Journal Article
- Title:
- Automatic annotation of online articles based on visual feature classification. (18th July 2011)
- Main Title:
- Automatic annotation of online articles based on visual feature classification
- Authors:
- Burget, Radek
Burgetova, Ivana
- Abstract:
- When traditional data mining methods are applied to World Wide Web documents, a typical problem is that a web page contains various kinds of information in addition to its main content. This additional information, such as navigation, advertisements or copyright notices, negatively influences the results of data mining methods such as content classification. In this paper, we present a method for detecting interesting areas in a web page. The method is inspired by the approach a human reader is assumed to take to this task. First, basic visual blocks are detected in the page; subsequently, the purpose of these blocks is guessed based on their visual appearance. We describe the page segmentation method used for visual block detection, propose a way of classifying the blocks based on their visual features and, finally, provide an experimental evaluation of the method on real-world data.
- Is Part Of:
- International journal of intelligent information and database systems. Volume 5:Number 4(2011)
- Journal:
- International journal of intelligent information and database systems
- Issue:
- Volume 5:Number 4(2011)
- Issue Display:
- Volume 5, Issue 4 (2011)
- Year:
- 2011
- Volume:
- 5
- Issue:
- 4
- Issue Sort Value:
- 2011-0005-0004-0000
- Page Start:
- 338
- Page End:
- 360
- Publication Date:
- 2011-07-18
- Subjects:
- automatic annotation -- online articles -- page segmentation -- document preprocessing -- visual features -- visual analysis -- data mining -- visual feature classification -- web documents -- online papers -- web pages -- visual block detection -- visual blocks -- document annotation
Database management -- Computer programs -- Periodicals
Information retrieval -- Computer programs -- Periodicals
Information storage and retrieval systems -- Computer programs -- Periodicals
Artificial intelligence -- Periodicals
Expert systems (Computer science) -- Periodicals
Intelligent agents (Computer software) -- Periodicals
006.33
- Journal URLs:
- http://www.inderscience.com/jhome.php?jcode=ijiids
http://www.inderscience.com/
- Languages:
- English
- ISSNs:
- 1751-5858
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
- Ingest File:
- 8684.xml