Open Directory Project based universal taxonomy for Personalization of Online (Re)sources. Issue 17 (October 2015)
- Record Type:
- Journal Article
- Title:
- Open Directory Project based universal taxonomy for Personalization of Online (Re)sources. Issue 17 (October 2015)
- Main Title:
- Open Directory Project based universal taxonomy for Personalization of Online (Re)sources
- Authors:
- Ševa, Jurica
Schatten, Markus
Grd, Petra - Abstract:
- Highlights: We examine the overall quality of Open Directory Project as a classifier. We compare several data grouping schemes and evaluate each of them. We show that ODP can be used both as a horizontal as well as vertical classifier. GENERAL grouping scheme yields best results for horizontal classification. Number of documents ( limit models) yield best results for vertical classification. Abstract: Content personalization reflects the ability of content classification into (predefined) thematic units or information domains. Content nodes in a single thematic unit are related to a greater or lesser extent. An existing connection between two available content nodes assumes that the user will be interested in both resources (but not necessarily to the same extent). Such a connection (and its value) can be established through the process of automatic content classification and labeling. One approach for the classification of content nodes is the use of a predefined classification taxonomy. With the help of such classification taxonomy it is possible to automatically classify and label existing content nodes as well as create additional descriptors for future use in content personalization and recommendation systems. For these purposes existing web directories can be used in creating a universal, purely content based, classification taxonomy. This work analyzes Open Directory Project (ODP) web directory and proposes a novel use of its structure and content as the basis forHighlights: We examine the overall quality of Open Directory Project as a classifier. We compare several data grouping schemes and evaluate each of them. We show that ODP can be used both as a horizontal as well as vertical classifier. GENERAL grouping scheme yields best results for horizontal classification. Number of documents ( limit models) yield best results for vertical classification. Abstract: Content personalization reflects the ability of content classification into (predefined) thematic units or information domains. Content nodes in a single thematic unit are related to a greater or lesser extent. An existing connection between two available content nodes assumes that the user will be interested in both resources (but not necessarily to the same extent). Such a connection (and its value) can be established through the process of automatic content classification and labeling. One approach for the classification of content nodes is the use of a predefined classification taxonomy. With the help of such classification taxonomy it is possible to automatically classify and label existing content nodes as well as create additional descriptors for future use in content personalization and recommendation systems. For these purposes existing web directories can be used in creating a universal, purely content based, classification taxonomy. This work analyzes Open Directory Project (ODP) web directory and proposes a novel use of its structure and content as the basis for such a classification taxonomy. The goal of a unified classification taxonomy is to allow for content personalization from heterogeneous sources. In this work we focus on the overall quality of ODP as the basis for such a classification taxonomy and the use of its hierarchical structure for automatic labeling. Due to the structure of data in ODP different grouping schemes are devised and tested to find the optimal content and structure combination for a proposed classification taxonomy as well as automatic labeling processes. The results provide an in-depth analysis of ODP and ODP based content classification and automatic labeling models. Although the use of ODP is well documented, this question has not been answered to date. … (more)
- Is Part Of:
- Expert systems with applications. Volume 42:Issue 17/18(2015)
- Journal:
- Expert systems with applications
- Issue:
- Volume 42:Issue 17/18(2015)
- Issue Display:
- Volume 42, Issue 17/18 (2015)
- Year:
- 2015
- Volume:
- 42
- Issue:
- 17/18
- Issue Sort Value:
- 2015-0042-NaN-0000
- Page Start:
- 6306
- Page End:
- 6314
- Publication Date:
- 2015-10
- Subjects:
- Recommendation systems -- Content personalization -- Automatic content classification -- Automatic content labeling -- Information extraction -- Information retrieval -- Open Directory Project -- Vector Space Modeling -- TF-IDF
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.eswa.2015.04.033 ↗
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 7016.xml