An empirical study on the effect of data sparsity and data overlap on cross domain collaborative filtering performance. (15th December 2017)
- Record Type:
- Journal Article
- Title:
- An empirical study on the effect of data sparsity and data overlap on cross domain collaborative filtering performance. (15th December 2017)
- Main Title:
- An empirical study on the effect of data sparsity and data overlap on cross domain collaborative filtering performance
- Authors:
- Hwangbo, Hyunwoo
Kim, Yangsok - Abstract:
- Highlights: Suggest various cross-domain collaborative filtering approaches. Evaluate them with a large fashion company comprised of three domains. Demonstrate the recommendation score-based approach is prominent over others. Abstract: In the present day, the oversaturation of data has complicated the process of finding information from a data source. Recommender systems aim to alleviate this problem in various domains by actively suggesting selective information to potential users based on their personal preferences. Amongst these approaches, collaborative filtering based recommenders (CF recommenders), which make use of users' implicit and explicit ratings for items, are widely regarded as the most successful type of recommender system. However, CF recommenders are sensitive to issues caused by data sparsity, where users rate very few items, or items receive very few ratings from users, meaning there is not enough data to give a recommendation. The majority of studies have attempted to solve these issues by focusing on developing new algorithms within a single domain. Recently, cross-domain recommenders that use multiple domain datasets have attracted increasing attention amongst the research community. Cross-domain recommenders assume that users who express their preferences in one domain (called the target domain) will also express their preferences in another domain (called the source domain), and that these additional preferences will improve precision and recall ofHighlights: Suggest various cross-domain collaborative filtering approaches. Evaluate them with a large fashion company comprised of three domains. Demonstrate the recommendation score-based approach is prominent over others. Abstract: In the present day, the oversaturation of data has complicated the process of finding information from a data source. Recommender systems aim to alleviate this problem in various domains by actively suggesting selective information to potential users based on their personal preferences. Amongst these approaches, collaborative filtering based recommenders (CF recommenders), which make use of users' implicit and explicit ratings for items, are widely regarded as the most successful type of recommender system. However, CF recommenders are sensitive to issues caused by data sparsity, where users rate very few items, or items receive very few ratings from users, meaning there is not enough data to give a recommendation. The majority of studies have attempted to solve these issues by focusing on developing new algorithms within a single domain. Recently, cross-domain recommenders that use multiple domain datasets have attracted increasing attention amongst the research community. Cross-domain recommenders assume that users who express their preferences in one domain (called the target domain) will also express their preferences in another domain (called the source domain), and that these additional preferences will improve precision and recall of recommendations to the user. The purpose of this study is to investigate the effects of various data sparsity and data overlap issues on the performance of cross-domain CF recommenders, using various aggregation functions. In this study, several different cross-domain recommenders were created by collecting three datasets from three separate domains of a large Korean fashion company and combining them with different algorithms and different aggregation approaches. The cross-recommenders that used high performance, high overlap domains showed significant improvement of precision and recall of recommendation when the recommendation scores of individual domains were combined using the summation aggregation function. However, the cross-recommenders that used low performance, low overlap domains showed little or no performance improvement in all areas. This result implies that the use of cross-domain recommenders do not guarantee performance improvement, rather that it is necessary to consider relevant factors carefully to achieve performance improvement when using cross-domain recommenders. … (more)
- Is Part Of:
- Expert systems with applications. Volume 89(2017)
- Journal:
- Expert systems with applications
- Issue:
- Volume 89(2017)
- Issue Display:
- Volume 89, Issue 2017 (2017)
- Year:
- 2017
- Volume:
- 89
- Issue:
- 2017
- Issue Sort Value:
- 2017-0089-2017-0000
- Page Start:
- 254
- Page End:
- 265
- Publication Date:
- 2017-12-15
- Subjects:
- Cross-domain recommender -- Collaborative filtering -- E-commerce application -- Data sparsity
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.eswa.2017.07.041 ↗
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 4634.xml