1. Improving the Quality of Recommendations for Users and Items in the Tail of Distribution
- Author
-
Jie Wang, Jian Cao, Longbing Cao, Liang Hu, Zhiping Gu, and Guandong Xu
- Subjects
Information retrieval ,Computer science ,business.industry ,Small number ,RSS ,02 engineering and technology ,computer.file_format ,Recommender system ,Machine learning ,computer.software_genre ,General Business, Management and Accounting ,Computer Science Applications ,020204 information systems ,Prior probability ,Credibility ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer ,Information Systems ,Factor analysis - Abstract
Short-head and long-tail distributed data are widely observed in the real world. The same is true of recommender systems (RSs), where a small number of popular items dominate the choices and feedback data while the rest only account for a small amount of feedback. As a result, most RS methods tend to learn user preferences from popular items since they account for most data. However, recent research in e-commerce and marketing has shown that future businesses will obtain greater profit from long-tail selling. Yet, although the number of long-tail items and users is much larger than that of short-head items and users, in reality, the amount of data associated with long-tail items and users is much less. As a result, user preferences tend to be popularity-biased. Furthermore, insufficient data makes long-tail items and users more vulnerable to shilling attack. To improve the quality of recommendations for items and users in the tail of distribution, we propose a coupled regularization approach that consists of two latent factor models: C-HMF, for enhancing credibility, and S-HMF, for emphasizing specialty on user choices. Specifically, the estimates learned from C-HMF and S-HMF recurrently serve as the empirical priors to regularize one another. Such coupled regularization leads to the comprehensive effects of final estimates, which produce more qualitative predictions for both tail users and tail items. To assess the effectiveness of our model, we conduct empirical evaluations on large real-world datasets with various metrics. The results prove that our approach significantly outperforms the compared methods.
- Published
- 2017
- Full Text
- View/download PDF