Now, we have a problem, given 10 million users and their preference on X items, how to make recommendation for users by user-based collaborative filtering?
The key step of usercf is to calculate user-user similarity. However, calculate user-user similarity between 10 million users in a 4G RAM computer is impossible. Thus, we can only use random method, such as random decision tree (RDT).
The basic idea of RDT is to split space randomly, and then given more accurate results by bagging all results for different random split. By this idea, we can also design an efficient way to calculate user-based recommendation for 10M users.
The detail method is a secret. However, I think may researchers can think out better ideas than my solution.
Related Posts
- Random Decision Tree (0.800)
- user-based推荐在GRSuggest中的应用 (0.200)
- user-based和item-based算法,谁能产生多样化的结果 (0.200)
- 用户对某类型算法的适应度 (0.200)
- Aardvark 专家推荐式的问答系统 (0.200)
Post a Comment