After the Netflix Prize, Netflix Inc. announced that it would hold another competition using more user data (gender, ZIP code, age, etc.) to improve the prediction accuracy of its recommender system. However, many Netflix users think this violates their privacy and do not want Netflix to release such data.
http://www.wired.com/threatlevel/2009/12/netflix-privacy-lawsuit/
As a Netflix Prize competitor, I agree with these users, and I do not think we need more user data to improve accuracy. I think we should focus on a different problem. The Netflix Prize used RMSE to measure the quality of recommendation algorithms. In real systems, however, the main task is recommendation, not prediction.
In a real recommender system, we have to recommend a list of products that a user may like. This task is different from the one the Netflix Prize posed, which was to predict the rating a user will assign to a movie. In that competition, we already knew which movie a user had watched, and our only task was to predict the rating. So the Netflix Prize does not solve the main problem of recommendation: how do we recommend a list of items to the active user?
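To make the difference concrete, here is a minimal sketch that puts the two evaluation styles side by side: RMSE scores predicted ratings on known (user, movie) pairs, while precision and recall at N score a ranked list against the items the user actually liked. The function names, the toy data, and the choice of precision/recall as the Top-N metric are my own assumptions for illustration, not anything defined by the Netflix Prize.

```python
import math

def rmse(predicted, actual):
    """Root mean squared error over the (user, item) pairs in `actual`."""
    squared_errors = [(predicted[key] - actual[key]) ** 2 for key in actual]
    return math.sqrt(sum(squared_errors) / len(squared_errors))

def precision_recall_at_n(recommended, relevant, n):
    """Precision and recall of the first n recommended items.

    `recommended` is a ranked list of item ids; `relevant` is the set of
    items the user actually liked (hypothetical ground truth).
    """
    top_n = recommended[:n]
    hits = sum(1 for item in top_n if item in relevant)
    precision = hits / n if n else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall

# Rating prediction: the (user, movie) pairs are given, only the rating is unknown.
actual_ratings = {("u1", "m1"): 4.0, ("u1", "m2"): 2.0}
predicted_ratings = {("u1", "m1"): 3.5, ("u1", "m2"): 2.5}
rating_error = rmse(predicted_ratings, actual_ratings)

# Top-N recommendation: the system must choose which items to show at all.
recommended_list = ["m1", "m3", "m4", "m5"]
liked_items = {"m1", "m5", "m9"}
precision, recall = precision_recall_at_n(recommended_list, liked_items, 3)

print(rating_error, precision, recall)
```

A system can have a low RMSE on the pairs it is asked about and still produce a poor list, because RMSE never asks which items should be shown in the first place.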
Furthermore, users do not only rate items in a system. They may also view, buy, or comment on items, and rating is only a small part of user behavior. So I think Netflix does not need to release user profile data; it could release more behavior data beyond ratings.
Netflix could also hold another competition without releasing anything new, using only the data from the first Netflix Prize. It could pose different problems, such as how to recommend a list of items to an active user. One disadvantage of the Netflix Prize is that, since it ended, most researchers in recommender systems seem to think rating prediction is the only problem in recommendation. I think Netflix should not focus on rating prediction now. It should promote research on other recommendation problems, such as Top-N recommendation, diversity, and novelty.
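Diversity and novelty have no single agreed definition, so the following is only a sketch under stated assumptions: diversity is taken as the average pairwise dissimilarity (one minus Jaccard similarity over genre sets) within a recommended list, and novelty as the mean self-information of the recommended items, where `popularity` is the fraction of users who have rated each item. The names, the genre features, and the numbers are all hypothetical.

```python
import math
from itertools import combinations

def jaccard(a, b):
    """Jaccard similarity between two sets of features."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def intra_list_diversity(items, features):
    """Average pairwise dissimilarity (1 - Jaccard) within one recommended list."""
    pairs = list(combinations(items, 2))
    if not pairs:
        return 0.0
    total = sum(1.0 - jaccard(features[i], features[j]) for i, j in pairs)
    return total / len(pairs)

def novelty(items, popularity):
    """Mean self-information, -log2(popularity), of the recommended items."""
    return sum(-math.log2(popularity[item]) for item in items) / len(items)

# Hypothetical item features and popularity (fraction of users who rated the item).
genres = {"m1": {"action"}, "m2": {"action", "comedy"}, "m3": {"drama"}}
popularity = {"m1": 0.5, "m2": 0.25, "m3": 0.125}
recommended_list = ["m1", "m2", "m3"]

diversity_score = intra_list_diversity(recommended_list, genres)
novelty_score = novelty(recommended_list, popularity)
print(diversity_score, novelty_score)
```

Both scores can be computed from rating data alone plus item metadata, so a competition built around them would not need any user profile data.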
Comments 3
While I agree it would be a privacy violation, it seems to me we can’t yet conclude the data wouldn’t be useful. The more pressing question, as you said, is what the objective should be.
It’s sad, but what I consider to be the most promising avenues for further development do not really work well in a contest format. Will the users have fun using the system? How can we make recommendations transparent, interactive and explainable?
Posted 22 January 2010 at 3:00 PM
Yes, I mean we can do many other things without user profile data.
Posted 22 January 2010 at 4:35 PM
It is not good to make users’ private information public.
Posted 11 February 2010 at 2:03 PM
Trackbacks & Pingbacks 1
[...] This post was mentioned on Twitter by xlvector, xlvector. xlvector said: Netflix Can use similar data to promote research in other recommendation problem http://bit.ly/5sxG9o [...]