Netflix Can use similar data to promote research in other recommendation problem

After Netflix Prize, Netflix inc. announce that they will hold another competition by using more user data (gender, zipcode, age, etc) to improve prediction accuracy of recommender system. However, many Netflix users think this is violate to they privacy rules and do not want Netflix to reveal such data.

http://www.wired.com/threatlevel/2009/12/netflix-privacy-lawsuit/

As a competitor of Netflix Prize, I also agree with these users and I think we do not need more user data to improve accuracy. I think, we should focus on different problem. Netflix Prize use RMSE to measure the quality of different recommending algorithms. However, in real systems, our main task is recommendation not prediction.

In real recommender system, we have to recommend a list of product a user may like. This task is different from the problem solved by Netflix Prize. The problem of Netflix prize is to predict the rating a user will assign to a movie. In this competition, we already know what movie a user is watching and our task is only to predict rating. In this way, Netflix Prize does not solve the main problem of recommendation: how to recommend a list of items to active user?

Further more, users do not only rating items in a system, they may view/buy/comment items and rating behavior is only a small part of user behaviors. So, I think, Netflix do not need to reveal user data, they can reveal more behavior data rather than rating.

Netflix can also reveal nothing to hold another competition only data revealed in NetflixPrize1. They can propose different problems, such as how to recommend a list of items for an active user. One disadvantage of Netflix Prize is that, after Netflix Prize, most of researchers in recommender system thinks rating prediction is the only thing in recommendation. I think, Netflix should not focus on rating prediction now. They should promote research in other recommendation problems, such as Top-N problem, diversity, novelty, etc.

Comments 3

  1. Daniel Haran wrote:

    While I agree it would be a privacy violation, it seems me we can’t yet conclude the data wouldn’t be useful. The more pressing question as you said is what the objective should be.

    It’s sad, but what I consider to be the most promising avenues for further development do not really work well in a contest format. Will the users have fun using the system? How can we make recommendations transparent, interactive and explainable?

    Posted 22 一 2010 at 3:00 下午
  2. xlvector wrote:

    Yes, I mean we can do many thing else without user profile data

    Posted 22 一 2010 at 4:35 下午
  3. Netflix wrote:

    This is not good to make user’s private information public.

    Posted 11 二 2010 at 2:03 下午

Trackbacks & Pingbacks 1

  1. From Tweets that mention Netflix Can use similar data to promote research in other recommendation problem | xlvector – Recommender System -- Topsy.com on 23 一 2010 at 12:36 上午

    [...] This post was mentioned on Twitter by xlvector, xlvector. xlvector said: Netflix Can use similar data to promote research in other recommendation problem http://bit.ly/5sxG9o [...]

Post a Comment

Your email is never published nor shared. Required fields are marked *

*
To prove you're a person (not a spam script), type the security word shown in the picture. Click on the picture to hear an audio file of the word.
Click to hear an audio file of the anti-spam word