Exact Block-Constant Rating Matrix Recovery from a Few Noisy Observations

Xu, Jiaming; Wu, Rui; Zhu, Kai; Hajek, Bruce; Srikant, R.; Ying, Lei

Abstract: Recommender systems predict user preferences based on a small number of observed, possibly noisy ratings. To allow accurate predictions, one common assumption is that the rating matrix has low rank. This paper considers a more structured movie rating model in which (1) users and movies form clusters, (2) users from the same cluster give the same rating to movies in the same cluster, and (3) the ratings are either +1 or -1. The corresponding rating matrix is a block-constant matrix with binary entries, which is a special type of low-rank matrix.
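The rating model can be made concrete with a minimal sketch, assuming $n$ users and $n$ movies split into $r$ contiguous clusters of equal size. The function names, the random choice of block ratings, the uniform sampling of observed entries, and the independent sign flips used as noise are all illustrative assumptions rather than details taken from the paper.

```python
import random


def block_constant_matrix(n, r, seed=0):
    """Build an n x n matrix of +1/-1 ratings that is constant on each
    (user cluster, movie cluster) block. Clusters are contiguous index
    ranges of size n // r (an assumption made for illustration)."""
    if n % r != 0:
        raise ValueError("n must be divisible by r for equal-size clusters")
    rng = random.Random(seed)
    size = n // r
    # One rating per (user cluster, movie cluster) pair, drawn at random.
    block_ratings = [[rng.choice((1, -1)) for _ in range(r)] for _ in range(r)]
    # User i belongs to cluster i // size; movie j to cluster j // size.
    return [[block_ratings[i // size][j // size] for j in range(n)]
            for i in range(n)]


def observe(matrix, m, flip_prob=0.0, seed=1):
    """Sample m distinct entries uniformly at random and flip the sign of
    each observed rating independently with probability flip_prob
    (a hypothetical noise model)."""
    n = len(matrix)
    rng = random.Random(seed)
    observed = {}
    for p in rng.sample(range(n * n), m):
        i, j = divmod(p, n)
        rating = matrix[i][j]
        if rng.random() < flip_prob:
            rating = -rating
        observed[(i, j)] = rating
    return observed
```

For example, `observe(block_constant_matrix(6, 3), m=10, flip_prob=0.1)` returns a dictionary mapping 10 `(user, movie)` pairs to possibly corrupted ratings.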
Consider a system with $n$ users, $n$ movies, $r$ user clusters, and $r$ movie clusters of equal sizes, and assume that we observe $m$ ratings. In the ideal case where the observations are noiseless, predicting the ratings reduces to clustering the users and movies, and we show that a simple algorithm based on finding the maximum clique succeeds as soon as $m=\Omega(n r^{1/2} \log^{1/2} n)$. This is fewer than the number of observations required if we only make a low-rank assumption. For the more general noisy setting, we propose a convex program to recover the rating matrix: among matrices with entries in the range $[-1,1]$, it maximizes a weighted sum of the correlation with observed ratings and the nuclear norm. This convex program is provably correct when $m=\Omega(nr^2)$, but we conjecture that $m=\Omega(nr \log n)$ is sufficient. Again, our block-constant and binary assumptions allow us to exactly recover the matrix with fewer observations and with a larger fraction of noisy entries. Additionally, our analysis is novel and considerably simpler than previous work on low-rank matrix completion.
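One way to write down the convex program described above is sketched here. The notation is assumed for illustration and is not taken from the paper: $R$ is the $n \times n$ matrix holding the observed ratings with zeros at unobserved entries, $\|X\|_{*}$ is the nuclear norm, and $\lambda > 0$ is a hypothetical weight. Because maximizing a nuclear norm with a positive weight would not be a convex problem, the sketch assumes the nuclear norm enters with a negative weight, that is, as a penalty.

```latex
\[
\begin{aligned}
\max_{X \in \mathbb{R}^{n \times n}} \quad & \langle R, X \rangle - \lambda \, \|X\|_{*} \\
\text{subject to} \quad & -1 \le X_{ij} \le 1 \quad \text{for all } i, j .
\end{aligned}
\]
```

Here $\langle R, X \rangle = \sum_{i,j} R_{ij} X_{ij}$ is the correlation with the observed ratings; since $R_{ij} = 0$ at unobserved entries, only observed positions contribute to the sum.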

Subjects:	Machine Learning (stat.ML)
Cite as:	arXiv:1310.0512 [stat.ML]
	(or arXiv:1310.0512v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1310.0512

