Scalable Neural Contextual Bandit for Recommender Systems

Zhu, Zheqing; Van Roy, Benjamin

Computer Science > Information Retrieval

arXiv:2306.14834 (cs)

[Submitted on 26 Jun 2023 (v1), last revised 19 Aug 2023 (this version, v3)]

Title:Scalable Neural Contextual Bandit for Recommender Systems

Authors:Zheqing Zhu, Benjamin Van Roy

View PDF

Abstract:High-quality recommender systems ought to deliver both innovative and relevant content through effective and exploratory interactions with users. Yet, supervised learning-based neural networks, which form the backbone of many existing recommender systems, only leverage recognized user interests, falling short when it comes to efficiently uncovering unknown user preferences. While there has been some progress with neural contextual bandit algorithms towards enabling online exploration through neural networks, their onerous computational demands hinder widespread adoption in real-world recommender systems. In this work, we propose a scalable sample-efficient neural contextual bandit algorithm for recommender systems. To do this, we design an epistemic neural network architecture, Epistemic Neural Recommendation (ENR), that enables Thompson sampling at a large scale. In two distinct large-scale experiments with real-world tasks, ENR significantly boosts click-through rates and user ratings by at least 9% and 6% respectively compared to state-of-the-art neural contextual bandit algorithms. Furthermore, it achieves equivalent performance with at least 29% fewer user interactions compared to the best-performing baseline algorithm. Remarkably, while accomplishing these improvements, ENR demands orders of magnitude fewer computational resources than neural contextual bandit baseline algorithms.

Subjects:	Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2306.14834 [cs.IR]
	(or arXiv:2306.14834v3 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2306.14834
Journal reference:	ACM International Conference on Information and Knowledge Management (CIKM 2023) 32nd ACM International Conference on Information and Knowledge Management (CIKM 2023)

Submission history

From: Zheqing Zhu [view email]
[v1] Mon, 26 Jun 2023 16:39:39 UTC (3,183 KB)
[v2] Sun, 30 Jul 2023 09:01:01 UTC (3,183 KB)
[v3] Sat, 19 Aug 2023 03:32:53 UTC (3,183 KB)

Computer Science > Information Retrieval

Title:Scalable Neural Contextual Bandit for Recommender Systems

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:Scalable Neural Contextual Bandit for Recommender Systems

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators