Adaptive Sampled Softmax with Kernel Based Sampling

Blanc, Guy; Rendle, Steffen

arXiv:1712.00527 (cs)

[Submitted on 2 Dec 2017 (v1), last revised 1 Aug 2018 (this version, v2)]

Abstract: Softmax is the most commonly used output function for multiclass problems and is widely used in areas such as vision, natural language processing, and recommendation. The cost of a softmax model is linear in the number of classes, which makes it too expensive for many real-world problems. A common approach to speeding up training is to sample only some of the classes at each training step. This method is known to be biased, and the bias increases the more the sampling distribution deviates from the output distribution. Nevertheless, almost all recent work uses simple sampling distributions that require a large sample size to mitigate the bias. In this work, we propose a new class of kernel based sampling methods and develop an efficient sampling algorithm. Kernel based sampling adapts to the model as it is trained, thus resulting in low bias. It can be easily applied to many models because it relies only on the model's last hidden layer. We empirically study the trade-off between bias, sampling distribution, and sample size and show that kernel based sampling results in low bias with few samples.
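
The idea of training on a sampled subset of classes can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the logit correction by -log(m * q_i), and the quadratic kernel alpha * <h, w_i>^2 + 1 used to build the proposal are all assumptions made for illustration, since the abstract names none of them. The proposal is computed here in time linear in the number of classes; the efficient sampling algorithm mentioned in the abstract is not reproduced.

```python
import math
import random


def log_sum_exp(values):
    """Numerically stable log(sum(exp(v))) over a list of floats."""
    peak = max(values)
    return peak + math.log(sum(math.exp(v - peak) for v in values))


def full_softmax_loss(logits, target):
    """Exact cross-entropy loss; touches every class once."""
    return log_sum_exp(logits) - logits[target]


def kernel_proposal(h, class_embeddings, alpha=1.0):
    """Hypothetical proposal q_i proportional to alpha * <h, w_i>^2 + 1.

    h is the last hidden layer; class_embeddings holds one vector per class.
    The quadratic kernel is an illustrative assumption.
    """
    scores = [
        alpha * sum(a * b for a, b in zip(h, w)) ** 2 + 1.0
        for w in class_embeddings
    ]
    total = sum(scores)
    return [s / total for s in scores]


def sampled_softmax_loss(logits, target, q, m, rng):
    """Cross-entropy over the target plus m classes drawn from q.

    Sampled logits are shifted by -log(m * q_i) so that the sum over the
    sample approximates the full partition function.
    """
    sampled = rng.choices(range(len(logits)), weights=q, k=m)  # with replacement
    adjusted = [logits[target]]  # the target is always kept, uncorrected
    adjusted += [logits[i] - math.log(m * q[i]) for i in sampled]
    return log_sum_exp(adjusted) - adjusted[0]


if __name__ == "__main__":
    rng = random.Random(0)
    h = [0.5, -1.0]
    embeddings = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [-1.0, 2.0]]
    logits = [sum(a * b for a, b in zip(h, w)) for w in embeddings]
    q = kernel_proposal(h, embeddings)
    print("full loss:   ", full_softmax_loss(logits, target=2))
    print("sampled loss:", sampled_softmax_loss(logits, 2, q, m=2, rng=rng))
```

Because the proposal depends on the hidden vector h, it changes as the model is trained, which is the sense in which such a sampler adapts to the model. A fixed uniform or frequency-based proposal would ignore h entirely.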

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1712.00527 [cs.LG]
	(or arXiv:1712.00527v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1712.00527

Submission history

From: Steffen Rendle
[v1] Sat, 2 Dec 2017 00:39:49 UTC (153 KB)
[v2] Wed, 1 Aug 2018 18:32:05 UTC (170 KB)
