Leveraging Local Variation in Data: Sampling and Weighting Schemes for Supervised Deep Learning

Novello, Paul; Poëtte, Gaël; Lugato, David; Congedo, Pietro

doi:10.1615/JMachLearnModelComput.2022041819

Statistics > Machine Learning

arXiv:2101.07561 (stat)

[Submitted on 19 Jan 2021 (v1), last revised 27 Sep 2022 (this version, v3)]

Title:Leveraging Local Variation in Data: Sampling and Weighting Schemes for Supervised Deep Learning

Authors:Paul Novello, Gaël Poëtte, David Lugato, Pietro Congedo

View PDF

Abstract:In the context of supervised learning of a function by a neural network, we claim and empirically verify that the neural network yields better results when the distribution of the data set focuses on regions where the function to learn is steep. We first traduce this assumption in a mathematically workable way using Taylor expansion and emphasize a new training distribution based on the derivatives of the function to learn. Then, theoretical derivations allow constructing a methodology that we call Variance Based Samples Weighting (VBSW). VBSW uses labels local variance to weight the training points. This methodology is general, scalable, cost-effective, and significantly increases the performances of a large class of neural networks for various classification and regression tasks on image, text, and multivariate data. We highlight its benefits with experiments involving neural networks from linear models to ResNet and Bert.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
Cite as:	arXiv:2101.07561 [stat.ML]
	(or arXiv:2101.07561v3 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2101.07561
Related DOI:	https://doi.org/10.1615/JMachLearnModelComput.2022041819

Submission history

From: Paul Novello [view email] [via CCSD proxy]
[v1] Tue, 19 Jan 2021 11:08:40 UTC (1,253 KB)
[v2] Thu, 28 Jan 2021 12:50:28 UTC (1,274 KB)
[v3] Tue, 27 Sep 2022 15:37:42 UTC (4,717 KB)

Statistics > Machine Learning

Title:Leveraging Local Variation in Data: Sampling and Weighting Schemes for Supervised Deep Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Leveraging Local Variation in Data: Sampling and Weighting Schemes for Supervised Deep Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators