Learning High-Degree Parities: The Crucial Role of the Initialization

Abbe, Emmanuel; Cornacchia, Elisabetta; Hązła, Jan; Kougang-Yombi, Donald

Computer Science > Machine Learning

arXiv:2412.04910 (cs)

[Submitted on 6 Dec 2024 (v1), last revised 5 Mar 2025 (this version, v3)]

Title:Learning High-Degree Parities: The Crucial Role of the Initialization

Authors:Emmanuel Abbe, Elisabetta Cornacchia, Jan Hązła, Donald Kougang-Yombi

View PDF HTML (experimental)

Abstract:Parities have become a standard benchmark for evaluating learning algorithms. Recent works show that regular neural networks trained by gradient descent can efficiently learn degree $k$ parities on uniform inputs for constant $k$, but fail to do so when $k$ and $d-k$ grow with $d$ (here $d$ is the ambient dimension). However, the case where $k=d-O_d(1)$ (almost-full parities), including the degree $d$ parity (the full parity), has remained unsettled. This paper shows that for gradient descent on regular neural networks, learnability depends on the initial weight distribution. On one hand, the discrete Rademacher initialization enables efficient learning of almost-full parities, while on the other hand, its Gaussian perturbation with large enough constant standard deviation $\sigma$ prevents it. The positive result for almost-full parities is shown to hold up to $\sigma=O(d^{-1})$, pointing to questions about a sharper threshold phenomenon. Unlike statistical query (SQ) learning, where a singleton function class like the full parity is trivially learnable, our negative result applies to a fixed function and relies on an initial gradient alignment measure of potential broader relevance to neural networks learning.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2412.04910 [cs.LG]
	(or arXiv:2412.04910v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2412.04910

Submission history

From: Donald Kougang Yombi [view email]
[v1] Fri, 6 Dec 2024 10:05:10 UTC (378 KB)
[v2] Thu, 27 Feb 2025 16:08:40 UTC (391 KB)
[v3] Wed, 5 Mar 2025 08:37:17 UTC (391 KB)

Computer Science > Machine Learning

Title:Learning High-Degree Parities: The Crucial Role of the Initialization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Learning High-Degree Parities: The Crucial Role of the Initialization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators