On Breiman's Dilemma in Neural Networks: Phase Transitions of Margin Dynamics

Zhu, Weizhi; Huang, Yifei; Yao, Yuan


arXiv:1810.03389v1 (cs)

[Submitted on 8 Oct 2018 (this version), latest version 1 Jan 2021 (v3)]


Abstract: Margin enlargement over training data has been an important strategy in machine learning since perceptrons, with the aim of making classifiers more robust and thereby improving generalization. Yet Breiman's dilemma (Breiman, 1999) shows that a uniform improvement of the margin distribution does not necessarily reduce generalization error. In this paper, we revisit Breiman's dilemma in deep neural networks using recently proposed spectrally normalized margins. We provide a novel perspective that explains Breiman's dilemma through phase transitions in the dynamics of normalized margin distributions, which reflect the trade-off between the expressive power of models and the complexity of data. When data complexity is comparable to model expressiveness, in the sense that training and test data share similar phase transitions in their normalized margin dynamics, we derive two efficient ways to predict the trend of the generalization (test) error via classic margin-based generalization bounds with restricted Rademacher complexities. On the other hand, over-expressive models that exhibit uniform improvements in training margins, a phase transition distinct from that of the test margin dynamics, may lose this predictive power and fail to prevent overfitting. Experiments with some basic convolutional networks, AlexNet, VGG-16, and ResNet-18 on several datasets, including CIFAR10/100 and mini-ImageNet, show the validity of the proposed method.
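To make the notion of a spectrally normalized margin concrete, here is a minimal NumPy sketch. It is not taken from the paper: it assumes the margin of an example is the true-class score minus the largest other-class score, and that normalization divides by the product of the spectral norms of the layer weight matrices, which is one common convention. The function names (`margins`, `spectral_product`, `normalized_margins`, `margin_error`) are hypothetical and chosen for illustration only.

```python
import numpy as np

def margins(logits, labels):
    """Per-example margin: true-class score minus the best other-class score."""
    n = logits.shape[0]
    true_scores = logits[np.arange(n), labels]
    others = logits.copy()
    others[np.arange(n), labels] = -np.inf  # mask out the true class
    return true_scores - others.max(axis=1)

def spectral_product(weights):
    """Product of the largest singular values of the weight matrices.

    Assumption: this product serves as the normalizing constant.
    """
    return float(np.prod([np.linalg.norm(W, ord=2) for W in weights]))

def normalized_margins(logits, labels, weights):
    """Margins divided by the (assumed) spectral normalization factor."""
    return margins(logits, labels) / spectral_product(weights)

def margin_error(norm_margins, gamma):
    """Fraction of examples whose normalized margin is at most gamma."""
    return float(np.mean(norm_margins <= gamma))

# Toy data: two examples, three classes, two diagonal "layers".
logits = np.array([[2.0, 1.0, 0.0], [0.0, 3.0, 1.0]])
labels = np.array([0, 2])
weights = [np.diag([2.0, 1.0]), np.diag([3.0, 1.0])]

nm = normalized_margins(logits, labels, weights)
print(nm, margin_error(nm, gamma=0.0))
```

Under these assumptions, one could record `margin_error` on the training set at a fixed `gamma` after each epoch and compare its trend with the test error, which is the kind of margin dynamics the abstract refers to.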

Comments:	34 pages
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1810.03389 [cs.LG]
	(or arXiv:1810.03389v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1810.03389

Submission history

From: Yuan Yao
[v1] Mon, 8 Oct 2018 12:04:39 UTC (6,071 KB)
[v2] Thu, 18 Oct 2018 13:50:52 UTC (6,072 KB)
[v3] Fri, 1 Jan 2021 14:42:39 UTC (6,073 KB)
