Training Deep Networks from Zero to Hero: avoiding pitfalls and going beyond

Ponti, Moacir Antonelli; Santos, Fernando Pereira dos; Ribeiro, Leo Sampaio Ferraz; Cavallari, Gabriel Biscaro

Computer Science > Machine Learning

arXiv:2109.02752 (cs)

[Submitted on 6 Sep 2021 (v1), last revised 13 Oct 2021 (this version, v2)]

Title:Training Deep Networks from Zero to Hero: avoiding pitfalls and going beyond

Authors:Moacir Antonelli Ponti, Fernando Pereira dos Santos, Leo Sampaio Ferraz Ribeiro, Gabriel Biscaro Cavallari

View PDF

Abstract:Training deep neural networks may be challenging in real world data. Using models as black-boxes, even with transfer learning, can result in poor generalization or inconclusive results when it comes to small datasets or specific applications. This tutorial covers the basic steps as well as more recent options to improve models, in particular, but not restricted to, supervised learning. It can be particularly useful in datasets that are not as well-prepared as those in challenges, and also under scarce annotation and/or small data. We describe basic procedures: as data preparation, optimization and transfer learning, but also recent architectural choices such as use of transformer modules, alternative convolutional layers, activation functions, wide and deep networks, as well as training procedures including as curriculum, contrastive and self-supervised learning.

Comments:	Extended version of SIBGRAPI 2021 Tutorial Paper
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2109.02752 [cs.LG]
	(or arXiv:2109.02752v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.02752

Submission history

From: Moacir Antonelli Ponti [view email]
[v1] Mon, 6 Sep 2021 21:31:42 UTC (3,350 KB)
[v2] Wed, 13 Oct 2021 12:51:50 UTC (3,365 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2021-09

Change to browse by:

cs
cs.CV

References & Citations

DBLP - CS Bibliography

listing | bibtex

Moacir Antonelli Ponti
Fernando Pereira dos Santos

export BibTeX citation

Computer Science > Machine Learning

Title:Training Deep Networks from Zero to Hero: avoiding pitfalls and going beyond

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Training Deep Networks from Zero to Hero: avoiding pitfalls and going beyond

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators