Not to Overfit or Underfit the Source Domains? An Empirical Study of Domain Generalization in Question Answering

Sultan, Md Arafat; Sil, Avirup; Florian, Radu

Computer Science > Computation and Language

arXiv:2205.07257 (cs)

[Submitted on 15 May 2022 (v1), last revised 25 Oct 2022 (this version, v3)]

Title:Not to Overfit or Underfit the Source Domains? An Empirical Study of Domain Generalization in Question Answering

Authors:Md Arafat Sultan, Avirup Sil, Radu Florian

View PDF

Abstract:Machine learning models are prone to overfitting their training (source) domains, which is commonly believed to be the reason why they falter in novel target domains. Here we examine the contrasting view that multi-source domain generalization (DG) is first and foremost a problem of mitigating source domain underfitting: models not adequately learning the signal already present in their multi-domain training data. Experiments on a reading comprehension DG benchmark show that as a model learns its source domains better -- using familiar methods such as knowledge distillation (KD) from a bigger model -- its zero-shot out-of-domain utility improves at an even faster pace. Improved source domain learning also demonstrates superior out-of-domain generalization over three popular existing DG approaches that aim to limit overfitting. Our implementation of KD-based domain generalization is available via PrimeQA at: this https URL.

Comments:	Accepted at EMNLP 2022
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2205.07257 [cs.CL]
	(or arXiv:2205.07257v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2205.07257

Submission history

From: Md Arafat Sultan [view email]
[v1] Sun, 15 May 2022 10:53:40 UTC (53 KB)
[v2] Sun, 23 Oct 2022 11:07:18 UTC (45 KB)
[v3] Tue, 25 Oct 2022 00:54:24 UTC (45 KB)

Computer Science > Computation and Language

Title:Not to Overfit or Underfit the Source Domains? An Empirical Study of Domain Generalization in Question Answering

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Not to Overfit or Underfit the Source Domains? An Empirical Study of Domain Generalization in Question Answering

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators