Computer Science > Machine Learning
[Submitted on 2 Mar 2021 (v1), revised 12 Dec 2021 (this version, v5), latest version 24 May 2022 (v7)]
Title:Generalizing to Unseen Domains: A Survey on Domain Generalization
View PDFAbstract:Machine learning systems generally assume that the training and testing distributions are the same. To this end, a key requirement is to develop models that can generalize to unseen distributions. Domain generalization (DG), i.e., out-of-distribution generalization, has attracted increasing interests in recent years. Domain generalization deals with a challenging setting where one or several different but related domain(s) are given, and the goal is to learn a model that can generalize to an unseen test domain. Great progress has been made in the area of domain generalization for years. This paper presents the first review of recent advances in this area. First, we provide a formal definition of domain generalization and discuss several related fields. We then thoroughly review the theories related to domain generalization and carefully analyze the theory behind generalization. We categorize recent algorithms into three classes: data manipulation, representation learning, and learning strategy, and present several popular algorithms in detail for each category. Third, we introduce the commonly used datasets, applications, and our open-sourced codebase for fair evaluation. Finally, we summarize existing literature and present some potential research topics for the future.
Submission history
From: Jindong Wang [view email][v1] Tue, 2 Mar 2021 06:04:11 UTC (366 KB)
[v2] Wed, 10 Mar 2021 06:11:06 UTC (367 KB)
[v3] Sat, 1 May 2021 02:21:03 UTC (2,436 KB)
[v4] Tue, 13 Jul 2021 03:31:28 UTC (2,436 KB)
[v5] Sun, 12 Dec 2021 08:24:04 UTC (398 KB)
[v6] Fri, 8 Apr 2022 01:46:58 UTC (2,486 KB)
[v7] Tue, 24 May 2022 02:40:03 UTC (4,954 KB)
Current browse context:
cs.LG
References & Citations
Bibliographic and Citation Tools
Bibliographic Explorer (What is the Explorer?)
Connected Papers (What is Connected Papers?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)
Code, Data and Media Associated with this Article
alphaXiv (What is alphaXiv?)
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Hugging Face (What is Huggingface?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)
Demos
Recommenders and Search Tools
Influence Flower (What are Influence Flowers?)
CORE Recommender (What is CORE?)
IArxiv Recommender
(What is IArxiv?)
arXivLabs: experimental projects with community collaborators
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.
Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.
Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.