AutoML Two-Sample Test

Kübler, Jonas M.; Stimper, Vincent; Buchholz, Simon; Muandet, Krikamol; Schölkopf, Bernhard

Computer Science > Machine Learning

arXiv:2206.08843 (cs)

[Submitted on 17 Jun 2022 (v1), last revised 15 Jan 2023 (this version, v3)]

Title:AutoML Two-Sample Test

Authors:Jonas M. Kübler, Vincent Stimper, Simon Buchholz, Krikamol Muandet, Bernhard Schölkopf

View PDF

Abstract:Two-sample tests are important in statistics and machine learning, both as tools for scientific discovery as well as to detect distribution shifts. This led to the development of many sophisticated test procedures going beyond the standard supervised learning frameworks, whose usage can require specialized knowledge about two-sample testing. We use a simple test that takes the mean discrepancy of a witness function as the test statistic and prove that minimizing a squared loss leads to a witness with optimal testing power. This allows us to leverage recent advancements in AutoML. Without any user input about the problems at hand, and using the same method for all our experiments, our AutoML two-sample test achieves competitive performance on a diverse distribution shift benchmark as well as on challenging two-sample testing problems.
We provide an implementation of the AutoML two-sample test in the Python package autotst.

Comments:	NeurIPS 2022
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2206.08843 [cs.LG]
	(or arXiv:2206.08843v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2206.08843

Submission history

From: Jonas M. Kübler [view email]
[v1] Fri, 17 Jun 2022 15:41:07 UTC (455 KB)
[v2] Tue, 29 Nov 2022 12:43:39 UTC (457 KB)
[v3] Sun, 15 Jan 2023 12:45:39 UTC (457 KB)

Computer Science > Machine Learning

Title:AutoML Two-Sample Test

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:AutoML Two-Sample Test

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators