A Simple Way to Deal with Cherry-picking

Komiyama, Junpei; Maehara, Takanori

Statistics > Methodology

arXiv:1810.04996 (stat)

[Submitted on 11 Oct 2018]

Title:A Simple Way to Deal with Cherry-picking

Authors:Junpei Komiyama, Takanori Maehara

View PDF

Abstract:Statistical hypothesis testing serves as statistical evidence for scientific innovation. However, if the reported results are intentionally biased, hypothesis testing no longer controls the rate of false discovery. In particular, we study such selection bias in machine learning models where the reporter is motivated to promote an algorithmic innovation. When the number of possible configurations (e.g., datasets) is large, we show that the reporter can falsely report an innovation even if there is no improvement at all. We propose a `post-reporting' solution to this issue where the bias of the reported results is verified by another set of results. The theoretical findings are supported by experimental results with synthetic and real-world datasets.

Subjects:	Methodology (stat.ME); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1810.04996 [stat.ME]
	(or arXiv:1810.04996v1 [stat.ME] for this version)
	https://doi.org/10.48550/arXiv.1810.04996

Submission history

From: Junpei Komiyama [view email]
[v1] Thu, 11 Oct 2018 13:06:48 UTC (239 KB)

Full-text links:

Access Paper:

view license

Current browse context:

stat.ME

< prev | next >

new | recent | 2018-10

Change to browse by:

cs
cs.LG
stat
stat.ML

References & Citations

export BibTeX citation

Statistics > Methodology

Title:A Simple Way to Deal with Cherry-picking

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Methodology

Title:A Simple Way to Deal with Cherry-picking

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators