The Effectiveness of Johnson-Lindenstrauss Transform for High Dimensional Optimization With Adversarial Outliers, and the Recovery

Ding, Hu; Qin, Ruizhe; Huang, Jiawei

Computer Science > Computational Geometry

arXiv:2002.11923 (cs)

[Submitted on 27 Feb 2020 (v1), last revised 21 Feb 2021 (this version, v5)]

Title:The Effectiveness of Johnson-Lindenstrauss Transform for High Dimensional Optimization With Adversarial Outliers, and the Recovery

Authors:Hu Ding, Ruizhe Qin, Jiawei Huang

View PDF

Abstract:In this paper, we consider robust optimization problems in high dimensions. Because a real-world dataset may contain significant noise or even specially crafted samples from some attacker, we are particularly interested in the optimization problems with arbitrary (and potentially adversarial) outliers. We focus on two fundamental optimization problems: {\em SVM with outliers} and {\em $k$-center clustering with outliers}. They are in fact extremely challenging combinatorial optimization problems, since we cannot impose any restriction on the adversarial outliers. Therefore, their computational complexities are quite high especially when we consider the instances in high dimensional spaces. The {\em Johnson-Lindenstrauss (JL) Transform} is one of the most popular methods for dimension reduction. Though the JL transform has been widely studied in the past decades, its effectiveness for dealing with adversarial outliers has never been investigated before (to the best of our knowledge). Based on some novel insights from the geometry, we prove that the complexities of these two problems can be significantly reduced through the JL transform. Moreover, we prove that the solution in the dimensionality-reduced space can be efficiently recovered in the original $\mathbb{R}^d$ while the quality is still preserved. In the experiments, we compare JL transform with several other well known dimension reduction methods, and study their performances on synthetic and real datasets.

Subjects:	Computational Geometry (cs.CG); Machine Learning (cs.LG)
Cite as:	arXiv:2002.11923 [cs.CG]
	(or arXiv:2002.11923v5 [cs.CG] for this version)
	https://doi.org/10.48550/arXiv.2002.11923

Submission history

From: Fan Yang [view email]
[v1] Thu, 27 Feb 2020 05:23:35 UTC (1,157 KB)
[v2] Wed, 9 Sep 2020 00:14:01 UTC (1,155 KB)
[v3] Thu, 10 Sep 2020 12:07:19 UTC (1,155 KB)
[v4] Thu, 17 Sep 2020 09:01:54 UTC (1,276 KB)
[v5] Sun, 21 Feb 2021 13:18:12 UTC (4,525 KB)

Computer Science > Computational Geometry

Title:The Effectiveness of Johnson-Lindenstrauss Transform for High Dimensional Optimization With Adversarial Outliers, and the Recovery

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computational Geometry

Title:The Effectiveness of Johnson-Lindenstrauss Transform for High Dimensional Optimization With Adversarial Outliers, and the Recovery

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators