RULLS: Randomized Union of Locally Linear Subspaces for Feature Engineering

Lokare, Namita; Silva, Jorge; Kabul, Ilknur Kaynar

arXiv:1804.09770 (cs)

[Submitted on 25 Apr 2018]

Abstract: Feature engineering plays an important role in the success of a machine learning model. Most of the effort in training a model goes into data preparation and choosing the right representation. In this paper, we propose a robust feature engineering method, Randomized Union of Locally Linear Subspaces (RULLS). We generate sparse, non-negative, and rotation-invariant features in an unsupervised fashion. RULLS aggregates features from a random union of subspaces by describing each point using globally chosen landmarks. These landmarks serve as anchor points for choosing subspaces. Our method provides a way to select features that are relevant in the neighborhood around these chosen landmarks. Distances from each data point to its $k$ closest landmarks are encoded in the feature matrix. The final feature representation is a union of features from all chosen subspaces.
We demonstrate the effectiveness of our algorithm on various real-world datasets for clustering and classification, both on raw data and in the presence of noise. We compare our method with existing feature generation methods. Results show that our method performs well on both classification and clustering tasks.
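
The abstract only outlines the idea of encoding distances to the $k$ closest landmarks and taking a union over randomly chosen subspaces. The following is a minimal sketch of that idea, not the authors' implementation. Everything specific here is an assumption made for illustration: the function name `landmark_features`, its parameters, the use of random coordinate subsets as stand-ins for subspaces, uniform sampling of landmarks from the data, and the Euclidean distance.

```python
import math
import random


def landmark_features(points, n_landmarks, k, n_subspaces, subspace_dim, seed=0):
    """Hypothetical sketch of landmark-distance features (not the paper's code).

    In each of `n_subspaces` rounds:
      1. pick a random subset of coordinates (assumed stand-in for a subspace),
      2. pick `n_landmarks` landmarks at random from the data (assumption),
      3. for every point, keep the distances to its k nearest landmarks and
         leave the remaining entries at zero (sparse, non-negative block).
    The blocks from all rounds are concatenated column-wise.
    """
    rng = random.Random(seed)
    n_dims = len(points[0])
    features = [[] for _ in points]

    for _ in range(n_subspaces):
        dims = rng.sample(range(n_dims), subspace_dim)
        landmarks = rng.sample(points, n_landmarks)

        for row, p in zip(features, points):
            # Euclidean distance to each landmark, restricted to `dims`.
            dists = [
                math.sqrt(sum((p[d] - lm[d]) ** 2 for d in dims))
                for lm in landmarks
            ]
            # Indices of the k nearest landmarks for this point.
            nearest = sorted(range(n_landmarks), key=dists.__getitem__)[:k]

            block = [0.0] * n_landmarks
            for j in nearest:
                block[j] = dists[j]
            row.extend(block)

    return features


# Usage on synthetic data: 20 points in 5 dimensions.
data_rng = random.Random(1)
X = [[data_rng.gauss(0, 1) for _ in range(5)] for _ in range(20)]
F = landmark_features(X, n_landmarks=4, k=2, n_subspaces=3, subspace_dim=2)
print(len(F), len(F[0]))  # 20 rows, 3 * 4 = 12 columns
```

Each row has at most `k` nonzero entries per round, which is where the sparsity comes from in this sketch. One limitation of this simplified encoding is that a point that is itself a landmark gets a distance of zero, which is indistinguishable from "not among the nearest"; a fuller treatment would need to handle that case.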

Comments:	9 pages
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1804.09770 [cs.LG]
	(or arXiv:1804.09770v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1804.09770

Submission history

From: Namita Lokare
[v1] Wed, 25 Apr 2018 19:37:55 UTC (1,340 KB)
