Combining Heterogeneous Classifiers for Relational Databases

Manjunatha, Geetha; Murty, M Narasimha; Sitaram, Dinkar

Computer Science > Machine Learning

arXiv:1201.2925v1 (cs)

A newer version of this paper has been withdrawn by Geetha Manjunath

[Submitted on 13 Jan 2012 (this version), latest version 12 Mar 2012 (v2)]

Title:Combining Heterogeneous Classifiers for Relational Databases

Authors:Geetha Manjunatha, M Narasimha Murty, Dinkar Sitaram

View PDF

Abstract:Most enterprise data is distributed in multiple relational databases with expert-designed schema. Using traditional single-table machine learning techniques over such data not only incur a computational penalty for converting to a 'flat' form (mega-join), even the human-specified semantic information present in the relations is lost. In this paper, we present a practical, two-phase hierarchical meta-classification algorithm for relational databases with a semantic divide and conquer approach. We propose a recursive, prediction aggregation technique over heterogeneous classifiers applied on individual database tables. The proposed algorithm was evaluated on three diverse datasets, namely TPCH, PKDD and UCI benchmarks and showed considerable reduction in classification time without any loss of prediction accuracy.

Comments:	22 pages
Subjects:	Machine Learning (cs.LG); Databases (cs.DB)
Cite as:	arXiv:1201.2925 [cs.LG]
	(or arXiv:1201.2925v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1201.2925

Submission history

From: Geetha Manjunath [view email]
[v1] Fri, 13 Jan 2012 19:54:27 UTC (55 KB)
[v2] Mon, 12 Mar 2012 20:23:24 UTC (1 KB) (withdrawn)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2012-01

Change to browse by:

cs
cs.DB

References & Citations

DBLP - CS Bibliography

listing | bibtex

Geetha Manjunath
M. Narasimha Murty
Dinkar Sitaram

export BibTeX citation

Computer Science > Machine Learning

Title:Combining Heterogeneous Classifiers for Relational Databases

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Combining Heterogeneous Classifiers for Relational Databases

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators