Decision trees are PAC-learnable from most product distributions: a smoothed analysis

Kalai, Adam Tauman; Teng, Shang-Hua

Computer Science > Machine Learning

arXiv:0812.0933 (cs)

[Submitted on 4 Dec 2008]

Title:Decision trees are PAC-learnable from most product distributions: a smoothed analysis

Authors:Adam Tauman Kalai, Shang-Hua Teng

View PDF

Abstract: We consider the problem of PAC-learning decision trees, i.e., learning a decision tree over the n-dimensional hypercube from independent random labeled examples. Despite significant effort, no polynomial-time algorithm is known for learning polynomial-sized decision trees (even trees of any super-constant size), even when examples are assumed to be drawn from the uniform distribution on {0,1}^n. We give an algorithm that learns arbitrary polynomial-sized decision trees for {\em most product distributions}. In particular, consider a random product distribution where the bias of each bit is chosen independently and uniformly from, say, [.49,.51]. Then with high probability over the parameters of the product distribution and the random examples drawn from it, the algorithm will learn any tree. More generally, in the spirit of smoothed analysis, we consider an arbitrary product distribution whose parameters are specified only up to a [-c,c] accuracy (perturbation), for an arbitrarily small positive constant c.

Subjects:	Machine Learning (cs.LG); Computational Complexity (cs.CC)
Cite as:	arXiv:0812.0933 [cs.LG]
	(or arXiv:0812.0933v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.0812.0933

Submission history

From: Adam Kalai [view email]
[v1] Thu, 4 Dec 2008 13:34:26 UTC (14 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CC

< prev | next >

new | recent | 2008-12

Change to browse by:

cs
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Adam Tauman Kalai
Shang-Hua Teng

export BibTeX citation

Computer Science > Machine Learning

Title:Decision trees are PAC-learnable from most product distributions: a smoothed analysis

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Decision trees are PAC-learnable from most product distributions: a smoothed analysis

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators