Size Bounds for Conjunctive Queries with General Functional Dependencies

Valiant, Gregory; Valiant, Paul

Computer Science > Databases

arXiv:0909.2030 (cs)

[Submitted on 10 Sep 2009 (v1), last revised 12 Dec 2009 (this version, v2)]

Title:Size Bounds for Conjunctive Queries with General Functional Dependencies

Authors:Gregory Valiant, Paul Valiant

View PDF

Abstract: This paper extends the work of Gottlob, Lee, and Valiant (PODS 2009)[GLV], and considers worst-case bounds for the size of the result Q(D) of a conjunctive query Q to a database D given an arbitrary set of functional dependencies. The bounds in [GLV] are based on a "coloring" of the query variables. In order to extend the previous bounds to the setting of arbitrary functional dependencies, we leverage tools from information theory to formalize the original intuition that each color used represents some possible entropy of that variable, and bound the maximum possible size increase via a linear program that seeks to maximize how much more entropy is in the result of the query than the input. This new view allows us to precisely characterize the entropy structure of worst-case instances for conjunctive queries with simple functional dependencies (keys), providing new insights into the results of [GLV]. We extend these results to the case of general functional dependencies, providing upper and lower bounds on the worst-case size increase. We identify the fundamental connection between the gap in these bounds and a central open question in information theory.
Finally, we show that, while both the upper and lower bounds are given by exponentially large linear programs, one can distinguish in polynomial time whether the result of a query with an arbitrary set of functional dependencies can be any larger than the input database.

Comments:	22 pages, 2 figures
Subjects:	Databases (cs.DB); Data Structures and Algorithms (cs.DS)
ACM classes:	H.2.4; F.2.0
Cite as:	arXiv:0909.2030 [cs.DB]
	(or arXiv:0909.2030v2 [cs.DB] for this version)
	https://doi.org/10.48550/arXiv.0909.2030

Submission history

From: Gregory Valiant [view email]
[v1] Thu, 10 Sep 2009 20:28:20 UTC (12 KB)
[v2] Sat, 12 Dec 2009 08:00:21 UTC (139 KB)

Computer Science > Databases

Title:Size Bounds for Conjunctive Queries with General Functional Dependencies

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Databases

Title:Size Bounds for Conjunctive Queries with General Functional Dependencies

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators