Rk-means: Fast Clustering for Relational Data image

Rk-means: Fast Clustering for Relational Data

This RelationalAI Research paper introduces Rk-means, or relationalk-means algorithm, for clustering relational data tuples without having to access the full data matrix.

Defensive Points-To Analysis: Effective Soundness via Laziness image

Defensive Points-To Analysis: Effective Soundness via Laziness

In this work, we present a more principled approach for identifying precision-critical methods, based on general patterns of value flows that explain where most of the imprecision arises in context-insensitive pointer analysis.

Worst-Case Optimal Join Algorithms: Techniques, Results and Open Problems image

Worst-Case Optimal Join Algorithms: Techniques, Results and Open Problems

Worst-case optimal join algorithms are the class of join algorithms whose runtime match the worst-case output size of a given join query. While the first provably worst-case optimal join algorithm was discovered relatively recently, the techniques and results surrounding these algorithms grow out of decades of research from a wide range of areas, intimately connecting graph theory, algorithms, information theory, constraint satisfaction, database theory, and geometric inequalities.

What Do Shannon-type Inequalities, Submodular Width, and Disjunctive Datalog Have to Do with One Another? image

What Do Shannon-type Inequalities, Submodular Width, and Disjunctive Datalog Have to Do with One Another?

Recent works on bounding the output size of a conjunctive query with functional dependencies and degree bounds have shown a deep connection between fundamental questions in information theory and database theory. This paper connects semantic query optimization, physical query optimization & cost estimation, to information theory with provable bounds.

Comprehensive Survey of Recursive Query Processing and Optimization Techniques using Datalog image

Comprehensive Survey of Recursive Query Processing and Optimization Techniques using Datalog

In recent years, we have witnessed a revival of the use of recursive queries in a variety of emerging application domains such as data integration and exchange, information extraction, networking, and program analysis. A popular language used for expressing these queries is Datalog.

Functional Aggregate Query (FAQ): Questions Asked Frequently image

Functional Aggregate Query (FAQ): Questions Asked Frequently

We define and study the Functional Aggregate Query (FAQ) problem, which encompasses many frequently asked questions in constraint satisfaction, databases, matrix operations, probabilistic graphical models and logic. This is our main conceptual contribution.