The goal of the article is to bridge the gap between theoretical and practical approaches to answering queries over databases with nulls. Theoretical research long ago identified the notion of correctness of query answering over...
Materialization Optimizations for Feature Selection Workloads
Ce Zhang, Arun Kumar, Christopher Ré
Article No.: 2
There is an arms race in the data management industry to support statistical analytics. Feature selection, the process of selecting a feature set that will be used to build a statistical model, is widely regarded as the most critical step of...
ENFrame: A Framework for Processing Probabilistic Data
Dan Olteanu, Sebastiaan J. Van Schaik
Article No.: 3
This article introduces ENFrame, a framework for processing probabilistic data. Using ENFrame, users can write programs in a fragment of Python with constructs such as loops, list comprehension, aggregate operations on lists, and calls to external...
Dichotomies for Queries with Negation in Probabilistic Databases
Robert Fink, Dan Olteanu
Article No.: 4
This article charts the tractability frontier of two classes of relational algebra queries in tuple-independent probabilistic databases. The first class consists of queries with join, projection, selection, and negation but without repeating...
Query autocompletion has become a standard feature in many search applications, especially search engines. A recent trend is to support error-tolerant autocompletion, which increases usability significantly by matching prefixes...
Declarative Cleaning of Inconsistencies in Information Extraction
Ronald Fagin, Benny Kimelfeld, Frederick Reiss, Stijn Vansummeren
Article No.: 6
The population of a predefined relational schema from textual content, commonly known as Information Extraction (IE), is a pervasive task in contemporary computational challenges associated with Big Data. Since the textual content varies widely in...
Inferring Social Strength from Spatiotemporal Data
Huy Pham, Cyrus Shahabi, Yan Liu
Article No.: 7
The advent of geolocation technologies has generated unprecedentedly rich datasets of people’s location information at very high fidelity. These location datasets can be used to study human behavior; for example, social studies have shown...