Back to Search Start Over

A Join-Like Operator to Combine Data Cubes and Answer Queries from Multiple Data Cubes

Authors :
Francesco M. Malvestuto
Source :
ACM Transactions on Database Systems. 39:1-31
Publication Year :
2014
Publisher :
Association for Computing Machinery (ACM), 2014.

Abstract

In order to answer a “joint” query from multiple data cubes, Pourabass and Shoshani [2007] distinguish the data cube on the measure of interest (called the “primary” data cube) from the other data cubes (called “proxy” data cubes) that are used to involve the dimensions (in the query) not in the primary data cube. They demonstrate in study cases that, if the measures of the primary and proxy data cubes are correlated, then the answer to a joint query is an accurate estimate of its true value. Needless to say, for two or more proxy data cubes, the result depends upon the way the primary and proxy data cubes are combined together; however, for certain combination schemes Pourabass and Shoshani provide a sufficient condition , that they call proxy noncommonality , for the invariance of the result. In this article, we introduce: (1) a merge operator combining the contents of a primary data cube with the contents of a proxy data cube, (2) merge expressions for general combination schemes, and (3) an equivalence relation between merge expressions having the same pattern. Then, we prove that proxy noncommonality characterizes patterns for which every two merge expressions are equivalent. Moreover, we provide an efficient procedure for answering joint queries in the special case of perfect merge expressions. Finally, we show that our results apply to data cubes in which measures are obtained from unaggregated data using the aggregate functions SUM, COUNT, MAX, and MIN, and a lot more.

Details

ISSN :
15574644 and 03625915
Volume :
39
Database :
OpenAIRE
Journal :
ACM Transactions on Database Systems
Accession number :
edsair.doi.dedup.....bec939a3e9c126cc6e6413caad52491b