Dremel: Interactive Analysis of Web-Scale Datasets.

Authors :
Melnik, Sergey
Gubarev, Andrey
Long, Jing Jing
Romer, Geoffrey
Shivakumar, Shiva
Tolton, Matt
Vassilakis, Theo
Source :
Communications of the ACM; Jun 2011, Vol. 54 Issue 6, p114-123, 10p, 9 Diagrams, 1 Chart, 7 Graphs
Publication Year :
2011

Abstract

Dremel is a scalable, interactive ad hoc query system for analysis of read-only nested data. By combining multilevel execution trees and columnar data layout, it is capable of running aggregation queries over trillion-row tables in seconds. The system scales to thousands of CPUs and petabytes of data, and has thousands of users at Google. In this paper, we describe the architecture and implementation of Dremel, and explain how it complements MapReduce-based computing. We present a novel columnar storage representation for nested records and discuss experiments on few-thousand-node instances of the system. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
0001-0782
Volume :
54
Issue :
6
Database :
Complementary Index
Journal :
Communications of the ACM
Publication Type :
Periodical
Accession number :
63231745
Full Text :
https://doi.org/10.1145/1953122.1953148