1. GraphZero
- Author
-
Connor Holmes, Bo Wu, Sam Reinehr, Daniel Mawhirter, and Tongping Liu
- Subjects
Schedule ,Theoretical computer science ,Matching (graph theory) ,Computer science ,02 engineering and technology ,Clique (graph theory) ,computer.software_genre ,Set (abstract data type) ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Redundancy (engineering) ,General Earth and Planetary Sciences ,Graph (abstract data type) ,Overhead (computing) ,020201 artificial intelligence & image processing ,Compiler ,computer ,General Environmental Science - Abstract
Subgraph matching is a fundamental task in many applications which identifies all the embeddings of a query pattern in an input graph. Compilation-based subgraph matching systems generate specialized implementations for the provided patterns and often substantially outperform other systems. However, the generated code causes significant computation redundancy and the compilation process incurs too much overhead to be used online, both due to the inherent symmetry in the structure of the query pattern. In this paper, we propose an optimizing query compiler, named GraphZero, to completely address these limitations through symmetry breaking based on group theory. GraphZero implements three novel techniques. First, its schedule explorer efficiently prunes the schedule space without missing any high-performance schedule. Second, it automatically generates and enforces a set of restrictions to eliminate computation redundancy. Third, it generalizes orientation, a surprisingly effective optimization that was only used for clique patterns, to apply to arbitrary patterns. Evaluation on multiple query patterns shows that GraphZero outperforms two state-of-the-art compilation and non-compilation based systems by up to 40X and 2654X, respectively.
- Published
- 2021