1. Mining explainable local and global subgraph patterns with surprising densities
- Author
-
Junning Deng, Bo Kang, Tijl De Bie, and Jefrey Lijffijt
- Subjects
Theoretical computer science ,Computer Networks and Communications ,Computer science ,Graph summarization ,Of the form ,02 engineering and technology ,Graph ,Computer Science Applications ,Homogeneous ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Partition (number theory) ,020201 artificial intelligence & image processing ,Special case ,Prior information ,Information Systems ,Clustering coefficient - Abstract
The connectivity structure of graphs is typically related to the attributes of the vertices. In social networks for example, the probability of a friendship between any pair of people depends on a range of attributes, such as their age, residence location, workplace, and hobbies. The high-level structure of a graph can thus possibly be described well by means of patterns of the form ‘the subgroup of all individuals with certain properties X are often (or rarely) friends with individuals in another subgroup defined by properties Y’, ideally relative to their expected connectivity. Such rules present potentially actionable and generalizable insight into the graph. Prior work has already considered the search for dense subgraphs (‘communities’) with homogeneous attributes. The first contribution in this paper is to generalize this type of pattern to densities between apair of subgroups, as well as betweenall pairs from a set of subgroups that partition the vertices. Second, we develop a novel information-theoretic approach for quantifying the subjective interestingness of such patterns, by contrasting them with prior information an analyst may have about the graph’s connectivity. We demonstrate empirically that in the special case of dense subgraphs, this approach yields results that are superior to the state-of-the-art. Finally, we propose algorithms for efficiently finding interesting patterns of these different types.
- Published
- 2020