1. Representation, ranking and bias of minorities in sampling attributed networks.
- Author
-
Antunes, Nelson, Banerjee, Sayan, Bhamidi, Shankar, and Pipiras, Vladas
- Abstract
We investigate three related problems concerning sampling minorities in attributed networks. This is guided by a general attributed network model which can incorporate several levels of homophily and heterophily, and whose degree and Page-rank distributions have known properties. The first problem investigates sampling schemes that favor the representation of the minority over majority nodes and give preference to "more popular" minority nodes (i.e. higher degree/Page-rank) for a given homophily scenario. We show that (in-)degree and Page-rank sampling schemes increase the probability of sampling a minority node. The second problem concerns the relative ranking of minorities compared to majorities in degree and Page-rank based sampling schemes for several homophily and heterophily scenarios. We provide analytical conditions for the minority nodes to rank higher as a function of the model parameters for the degree based samplings and investigate the problem numerically for Page-rank based sampling schemes. The third problem considers subgraph sampling schemes and the bias of the proportion of minority nodes in top ranked degree nodes in several homophily and heterophily scenarios. Finally, the results and findings obtained from the sampling analysis are assessed on real-world networks. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF