Blocking-based Neighbor Sampling for Large-scale Graph Neural Networks

Kai-Lang Yao; Wu-Jun Li

doi:10.24963/ijcai.2021/455

Blocking-based Neighbor Sampling for Large-scale Graph Neural Networks

Kai-Lang Yao, Wu-Jun Li

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence

Main Track. Pages 3307-3313. https://doi.org/10.24963/ijcai.2021/455

PDF BibTeX

The exponential increase in computation and memory complexity with the depth of network has become the main impediment to the successful application of graph neural networks (GNNs) on large-scale graphs like graphs with hundreds of millions of nodes. In this paper, we propose a novel neighbor sampling strategy, dubbed blocking-based neighbor sampling (BNS), for efficient training of GNNs on large-scale graphs. Specifically, BNS adopts a policy to stochastically block the ongoing expansion of neighboring nodes, which can reduce the rate of the exponential increase in computation and memory complexity of GNNs. Furthermore, a reweighted policy is applied to graph convolution, to adjust the contribution of blocked and non-blocked neighbors to central nodes. We theoretically prove that BNS provides an unbiased estimation for the original graph convolution operation. Extensive experiments on three benchmark datasets show that, on large-scale graphs, BNS is 2X~5X faster than state-of-the-art methods when achieving the same accuracy. Moreover, even on the small-scale graphs, BNS also demonstrates the advantage of low time cost.

Keywords:

Machine Learning: Relational Learning

Machine Learning Applications: Big data; Scalability

Data Mining: Mining Graphs, Semi Structured Data, Complex Data