Accelerating Scalable Graph Neural Network Inference with Node-Adaptive Propagation

Loading...
Thumbnail Image
File version

Accepted Manuscript (AM)

Author(s)
Gao, Xinyi
Zhang, Wentao
Yu, Junliang
Shao, Yingxia
Nguyen, Quoc Viet Hung
Cui, Bin
Yin, Hongzhi
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
2024
Size
File type(s)
Location

Utrecht, Netherlands

License
Abstract

Graph neural networks (GNNs) have exhibited exceptional efficacy in a diverse array of applications. However, the sheer size of large-scale graphs presents a significant challenge to real-time inference with GNNs. Although existing Scalable GNNs leverage linear propagation to preprocess the features and accelerate the training and inference procedure, these methods still suffer from scalability issues when making inferences on unseen nodes, as the feature preprocessing requires the graph to be known and fixed. To further accelerate Scalable GNNs inference in this inductive setting, we propose an online propagation framework and two novel node-adaptive propagation methods that can customize the optimal propagation depth for each node based on its topological information and thereby avoid redundant feature propagation. The trade-off between accuracy and latency can be flexibly managed through simple hyper-parameters to accommodate various latency constraints. Moreover, to compensate for the inference accuracy loss caused by the potential early termination of propagation, we further propose Inception Distillation to exploit the multi-scale receptive field information within graphs. The rigorous and comprehensive experimental study on public datasets with varying scales and characteristics demonstrates that the proposed inference acceleration framework outperforms existing state-of-the-art graph inference acceleration methods in terms of accuracy and efficiency. Particularly, the superiority of our approach is notable on datasets with larger scales, yielding a 75× inference speedup on the largest Ogbn-products dataset.

Journal Title
Conference Title

2024 IEEE 40th International Conference on Data Engineering (ICDE)

Book Title
Edition
Volume
Issue
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)

ARC

Grant identifier(s)

DP240101108

Rights Statement
Rights Statement

This work is covered by copyright. You must assume that re-use is limited to personal use and that permission from the copyright owner must be obtained for all other uses. If the document is available under a specified licence, refer to the licence for details of permitted re-use. If you believe that this work infringes copyright please make a copyright takedown request using the form at https://www.griffith.edu.au/copyright-matters.

Item Access Status
Note
Access the data
Related item(s)
Subject

Neural networks

Persistent link to this record
Citation

Gao, X; Zhang, W; Yu, J; Shao, Y; Nguyen, QVH; Cui, B; Yin, H, Accelerating Scalable Graph Neural Network Inference with Node-Adaptive Propagation, 2024 IEEE 40th International Conference on Data Engineering (ICDE), 2024, pp. 3042-3055