DeepCGC: Unveiling the Deep Clustering Mechanism of Fast Graph Condensation

Loading...
Thumbnail Image
File version

Accepted Manuscript (AM)

Author(s)
Gao, X
Li, W
Chen, T
Zhao, X
Nguyen, QVH
Yin, H
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
2026
Size
File type(s)
Location
License
Abstract

Graph condensation (GC) improves the efficiency of GNN training by condensing a large-scale graph into a compact synthetic graph. However, existing GC methods suffer from time-consuming optimization processes, and the underlying mechanisms driving their effectiveness remain unexplored. In this paper, we provide novel insights into the optimization strategies of GC, demonstrating that various methods ultimately converge to the class-level feature matching between the original and condensed graphs. Building on this understanding, we further refine the unified class-to-class matching paradigm into a fine-grained class-to-node paradigm, unveiling that the core mechanism of GC is a class-wise clustering problem in the latent space. Accordingly, we propose Deep Clustering-based Graph Condensation (DeepCGC), an efficient GC framework that integrates a clustering-based optimization objective with an invertible relay model. Extensive experiments show that DeepCGC achieves state-of-the-art efficiency and accuracy. Notably, it condenses the million-scale Ogbn-products graph in around 40 seconds—a 102× to 104× speedup over existing methods—while boosting accuracy by up to 4.6%.

Journal Title

IEEE Transactions on Knowledge and Data Engineering

Conference Title
Book Title
Edition
Volume
Issue
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)

ARC

Grant identifier(s)

DP240101108

Rights Statement
Rights Statement

This work is covered by copyright. You must assume that re-use is limited to personal use and that permission from the copyright owner must be obtained for all other uses. If the document is available under a specified licence, refer to the licence for details of permitted re-use. If you believe that this work infringes copyright please make a copyright takedown request using the form at https://www.griffith.edu.au/copyright-matters.

Item Access Status
Note
Access the data
Related item(s)
Subject

Data management and data science

Neural networks

Machine learning

Information and computing sciences

Persistent link to this record
Citation

Gao, X; Li, W; Chen, T; Zhao, X; Nguyen, QVH; Yin, H, DeepCGC: Unveiling the Deep Clustering Mechanism of Fast Graph Condensation, IEEE Transactions on Knowledge and Data Engineering, 2026

Collections