Towards Federated Long-Tailed Graph Learning: An Energy-Guided Dual Decoupling Approach
Quick Answer
The FedEPD framework enhances Federated Graph Learning by addressing long-tailed data distributions, achieving up to 4.97% accuracy and 5.48% Macro-F1 improvements.
Quick Take
The FedEPD framework enhances Federated Graph Learning by addressing long-tailed data distributions, achieving up to 4.97% accuracy and 5.48% Macro-F1 improvements. It employs dual decoupling for topological purification and semantic recalibration, effectively protecting minority nodes from structural noise.
Key Points
- FedEPD addresses long-tailed distributions in Federated Graph Learning.
- Utilizes Dirichlet energy pruning for filtering heterophilic edges.
- Implements a two-stage optimization to protect majority decision boundaries.
- Achieves state-of-the-art performance on long-tailed benchmarks.
- Demonstrates significant improvements in accuracy and Macro-F1 scores.
Paper Resources
Article Content
From source RSS / original summaryarXiv:2606. 24237v1 Announce Type: new Abstract: Federated Graph Learning facilitates collaborative graph modeling across distributed clients while preserving data privacy. However, real-world data categories frequently exhibit long-tailed distributions. Such statistical scarcity severely degrades performance in two ways: it biases the global model toward majority classes, and it structurally isolates minority nodes by submerging them in heterophilic, head-dominated neighborhoods.
While existing methods attempt topology-agnostic statistical compensations, they often fail under data scarcity. Instead of recovering tail nodes, they overfit the structural noise from adjacent dominant classes, leading to representation degradation. To address these limitations, we propose FedEPD, a framework built on a dual decoupling paradigm that separates topological purification from semantic recalibration.
Specifically, FedEPD utilizes distribution-aware Dirichlet energy pruning to filter spatial heterophilic edges. It then overcomes Non-IID distribution shifts by extracting robust global prototypes from topologically central nodes, which are incorporated into local representations via a spatial low-pass prototype injection. Furthermore, a two stage alternating optimization strategy strictly protects majority decision boundaries while improving minority accuracy.
Extensive experiments demonstrate that FedEPD achieves state-of-the-art performance across diverse long-tailed benchmarks, yielding absolute improvements of up to 4. 97% in Accuracy and 5. 48% in Macro-F1.
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from arXiv cs.AI
See more →Arbor: Tree Search as a Cognition Layer for Autonomous Agents
Arbor introduces a framework utilizing structured tree search for optimizing LLM inference, achieving up to 193% throughput-latency improvement compared to vendor-optimized systems. It employs an Orchestrator and Critic agent for stability and coordination, demonstrating hardware-agnostic performance with minimal variance.