Generic Adversarial Attack Framework Against Vertical Federated Learning
Yimin Liu, Peng Jiang
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence
Main Track. Pages 5806-5814.
https://doi.org/10.24963/ijcai.2025/646
Vertical federated learning (VFL) enables feature-level collaboration by incorporating scattered attributes from aligned samples, allowing each party to contribute its personalized input to joint training and inference. Injecting adversarial inputs can steer the joint inference toward the attacker's goal, reducing the benign parties' contributions to a negligible level and depriving them of rewards tied to the importance of those contributions. However, most existing attacks require queries to the server model, subsets of complete test samples, or labeled auxiliary images from the training domain; these extra requirements are impractical for real-world VFL applications. In this paper, we propose PGAC, a novel and practical attack framework for crafting adversarial inputs that dominate joint inference without relying on any of these requirements. PGAC advances prior attacks by requiring only access to auxiliary images from non-training domains. PGAC learns generalized label-indicative embeddings and estimates class-transferable probabilities across domains to build a proxy model that closely approximates the server model. It then augments images by emphasizing salient regions with class activation maps, creating a diverse shadow input set that resembles influential test inputs. Combining proxy fidelity with input diversity, PGAC crafts transferable adversarial inputs. Evaluation on diverse model architectures confirms the effectiveness of PGAC.
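The abstract names two concrete mechanisms: class-activation-map (CAM) guided augmentation to build shadow inputs, and crafting transferable adversarial inputs against a local proxy of the server model. As a rough illustration only, the PyTorch sketches below show generic versions of both steps; every name (gradcam_mask, emphasize_salient, craft_adversarial, proxy_model) and all hyperparameters are assumptions for illustration, not taken from the paper.

```python
import torch
import torch.nn.functional as F

def gradcam_mask(model, x, target_layer, class_idx):
    """Grad-CAM-style saliency mask for one class (generic recipe, not PGAC's code)."""
    feats, grads = [], []
    h1 = target_layer.register_forward_hook(lambda m, i, o: feats.append(o))
    h2 = target_layer.register_full_backward_hook(lambda m, gi, go: grads.append(go[0]))
    logits = model(x)
    model.zero_grad()
    logits[:, class_idx].sum().backward()
    h1.remove(); h2.remove()
    fmap, grad = feats[0], grads[0]                    # (B, C, H, W)
    weights = grad.mean(dim=(2, 3), keepdim=True)      # per-channel importance
    cam = F.relu((weights * fmap).sum(dim=1, keepdim=True))
    cam = F.interpolate(cam, size=x.shape[2:], mode="bilinear", align_corners=False)
    return cam / cam.amax(dim=(2, 3), keepdim=True).clamp(min=1e-8)

def emphasize_salient(x, mask, gain=0.5):
    """Boost salient regions to form shadow inputs (hypothetical augmentation)."""
    return (x * (1.0 + gain * mask)).clamp(0.0, 1.0)
```

For the final stage, a standard targeted projected-gradient-descent (PGD) attack against the proxy model is one plausible instantiation of "crafting transferable adversarial inputs":

```python
def craft_adversarial(proxy_model, x, target_class, eps=8/255, alpha=2/255, steps=40):
    """Targeted PGD against a local proxy model (generic sketch, not PGAC's exact procedure)."""
    proxy_model.eval()
    x_adv = x.clone().detach()
    target = torch.full((x.size(0),), target_class, dtype=torch.long, device=x.device)
    for _ in range(steps):
        x_adv.requires_grad_(True)
        loss = F.cross_entropy(proxy_model(x_adv), target)
        (grad,) = torch.autograd.grad(loss, x_adv)
        # Targeted step: descend the loss to move predictions toward target_class.
        x_adv = x_adv.detach() - alpha * grad.sign()
        # Project into the L-infinity eps-ball around the clean input; keep valid pixel range.
        x_adv = torch.min(torch.max(x_adv, x - eps), x + eps).clamp(0.0, 1.0)
    return x_adv.detach()
```

In the threat model described above, the attacker party would submit such an input as its feature contribution during joint inference; whether the real server model is misled depends on how well the adversarial input transfers from the proxy.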
Keywords:
Machine Learning: ML: Federated learning
AI Ethics, Trust, Fairness: ETF: Other
Multidisciplinary Topics and Applications: MTA: Security and privacy
