ForgDiffuser: General Image Forgery Localization with Diffusion Models

Mengxi Wang, Shaozhang Niu, Jiwei Zhang

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence
Main Track. Pages 1954-1962. https://doi.org/10.24963/ijcai.2025/218

Current general image forgery localization (GIFL) methods confront two main challenges: decoder overconfidence, which causes misidentification of authentic regions or incomplete predicted masks, and limited accuracy in localizing forgery details. Recently, diffusion models have emerged as a dominant approach among generative models, proving particularly effective at capturing complex scene details. However, their potential for GIFL remains underexplored. We therefore propose a GIFL framework named ForgDiffuser built on diffusion models. The core of ForgDiffuser lies in leveraging diffusion models conditioned on the forged image to efficiently generate the segmentation mask for tampered regions. Specifically, we introduce an attention-guided module (AGM) to aggregate and enhance image feature representations. Meanwhile, we design a boundary-driven module (BDM) with edge supervision to improve the localization accuracy of boundary details. Additionally, the probabilistic modeling and stochastic sampling mechanisms of diffusion models effectively alleviate the overconfidence issue commonly observed in traditional decoders. Experiments on six benchmark datasets demonstrate that ForgDiffuser outperforms existing mainstream GIFL methods in both localization accuracy and robustness, especially under challenging manipulation conditions.
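The central idea of generating a tampered-region mask by reverse diffusion conditioned on the input image can be sketched in a few lines. The snippet below is a minimal toy illustration, not the paper's method: `toy_denoiser` is a hypothetical stand-in for the learned conditional network (here it simply pulls the mask toward bright image regions as a proxy for "tampered" pixels), and the sampling loop shows the generic conditional denoising pattern the abstract describes.

```python
import numpy as np

def toy_denoiser(noisy_mask, image, t):
    # Hypothetical stand-in for the learned network: nudges the mask
    # toward bright image regions (a proxy for "tampered" pixels).
    target = (image > image.mean()).astype(float)
    return noisy_mask + 0.5 * (target - noisy_mask)

def sample_mask(image, steps=10, seed=0):
    # Reverse diffusion: start from pure noise and iteratively denoise,
    # conditioning every step on the (suspected forged) input image.
    rng = np.random.default_rng(seed)
    mask = rng.standard_normal(image.shape)
    for t in reversed(range(steps)):
        mask = toy_denoiser(mask, image, t)
    return (mask > 0.5).astype(np.uint8)

# Toy "forged" image: a bright square pasted on a dark background.
img = np.zeros((8, 8))
img[2:5, 2:5] = 1.0
print(sample_mask(img))  # 1s inside the pasted square, 0s elsewhere
```

Because sampling starts from random noise, different seeds yield slightly different masks; averaging several samples gives a soft, calibrated prediction, which is one way the stochastic sampling mechanism can mitigate decoder overconfidence.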
Keywords:
Computer Vision: CV: Recognition (object detection, categorization)
Computer Vision: CV: Image and video synthesis and generation