ExpertDiff: Head-less Model Reprogramming with Diffusion Classifiers for Out-of-Distribution Generalization
Jee Seok Yoon, Junghyo Sohn, Wootaek Jeong, Heung-Il Suk
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence
Main Track. Pages 6866-6874.
https://doi.org/10.24963/ijcai.2025/764
Vision-language models have achieved remarkable performance across various tasks by leveraging large-scale multimodal training data. However, their ability to generalize to out-of-distribution (OOD) domains requiring expert-level knowledge remains an open challenge. To address this, we investigate cross-domain transfer learning approaches for efficiently adapting diffusion classifiers to new target domains that demand expert-level domain knowledge. Specifically, we propose ExpertDiff, a head-less model reprogramming technique that optimizes the instruction-following abilities of text-to-image diffusion models via learnable prompts, while leveraging the diffusion classifier objective as a modular plug-and-play adaptor. Our approach eliminates the need for conventional output mapping layers (e.g., linear probes), enabling seamless integration with off-the-shelf diffusion frameworks such as Stable Diffusion. We demonstrate the effectiveness of ExpertDiff on various OOD datasets (i.e., medical and satellite imagery). Furthermore, we qualitatively showcase ExpertDiff's ability to faithfully reconstruct input images, highlighting its potential for both downstream discriminative and upstream generative tasks. Our work paves the way for effectively repurposing powerful foundation models for novel OOD applications requiring domain expertise.
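The head-less decision rule underlying diffusion classifiers can be sketched as follows: each class is represented by a (learnable) prompt embedding, and an input is assigned to the class whose conditioned denoising error is smallest, so no linear probe or output mapping layer is needed. The sketch below is purely illustrative: `toy_eps_pred` is a made-up stand-in for a text-conditioned noise predictor (e.g., a Stable Diffusion UNet), and the prompt embeddings here are fixed vectors rather than the learned prompts of ExpertDiff.

```python
import numpy as np

rng = np.random.default_rng(0)

def toy_eps_pred(x_t, t, prompt_emb):
    # Hypothetical stand-in for a conditioned noise predictor
    # eps_theta(x_t, t, c); NOT the actual ExpertDiff model.
    return x_t - prompt_emb

def diffusion_classifier(x0, prompt_embs, n_steps=32):
    """Assign x0 to the class prompt with the smallest average
    denoising error (the standard diffusion-classifier rule)."""
    errs = []
    for emb in prompt_embs:
        err = 0.0
        for _ in range(n_steps):
            eps = rng.standard_normal(x0.shape)           # sample noise
            t = rng.uniform(0.1, 1.0)                     # sample timestep
            x_t = np.sqrt(1 - t) * x0 + np.sqrt(t) * eps  # forward diffusion
            eps_hat = toy_eps_pred(x_t, t, emb)           # conditioned prediction
            err += np.mean((eps_hat - eps) ** 2)          # denoising loss
        errs.append(err / n_steps)
    # Head-less: the class index is the argmin over prompts,
    # with no learned output layer on top.
    return int(np.argmin(errs))
```

In a real pipeline the per-class prompt embeddings would be the trainable parameters, optimized with the same denoising objective on target-domain data, while the diffusion backbone stays frozen.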
Keywords:
Machine Learning: ML: Learnware/model reuse/transfer learning
Computer Vision: CV: Biomedical image analysis
Computer Vision: CV: Transfer, low-shot, semi- and un-supervised learning
Machine Learning: ML: Generative models
