Towards Region-Adaptive Feature Disentanglement and Enhancement for Small Object Detection

Yanchao Bi; Yang Ning; Xiushan Nie; Xiankai Lu; Yongshun Gong; Leida Li

doi:10.24963/ijcai.2025/78

Towards Region-Adaptive Feature Disentanglement and Enhancement for Small Object Detection

Yanchao Bi, Yang Ning, Xiushan Nie, Xiankai Lu, Yongshun Gong, Leida Li

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence

Main Track. Pages 693-701. https://doi.org/10.24963/ijcai.2025/78

PDF BibTeX

Current feature fusion strategies often fail to adequately account for the influence of activation intensity across different scales on small object features, which impedes the effective detection of small objects. To address this limitation, we propose the Region-Adaptive Feature Disentanglement and Enhancement (RAFDE) strategy, which improves both downsampling and feature fusion by leveraging activation intensity variations at multiple scales. First, we introduce the Boundary Transitional Region-enhanced Downsampling (BTRD) module, which enhances boundary transitional regions containing both strongly and weakly activated features, thereby mitigating the loss of crucial boundary information for small objects. Second, we present the Regional-Adaptive Feature Fusion (RAFF) module, which adaptively disentangles and fuses co-activated and uni-activated regions from adjacent levels into the current level, effectively reducing the risk of small objects being overwhelmed. Extensive experiments on several public datasets demonstrate that the RAFDE strategy is highly effective and outperforms state-of-the-art methods. The code is available at https://github.com/b-yanchao/RAFDE.git.

Keywords:

Computer Vision: CV: Machine learning for vision

Computer Vision: CV: Recognition (object detection, categorization)

Machine Learning: ML: Classification

Machine Learning: ML: Convolutional networks