Stochastic Feature Averaging for Learning with Long-Tailed Noisy Labels


Hao-Tian Li, Tong Wei, Hao Yang, Kun Hu, Chong Peng, Li-Bo Sun, Xun-Liang Cai, Min-Ling Zhang

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence
Main Track. Pages 3902-3910. https://doi.org/10.24963/ijcai.2023/434

Deep neural networks have shown promising results on a wide variety of tasks using large-scale and well-annotated training datasets. However, data collected from real-world applications can suffer from two prevalent biases, i.e., long-tailed class distribution and label noise. Previous efforts on long-tailed learning and label-noise learning can each address only a single type of data bias, leading to severe performance deterioration when both biases are present. In this paper, we propose a distance-based sample selection algorithm called Stochastic Feature Averaging (SFA), which fits a Gaussian using the exponential running average of class centroids to capture the uncertainty in representation space caused by label noise and data scarcity. With SFA, we detect noisy samples based on their distances to class centroids sampled from this Gaussian distribution. Based on the identified clean samples, we then train an auxiliary balanced classifier to improve generalization for the minority classes and to facilitate the update of the Gaussian parameters. Extensive experimental results show that SFA can enhance the performance of existing methods on both simulated and real-world datasets. Further, we combine SFA with sample-selection approaches as well as distribution-robust and noise-robust loss functions, yielding significant improvements over the baselines. Our code is available at https://github.com/HotanLee/SFA.
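
To make the selection mechanism concrete, the sketch below illustrates the idea described in the abstract: an exponential running average of per-class feature centroids, a Gaussian placed around each centroid, and distance-based selection of samples against centroids drawn from that Gaussian. This is a minimal sketch under stated assumptions, not the paper's exact formulation; the class name, the fixed standard deviation `sigma`, and the hyperparameters `momentum` and `threshold` are illustrative placeholders.

```python
import numpy as np

class StochasticFeatureAverager:
    """Minimal sketch of the SFA idea: EMA class centroids, a Gaussian
    around each centroid, and distance-based clean-sample selection."""

    def __init__(self, num_classes, feat_dim, momentum=0.9, sigma=0.1):
        self.momentum = momentum          # assumed EMA momentum
        self.sigma = sigma                # assumed fixed Gaussian std
        self.centroids = np.zeros((num_classes, feat_dim))

    def update(self, features, labels):
        # Exponential running average of class centroids from the mini-batch.
        for c in np.unique(labels):
            batch_mean = features[labels == c].mean(axis=0)
            self.centroids[c] = (self.momentum * self.centroids[c]
                                 + (1.0 - self.momentum) * batch_mean)

    def select_clean(self, features, labels, threshold=1.0, rng=None):
        # Sample one centroid per class from N(centroid, sigma^2 I), then keep
        # samples lying close to the sampled centroid of their labeled class.
        rng = rng or np.random.default_rng()
        sampled = self.centroids + self.sigma * rng.standard_normal(self.centroids.shape)
        dists = np.linalg.norm(features - sampled[labels], axis=1)
        return dists < threshold          # boolean mask of presumed-clean samples
```

In a training loop one would call `update` on features extracted by the backbone each iteration and use the mask returned by `select_clean` to restrict the loss (or the auxiliary balanced classifier) to presumed-clean samples; the per-class variance modeling and balanced-classifier details follow the paper rather than this sketch.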
Keywords:
Machine Learning: ML: Weakly supervised learning
Machine Learning: ML: Multi-label
Machine Learning: ML: Semi-supervised learning