Adaptive Deep Learning from Crowds

Hang Yang; Zhiwu Li; Witold Pedrycz

doi:10.24963/ijcai.2025/475

Adaptive Deep Learning from Crowds

Hang Yang, Zhiwu Li, Witold Pedrycz

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence

Main Track. Pages 4263-4272. https://doi.org/10.24963/ijcai.2025/475

PDF BibTeX

In the data-driven era, collecting high-quality labeled data requiring human labor is a common approach for training data-hungry models, called crowdsourcing. Recently, end-to-end learning from crowds has shown its flexibility and practicality. However, existing works in an end-to-end manner focus on learning after collecting labels, which results in noisy annotations and also requires cost. Inspired by computerized adaptive testing, we argue that the characteristics of workers should be mined as soon as possible to make the best use of talents. To this end, we propose an adaptive learning from crowds method, AdaCrowd, as a cost-effective solution. Specifically, we propose a probabilistic model to capture the informativeness of possible instances for each worker. The informativeness is considered to be the uncertainty of the annotation prediction model output in its current status. The adaptive learning procedure is optimized by maximizing data likelihood and can be used with existing crowdsourcing models. Extensive experiments are conducted on real-world datasets, LabelMe and CIFAR-10H. The experimental results, e.g., the reduction of annotations without performance degradation, demonstrate the effectiveness.

Keywords:

Humans and AI: HAI: Human computation and crowdsourcing

Machine Learning: ML: Active learning

Machine Learning: ML: Cost-sensitive learning

Uncertainty in AI: UAI: Uncertainty representations