Proceedings Abstracts of the Twenty-Fifth International Joint Conference on Artificial Intelligence

Toward a Robust and Universal Crowd-Labeling Framework / 4006
Faiza Khan Khattak

One of the main challenges in crowd-labeling is controlling for, or determining in advance, the proportion of low-quality or malicious labelers. We propose methods that estimate labeler-related and data-instance-related parameters using frequentist and Bayesian approaches. All of these approaches rely on expert-labeled instances (ground truth) for a small percentage of the data to learn the parameters. We also derive a lower bound on the number of expert-labeled instances needed to obtain better-quality labels.
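
As a rough illustration of the general idea (not the method proposed in the paper), the sketch below uses a small expert-labeled subset to estimate each labeler's accuracy and then aggregates crowd labels with a weighted vote. Everything in it is an assumption made for the example: binary labels, a single accuracy parameter per labeler, add-one smoothing (the posterior mean under a uniform Beta(1, 1) prior), log-odds vote weights, and the hypothetical names `estimate_accuracies` and `weighted_vote`.

```python
import math

def estimate_accuracies(labels, expert):
    """Estimate one accuracy per labeler from expert-labeled instances.

    labels: list of per-labeler lists of 0/1 labels (assumed layout).
    expert: dict mapping instance index -> ground-truth label.
    Uses add-one smoothing, i.e. the posterior mean under a Beta(1, 1) prior.
    """
    accs = []
    for row in labels:
        agree = sum(1 for i, y in expert.items() if row[i] == y)
        accs.append((agree + 1) / (len(expert) + 2))
    return accs

def weighted_vote(labels, accs):
    """Aggregate binary labels, weighting each labeler by log-odds of accuracy.

    A labeler with estimated accuracy below 0.5 gets a negative weight,
    so a consistently wrong (e.g. malicious) labeler's vote is inverted.
    """
    weights = [math.log(a / (1 - a)) for a in accs]
    n_instances = len(labels[0])
    result = []
    for i in range(n_instances):
        score = sum(w if row[i] == 1 else -w for w, row in zip(weights, labels))
        result.append(1 if score > 0 else 0)
    return result

# Toy data (invented for illustration): six instances, three labelers.
truth = [1, 0, 1, 1, 0, 0]
labels = [
    [1, 0, 1, 1, 0, 0],  # reliable labeler
    [0, 1, 0, 0, 1, 1],  # malicious labeler: always flips the label
    [1, 0, 0, 0, 1, 0],  # noisy labeler
]
expert = {0: 1, 1: 0, 2: 1}  # ground truth known for the first three instances only

accs = estimate_accuracies(labels, expert)
aggregated = weighted_vote(labels, accs)
print(accs, aggregated)
```

On this toy data the weighted vote recovers all six true labels, whereas an unweighted majority vote would mislabel three of them, because the malicious labeler's votes are not inverted. With only three expert-labeled instances the accuracy estimates are coarse, which is the kind of trade-off a lower bound on the number of expert-labeled instances addresses.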