Bayesian Aggregation of Categorical Distributions with Applications in Crowdsourcing

Bayesian Aggregation of Categorical Distributions with Applications in Crowdsourcing

Alexandry Augustin, Matteo Venanzi, Alex Rogers, Nicholas R. Jennings

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence
Main track. Pages 1411-1417. https://doi.org/10.24963/ijcai.2017/195

A key problem in crowdsourcing is the aggregation of judgments of proportions. For example, workers might be presented with a news article or an image, and be asked to identify the proportion of each topic, sentiment, object, or colour present in it. These varying judgments then need to be aggregated to form a consensus view of the document’s or image’s contents. Often, however, these judgments are skewed by workers who provide judgments randomly. Such spammers make the cost of acquiring judgments more expensive and degrade the accuracy of the aggregation. For such cases, we provide a new Bayesian framework for aggregating these responses (expressed in the form of categorical distributions) that for the first time accounts for spammers. We elicit 796 judgments about proportions of objects and coloursin images. Experimental results show comparable aggregation accuracy when 60% of the workers are spammers, as other state of the art approaches do when there are no spammers.
Keywords:
Machine Learning: Ensemble Methods
Machine Learning: Machine Learning
Machine Learning: Unsupervised Learning