Clickbait Detection via Contrastive Variational Modelling of Text and Label

Xiaoyuan Yi; Jiarui Zhang; Wenhao Li; Xiting Wang; Xing Xie

doi:10.24963/ijcai.2022/621

Clickbait Detection via Contrastive Variational Modelling of Text and Label

Xiaoyuan Yi, Jiarui Zhang, Wenhao Li, Xiting Wang, Xing Xie

Watch video

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence

Main Track. Pages 4475-4481. https://doi.org/10.24963/ijcai.2022/621

PDF BibTeX

Clickbait refers to deliberately created sensational or deceptive text for tricking readers into clicking, which severely hurts the web ecosystem. With a growing number of clickbaits on social media, developing automatic detection methods becomes essential. Nonetheless, the performance of existing neural classifiers is limited due to the underutilization of small labelled datasets. Inspired by related pedagogy theories that learning to write can promote comprehension ability, we propose a novel Contrastive Variational Modelling (CVM) framework to exploit the labelled data better. CVM models the conditional distributions of text and clickbait labels by predicting labels from text and generating text from labels simultaneously with Variational AutoEncoder and further differentiates the learned spaces under each label by a mixed contrastive learning loss. In this way, CVM can capture more underlying textual properties and hence utilize label information to its full potential, boosting detection performance. We theoretically demonstrate CVM as learning a joint distribution of text, clickbait label, and latent variable. Experiments on three clickbait detection datasets show our method's robustness to inadequate and biased labels, outperforming several recent strong baselines.

Keywords:

Natural Language Processing: Text Classification