GANs for Semi-Supervised Opinion Spam Detection

Gray Stanton; Athirai A. Irissappane

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence

Main track. Pages 5204-5210. https://doi.org/10.24963/ijcai.2019/723

Online reviews have become a vital source of information when purchasing a product or service. Opinion spammers manipulate reviews, distorting the overall perception of the service. A key challenge in detecting opinion spam is obtaining ground truth: although large numbers of reviews exist, only a few have been labeled as spam or non-spam. We propose spamGAN, a generative adversarial network that relies on limited labeled data as well as unlabeled data for opinion spam detection. spamGAN improves on state-of-the-art GAN-based techniques for text classification. Experiments on TripAdvisor data show that spamGAN outperforms existing techniques when labeled data is limited. spamGAN can also generate reviews with reasonable perplexity.

Keywords:

Natural Language Processing: Text Classification

Machine Learning: Deep Learning

Multidisciplinary Topics and Applications: Security and Privacy