Learning Few-shot Sample-set Operations for Noisy Multi-label Aspect Category Detection

Learning Few-shot Sample-set Operations for Noisy Multi-label Aspect Category Detection

Shiman Zhao, Wei Chen, Tengjiao Wang

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence
Main Track. Pages 5306-5313. https://doi.org/10.24963/ijcai.2023/589

Multi-label Aspect Category Detection (MACD) is essential for aspect-based sentiment analysis, which aims to identify multiple aspect categories in a given sentence. Few-shot MACD is critical due to the scarcity of labeled data. However, MACD is a high-noise task, and existing methods fail to address it with only two or three training samples per class, which limits the application in practice. To solve above issues, we propose a group of Few-shot Sample-set Operations (FSO) to solve noisy MACD in fewer sample scenarios by identifying the semantic contents of samples. Learning interactions among intersection, subtraction, and union networks, the FSO imitates arithmetic operations on samples to distinguish relevant and irrelevant aspect contents. Eliminating the negative effect caused by noises, the FSO extracts discriminative prototypes and customizes a dedicated query vector for each class. Besides, we design a multi-label architecture, which integrates with score-wise loss and multi-label loss to optimize the FSO for multi-label prediction, avoiding complex threshold training or selection. Experiments show that our method achieves considerable performance. Significantly, it improves by 11.01% at most and an average of 8.59% Macro-F in fewer sample scenarios.
Keywords:
Natural Language Processing: NLP: Sentiment analysis, stylistic analysis, and argument mining
Machine Learning: ML: Few-shot learning
Natural Language Processing: NLP: Dialogue and interactive systems