Learning Causally Disentangled Representations for Fair Personality Detection
Learning Causally Disentangled Representations for Fair Personality Detection
Yangfu Zhu, Meiling Li, Yuting Wei, Di Liu, Yuqing Li , Bin Wu
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence
Main Track. Pages 8411-8419.
https://doi.org/10.24963/ijcai.2025/935
Personality detection aims to identify the personality traits implied in social posts. Existing methods mainly focus on learning the mapping between user-generated posts and personality trait labels but inevitably suffer from potential harm caused by individual bias, as these posts are written by authors from different backgrounds. Learning such spurious associations between posts and traits may lead to the formation of stereotypes, ultimately restricting the detection of personality in different kind of individual. To tackle the issue, we first investigate individual bias in personality detection from the causality perspective. We propose an Interventional Personality Detection Network (IPDN) to learn implicit confounders in user-generated posts and exploit the true causal effect to train the detection model. Specifically, our IPDN disentangled the causal and biased features behind user-generated posts, and then the biased features are accumulatively clustered as confounder prototypes as the training iterations increase. In parallel, the reconstruction network is reused to approximate backdoor adjustment on raw posts, ensuring that traits see each confounder equally before detection. Extensive experiments conducted on three real-world datasets demonstrate that our IPDN outperforms state-of-the-art methods in personality detection.
Keywords:
Natural Language Processing: NLP: Psycholinguistics
Natural Language Processing: NLP: Applications
Natural Language Processing: NLP: Sentiment analysis, stylistic analysis, and argument mining
