Hashtag2Vec: Learning Hashtag Representation with Relational Hierarchical Embedding Model

Hashtag2Vec: Learning Hashtag Representation with Relational Hierarchical Embedding Model

Jie Liu, Zhicheng He, Yalou Huang

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence
Main track. Pages 3456-3462. https://doi.org/10.24963/ijcai.2018/480

Hashtags have always been important elements in many social network platforms and micro-blog services. Semantic understanding of hashtags is a critical and fundamental task for many applications on social networks, such as event analysis, theme discovery, information retrieval, etc. However, this task is challenging due to the sparsity, polysemy, and synonymy of hashtags. In this paper, we investigate the problem of hashtag embedding by combining the short text content with the various heterogeneous relations in social networks. Specifically, we first establish a network with hashtags as its nodes. Hierarchically, each of the hashtag nodes is associated with a set of tweets and each tweet contains a set of words. Then we devise an embedding model, called Hashtag2Vec, which exploits multiple relations of hashtag-hashtag, hashtag-tweet, tweet-word, and word-word relations based on the hierarchical heterogeneous network. In addition to embedding the hashtags, our proposed framework is capable of embedding the short social texts as well. Extensive experiments are conducted on two real-world datasets, and the results demonstrate the effectiveness of the proposed method.
Keywords:
Machine Learning Applications: Networks
Machine Learning: Unsupervised Learning
Natural Language Processing: Text Classification
Natural Language Processing: Embeddings