透過您的圖書館登入
IP:18.116.65.119
  • 期刊
  • OpenAccess

Sentiment Analysis on Social Network: Using Emoticon Characteristics for Twitter Polarity Classification

摘要


In this paper, we describe a sentiment analysis system implemented for the semantic-evaluation task of message polarity classification for English on Twitter. Our system contains modules of data pre-processing, word embedding, and sentiment classification. In order to decrease the data complexity and increase the coverage of the word vector model for better learning, we perform a series of data pre-processing tasks, including emoticon normalization, specific suffix splitting, and hashtag segmentation. In word embedding, we utilize the pre-trained word vector provided by GloVe. We believe that emojis in tweets are important characteristics for Twitter sentiment classification, but most pre-trained sets of word vectors contain few or no emoji representations. Thus, we propose embedding emojis into the vector space by neural network models. We train the emoji vector with relevant words that contain descriptions and contexts of emojis. The models of long short-term memory (LSTM) and convolutional neural network (CNN) are used as our sentiment classifiers. The proposed emoji embedding is evaluated on the SemEval 2017 tasks. Using emoji embedding, we achieved recall rates of 0.652 with the LSTM classifier and 0.640 with the CNN classifier.

參考文獻


Barbieri, F.,Ronzano, F.,Saggion, H.(2016).What does this Emoji Mean? A Vector Space Skip-Gram Model for Twitter Emojis.Proceedings of the LREC 2016.(Proceedings of the LREC 2016).
Black, E.,Abney, S.,Flickenger, D.,Gdaniec, C.,Grishman, R.,Harrison, P.,Strzalkowski, T.(1991).A Procedure for Quantitatively Comparing the Syntactic Coverage of English Grammars.Proceedings of the Workshop on Speech and Natural language (HLT '91).(Proceedings of the Workshop on Speech and Natural language (HLT '91)).
Cliché, M.(2017).Task 4: Twitter Sentiment Analysis with CNNs and LSTMs.Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval 2017).(Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval 2017)).
Deriu, J.,Gonzenbach, M.,Uzdilli, F.,Lucchi, A.,De Luca, V.,Jaggi, M.(2016).SwissCheese at SemEval-2016 Task 4: Sentiment Classification Using an Ensemble of Convolutional Neural Networks with Distant Supervision.Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval 2016).(Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval 2016)).
Go, A.,Bhayani, R.,Huang, L.(2009).Twitter Sentiment Classification using Distant Supervision.CS224N Project Report.(CS224N Project Report).,::.

延伸閱讀