Fine-Grained Emotional Elements Extraction and Affection Analysis Based on Cascaded Model
SUN Xiao, TANG Chen-Yi
Anhui Province Key Laboratory of Affective Computing and Advanced Intelligent Machine, School of Computer and Information, Hefei University of Technology, Hefei 230009
Abstract:For the fine-grained emotional elements extraction problem in product reviews,a cascaded model combining conditional random fields (CRFs) and support vector machine (SVM) is put forward. Aiming at the recognition of sentiment objects and emotional words, the review of syntactic and semantic informations are introduced into CRFs model to further improve the robustness of feature templates in CRFs. In SVM model, the features of deep semantic information of sentiment objects and emotional words and basic emotional orientation of emotional words are introduced to improve the traditional bag-of-words model. The sentiment of <sentiment object, emotional word> word pair is classified to acquire key information from product reviews, namely triples of (sentiment object, sentiment word, sentiment trend). Experimental results show that the proposed CRFs and SVM cascaded model efficiently improves the precision of emotional elements extraction and emotion classification.
孙晓,唐陈意. 基于层叠模型细粒度情感要素抽取及倾向分析*[J]. 模式识别与人工智能, 2015, 28(6): 513-520.
SUN Xiao, TANG Chen-Yi. Fine-Grained Emotional Elements Extraction and Affection Analysis Based on Cascaded Model. , 2015, 28(6): 513-520.
[1] Agrawal R, Imielinski T, Swami A. Mining Association Rules between Sets of Items in Large Databases // Proc of the ACM SIGMOD International Conference on Management of Data. Washington, USA, 1993: 207-216 [2] Liu B, Hu M Q, Cheng J S. Opinion Observer: Analyzing and Comparing Opinions on the Web // Proc of the 14th International Conference on World Wide Web. Chiba, Japan, 2005: 342-351 [3] Hu M Q, Liu B. Mining and Summarizing Customer Reviews // Proc of the 10th ACM SIGKDD International Conference on Know-ledge Discovery and Data Mining. Seattle, USA, 2004: 168-177 [4] Kim S M, Hovy E. Determining the Sentiment of Opinions // Proc of the 20th International Conference on Computational Linguistics. Geneva, Switzerland, 2004: 1367-1373 [5] Zhu Y L, Min J, Zhou Y Q, et al. Semantic Orientation Computing Based on HowNet. Journal of Chinese Information Processing, 2006, 20(1): 14-20 (in Chinese) (朱嫣岚,闵 锦,周雅倩,等.基于HowNet的词汇语义倾向计算.中文信息学报, 2006, 20(1): 14-20) [6] Liu Q, Li S J. Word's Semantic Similarity Computation Method Based on HowNet // Proc of the 3rd Chinese Lexical Semantic Workshop. Taibei, China, 2002: 59-76 (in Chinese) (刘 群,李素建.基于知网的词汇语义相似度的计算 // 第三届汉语词汇语义学研讨会论文集.台北, 2002: 59-76) [7] Lou D C, Yao T F. Semantic Polarity Analysis and Opinion Mining on Chinese Review Sentences. Computer Applications, 2006, 26(11): 2622-2625 (in Chinese) (娄德成,姚天日方.汉语句子语义极性分析和观点抽取方法的研究.计算机应用, 2006, 26(11): 2622-2625) [8] Li D, Cao F Y, Cao Y D, et al. Text Sentiment Classification Based on Phrase Patterns. Computer Science, 2008, 35(4): 132-134 (in Chinese) (李 钝,曹付元,曹元大,等.基于短语模式的文本情感分类研究.计算机科学, 2008, 35(4): 132-134) [9] Shi W, Wang H W, He S Y. Study on Public Sentiment Based on Microblogging Platform. Journal of the China Society for Scientific and Technical Information, 2012, 31(11): 1171-1178 (in Chinese) (史 伟,王洪伟,何绍义.基于微博平台的公众情感分析.情报学报, 2012, 31(11): 1171-1178) [10] Kim S M, Hovy E. Automatic Identification of Pro and Con Reasons in Online Reviews // Proc of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics. Sydney, Australia, 2006: 483-490 [11] Shi H X, Li X J. Research on Sentiment Orientation Analysis for Subjective Sentences. Journal of the China Society for Scientific and Technical Information, 2011, 30(5): 522-529 (in Chinese) (施寒潇,厉小军.主观性句子情感倾向性分析方法的研究.情报学报, 2011, 30(5): 522-529) [12] Zhen L J, Wang H W, Guo K Q. Sentiment Classification of Chinese Online Reviews: A Comparison between Sentences and Paragraphs. Journal of the China Society for Scientific and Technical Information, 2013, 32(4): 376-384 (in Chinese) (郑丽娟,王洪伟,郭恺强.中文网络评论的情感分类: 句子与段落的比较研究.情报学报, 2013, 32(4): 376-384) [13] Li B Y, Guan Y, Dong X S, et al. Single-Label Cascaded Model for Document Sentiment Analysis. Journal of Chinese Information Processing, 2012, 26(4): 3-8, 20 (in Chinese) (李本阳,关 毅,董喜双,等.基于单层标注级联模型的篇章情感倾向分析.中文信息学报, 2012, 26(4): 3-8, 20) [14] Lafferty J, McCallum A, Pereira F C N. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data // Proc of the 18th International Conference on Machine Learning. Williamstown, USA, 2001: 282-289 [15] Xu B, Zhao T J, Wang S Y, et al. Extraction of Opinion Targets Based on Shallow Parsing Features. Acta Automatica Sinica, 2011, 37(10): 1241-1247 (in Chinese) (徐 冰,赵铁军,王山雨,等.基于浅层句法特征的评价对象抽取研究.自动化学报, 2011, 37(10): 1241-1247) [16] Wang W P, Meng C C. Opinion Object Extraction Based on the Syntax Analysis and Dependence Analysis. Computer Systems & Applications, 2011, 20(8): 52-57 (in Chinese) (王卫平,孟翠翠.基于句法分析与依存分析的评价对象抽取.计算机系统应用, 2011, 20(8): 52-57) [17] Mikolov T, Chen K, Corrado G, et al. Efficient Estimation of Word Representations in Vector Space[EB/OL]. [2014-02-20]. http://arxiv.org/pdf/1301.3781v3.pdf [18] Sun X, Huang D G. Chinese Integrative Lexical Analysis Based on Maximum Matching and Second-Maximum Matching Segmentation. Journal of Dalian University of Technology, 2010, 50(6): 1028-1034 (in Chinese) (孙 晓,黄德根.基于最长次长匹配分词一体化中文词法分析.大连理工大学学报, 2010, 50(6): 1028-1034) [19] Sun X, Huang D G, Song H Y, et al. Chinese New Words Identification: A Latent Discriminative Model with Global Features. Journal of Computer Science and Technology, 2011, 26 (1): 14-24