<-上一篇/Previous Article 下一篇/Next Article->

[1]郝洁,谢珺,苏婧琼,等.基于词加权LDA算法的无监督情感分类[J].智能系统学报,2016,11(4):539-545.[doi:10.11992/tis.201606007]
　HAO Jie,XIE Jun,SU Jingqiong,et al.An unsupervised approach for sentiment classification based on weighted latent dirichlet allocation[J].CAAI Transactions on Intelligent Systems,2016,11(4):539-545.[doi:10.11992/tis.201606007]

点击复制

基于词加权LDA算法的无监督情感分类

PDF下载 HTML

《智能系统学报》[ISSN 1673-4785/CN 23-1538/TP] 卷: 11 期数: 2016年第4期页码: 539-545 栏目: 学术论文—自然语言处理与理解出版日期: 2016-07-25

Title:: An unsupervised approach for sentiment classification based on weighted latent dirichlet allocation

作者:: 郝洁, 谢珺, 苏婧琼, 续欣莹, 韩晓霞; 太原理工大学信息工程学院, 山西晋中 030600

Author(s):: HAO Jie, XIE Jun, SU Jingqiong, XU Xinying, HAN Xiaoxia; Information Engineering College, Taiyuan University of Technology, Jinzhong 030600, China

关键词:: 情感分类; 主题情感混合模型; 主题模型; LDA; 加权算法

Keywords:: sentiment classification; topic and sentiment unification model; topic model; LDA; weighting algorithm

分类号:: TP391

DOI:: 10.11992/tis.201606007

摘要:: 主题情感混合模型可以有效地提取语料的主题信息和情感倾向。本文针对现有主题/情感分析方法主题间区分度较低的问题提出了一种词加权LDA算法（weighted latent dirichlet allocation algorithm，WLDA），该算法可以实现无监督的主题提取和情感分析。通过计算语料中词汇与情感种子词的距离，在吉布斯采样中对不同词汇赋予不同权重，利用每个主题下的关键词判断主题的情感倾向，进而得到每篇文档的情感分布。这种方法增强了具有情感倾向的词汇在采样过程中的影响，从而改善了主题间的区分性。实验表明，与JST（Joint Sentiment/Topic model）模型相比，WLDA不仅在采样中迭代速度快，也能够更好地实现主题提取和情感分类。

Abstract:: The topic and sentiment unification model can efficiently detect topics and emotions for a given corpus. Faced with the low discriminability of topics in sentiment/topic analysis methods, this paper proposes a novel method, the weighted latent dirichlet allocation algorithm (WLDA), which can acquire sentiments and topics without supervision. The model assigns weights to terms during Gibbs sampling by calculating the distance between seed words and terms, then counts the weights of key words to estimate the sentiment orientation of each topic and obtain the emotional distribution throughout documents. This method enhances the impact of words that convey emotional attitudes and obtains more discriminative topics as a consequence. The experiments show that WLDA, compared with the joint sentiment/topic model (JST), not only has a higher iteration sampling speed, but also gives better results for topic extraction and sentiment classification.

参考文献/References:: [1] AGARWAL B, PORIA S, MITTAL N, et al. Concept-level sentiment analysis with dependency-based semantic parsing:a novel approach[J]. Cognitive computation, 2015, 7(4):487-499.
[2] CAMBRIA E. Affective computing and sentiment analysis[J]. IEEE intelligent systems, 2016, 31(2):102-107.
[3] LIN Chenghua, HE Yulan. Joint sentiment/topic model for sentiment analysis[C]//Proceedings of the 18th ACM Conference on Information and Knowledge Management. Hong Kong, China:ACM, 2009:375-384.
[4] LIN Chenghua, HE Yulan, EVERSON R. A comparative study of Bayesian models for unsupervised sentiment detection[C]//Proceedings of the Fourteenth Conference on Computational Natural Language Learning. Stroudsburg, PA, USA:ACM, 2011:144-152.
[5] TITOV I, MCDONALD R. A joint model of text and aspect ratings for sentiment summarization[C]//Proceedings of Annual Meeting of the Computational Linguistics. Columbus, USA:Association for Computational Linguistics, 2008:308-316.
[6] PAUL M, GIRJU R. A two-dimensional topic-aspect model for discovering multi-faceted topics[C]//Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence. Atlanta, USA:AAAI, 2010:545-550.
[7] MEI Qiaozhu, LING Xu, WONDRA M, et al. Topic sentiment mixture:modeling facets and opinions in weblogs[C]//Proceedings of the 16th International Conference on World Wide Web. North Carolina, USA:ACM, 2010:171-180.
[8] JO Y, OH A H. Aspect and sentiment unification model for online review analysis[C]//Proceedings of the Fourth ACM International Conference on Web Search and Data Mining. Hong Kong, China:ACM, 2011:815-824.
[9] 欧阳继红, 刘燕辉, 李熙铭, 等. 基于LDA的多粒度主题情感混合模型[J]. 电子学报, 2015, 43(9):1875-1880. OUYANG Jihong, LIU Yanhui, LI Ximing, et al. Multi-grain sentiment/topic model based on LDA[J]. Acta electronica sinica, 2015, 43(9):1875-1880.
[10] BLEI D M, NG A Y, JORDAN M I. Latent dirichlet allocation[J]. The journal of machine learning research, 2003, 3:993-1022.
[11] RUBIN T N, CHAMBERS A, SMYTH P, et al. Statistical topic models for multi-label document classification[J]. Machine learning, 2012, 88(1/2):157-208.
[12] ANDRZEJEWSKI D, BUTTLER D. Latent topic feedback for information retrieval[C]//Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. San Diego, USA:ACM, 2011:600-608.
[13] WALLACH H M. Topic modeling:beyond bag-of-words[C]//Proceedings of the 23rd International Conference on Machine Learning. New York, USA:ACM, 2006:977-984.
[14] CHURCH K W, HANKS P. Word association norms, mutual information, and lexicography[J]. Computational linguistics, 1990, 16(1):22-29.
[15] TURNEY P D, LITTMAN M L. Measuring praise and criticism:inference of semantic orientation from association[J]. ACM transactions on information systems, 2003, 21(4):315-346.
[16] 张小平. 主题模型及其在中医临床诊疗中的应用研究[D]. 北京:北京交通大学, 2011:57-58. ZHANG Xiaoping. Study on topic model and its application to TCM clinical diagnosis and treatment[D]. Beijing:Beijing Jiaotong University, 2011:57-58.
[17] ALSUMAIT L, BARBARá D, GENTLE J, et al. Topic significance ranking of LDA generative models[C]//Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases. Bled, Slovenia:ACM, 2009:67-82.

相似文献/References:: [1]夏睿,宗成庆.情感文本分类混合模型及特征扩展策略[J].智能系统学报,2011,6(6):483.
　XIA Rui,ZONG Chengqing.A hybrid approach to sentiment classification and feature expansion strategy[J].CAAI Transactions on Intelligent Systems,2011,6():483.
[2]高庆吉,赵志华,徐达,等.语音情感识别研究综述[J].智能系统学报,2020,15(1):1.[doi:10.11992/tis.201904065]
　GAO Qingji,ZHAO Zhihua,XU Da,et al.Review on speech emotion recognition research[J].CAAI Transactions on Intelligent Systems,2020,15():1.[doi:10.11992/tis.201904065]
[3]曾碧卿,韩旭丽,王盛玉,等.层次化双注意力神经网络模型的情感分析研究[J].智能系统学报,2020,15(3):460.[doi:10.11992/tis.201812017]
　ZENG Biqing,HAN Xuli,WANG Shengyu,et al.Hierarchical double-attention neural networks for sentiment classification[J].CAAI Transactions on Intelligent Systems,2020,15():460.[doi:10.11992/tis.201812017]
[4]程艳,胡建生,赵松华,等.融合Transformer和交互注意力网络的方面级情感分类模型[J].智能系统学报,2024,19(3):728.[doi:10.11992/tis.202303016]
　CHENG Yan,HU Jiansheng,ZHAO Songhua,et al.Aspect-level sentiment classification model combining Transformer and interactive attention network[J].CAAI Transactions on Intelligent Systems,2024,19():728.[doi:10.11992/tis.202303016]

备注/Memo

收稿日期:2016-06-02。
基金项目:山西省回国留学人员科研项目（2015-045，2013-033）；山西省留学回国人员科技活动择优资助项目（2013）；山西省自然科学基金项目（2014011018-2）.
作者简介:郝洁,女,1992年生,硕士研究生,主要研究方向为自然语言处理、粗糙集;谢珺,女,1979年生,副教授,主要研究方向为粒计算、粗糙集、数据挖掘、智能信息处理;苏婧琼,女,1991年生,硕士研究生,主要研究方向为自然语言处理、粒计算。
通讯作者:谢珺.E-mail:xiejun@tyut.edu.cn.

更新日期/Last Update: 1900-01-01

基于词加权LDA算法的无监督情感分类 PDF下载HTML

备注/Memo

基于词加权LDA算法的无监督情感分类

PDF下载 HTML