[1]陈建美,林鸿飞,杨志豪.基于语法的情感词汇自动获取[J].智能系统学报,2009,4(02):100-106.
 CHEN Jian-mei,LIN Hong-fei,YANG Zhi-hao.Automatic acquisition of emotional vocabulary based on syntax[J].CAAI Transactions on Intelligent Systems,2009,4(02):100-106.
点击复制

基于语法的情感词汇自动获取(/HTML)
分享到:

《智能系统学报》[ISSN:1673-4785/CN:23-1538/TP]

卷:
第4卷
期数:
2009年02期
页码:
100-106
栏目:
出版日期:
2009-04-25

文章信息/Info

Title:
Automatic acquisition of emotional vocabulary based on syntax
文章编号:
1673-4785(2009)02-0100-07
作者:
陈建美林鸿飞杨志豪
大连理工大学电子与信息工程学院,辽宁大连116024
Author(s):
CHEN Jian-mei LIN Hong-fei YANG Zhi-hao
School of Electronic and Information Engineering, Dalian University of Technology, Dalian 116024, China
关键词:
情感词汇词汇自动获取情感计算条件随机域
Keywords:
emotional vocabulary automatic vocabulary acquisition affective computer conditional random field
分类号:
TP391.3
文献标志码:
A
摘要:
情感计算是目前人工智能领域的热门课题,而词汇的情感计算又是准确完成文本情感计算的基础.目前情感词汇的获取大多采用人工获取的方法,如何自动地获取情感词汇,已成为当前情感计算研究亟需解决的问题.提出了情感词汇的自动提取机制,首先分析了情感词汇的一般语法规律,例如,重叠的规律,受否定词、程度副词修饰的规律等.然后在情感词汇的这些语法规律的基础上,运用CRF模型实现了情感词汇的自动获取.最后,分析了不同的语法规律对情感词汇自动获取的作用大小,并对实验结果进行了详细分析,实验结果表明情感词汇自动获取方法是有效的.
Abstract:
Affective computation has received more and more attentions in the field of artificial intelligence; however, the calculation of affective lexicon ontology is a requirement for affective computation of texts. At present, most emotional lexicons are obtained by manual methods. The automatic acquisition of emotional lexicons has become an urgent task that needs to be addressed. This paper presents an automatic acquisition method for emotional lexicons. The authors analyzed the general syntactical rules of emotional lexicons, such as the rules of overlapping words, then rules governing how these acquired emotional words were modified by privatives and degree adverbs. Then we used the conditional random fields (CRF) model to acquire emotional words based on the general rules. Finally, we analyzed the effects of various syntax rules on the automatic acquisition of emotional vocabulary. Experiments were done and the results showed that the proposed method is effective for automatic acquisition of emotional words.

参考文献/References:

[1]PICARD R W.Affective computing[M].Cambridge, MA:The MIT Press, 1997:36.
[2]LIU Hugo, SELKER T, LIEBERMAN H. Visualizing the affective structure of a text document[C]//Conference on Human Factors in Computing Systems. Florida, USA, 2003:740741.
[3]ZHANG Li, BARNDEN J A, HENDLEY R J,et al. Exploitation in affect detection in openended improvisational text[C]// The Annual Meeting of the Association of Computational Iinguistics.Sydney, 2006: 4755.
 [4]董振东, 董 强. 《知网》[EB/OL].[20070101].http://www.keenage.com.
 [5]徐琳宏, 林鸿飞, 潘  宇, 等. 情感词汇本体的构造[J]. 情报学报, 2008, 27(2): 180185.
 XU Linhong,LIN Hongfei, PAN Yu,et al. Constructing the affective lexicon ontology[J].Journal of the China Society for Scientific and Technical Information,2008,27(2):180185.
[6]董大年. 现代汉语分类词典[M]. 上海:汉语大词典出版社,1998:105110.
[7]王国璋. 汉语褒贬义词语用法词典[M]. 北京:华语教学出版社,2001:123128.
 [8]程志强.中华成语大词典[M]. 北京:中国大百科全书出版社,2003:5660.
 [9]许小颖, 陶建华. 汉语情感系统中情感划分的研究[C]//第一届中国情感计算及智能交互学术会议. 北京,2003:199205.
XU Xiaoying,TAO Jianhua.The study on affective word classfication in Chinese affective systems[C]//The Proceedings of the First Chinese Conference on Affective Computing and Intelligent Interaction.Beijing,2003:199205.
 [10]刘桐菊, 于 浩, 杨沐昀. 基于TFIDF的专业领域词汇获取的研究[C]// 第一届学生计算语言学研讨会.北京,2002:263267.
 LIU Tonghui, YU Hao, YANG Mujun.The research of term extraction in professional field[C]//The Proceedings of the First National Student Workshop on Computational Lingustics.Beijing,2002:263267.
 [11]DAILLE B. Study and implementation of combined techniques for automatic extraction of terminology[C]//The 32th Annual Meeting of the Association for Computational Linguistics. New Mexico, USA,1994:2936.
[12]张晓鹏. 汉语特定领域本体的自动构造研究[D].武汉: 华中师范大学, 2007.
ZHANG Xiaopeng.The study on automatic construction of ontology in special areas[D].Wuhan: Huazhong Normal University,2007.
[13]HATZIVASSILOGLOU V, MCKEOWN K R. Predicting the semantic orientation of adjectives[C]//Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and the 8th Conference of the European Chapter of the ACL. Madrid, Spain, 1997:174181
[14]YUEN R W M, CHAN T Y W . Morphemebased derivation of bipolar semantic orientation of Chinese words[C]//Proceedings of the 20th International Conference on Computational Linguistics.Geneva, Switzerland,2004: 10081014.
 [15]TURNEY P D,LITTMAN M L. Measuring praise and criticism: inference of semantic orientation from association[J]. ACM Transactions on Information System (TOIS), 2003,21(4):315346.
[16]MANNING C D, SCHUTZE H.统计自然语言处理基础[M]. 电子工业出版社, 2005: 111114.
[17]吴  晗. 汉语重叠研究综述[J]. 汉语学习, 2000,3:2833.
 [18]朱德熙. 现代汉语形容词研究[J].语言研究, 1956,1:19.
[19]陈 群. 近代汉语:程度副词研究[M]. 四川:巴蜀书社, 2006:3441.
[20]LAFFERTY J, MCCALLUM A, PEREIRA F. Conditional random fields: probabilistic models for segmenting and labeling sequence data[C]//18th International Conf on Machine Learning. San Francisco,USA:Morgan Kaufmann,2001: 282289.
【21]徐琳宏, 林鸿飞, 赵 晶. 情感语料库的构建和分析[J]. 中文信息学报, 2008, 22(1):116-122.

备注/Memo

备注/Memo:
收稿日期:2008-12-16.
基金项目: 国家自然科学基金资助项目(60373095,60673039);国家“863”计划资助项目(2006AA01Z151);教育部留学人员归国启动基金资助项目(教外留司[2007]1108号).
作者简介:
陈建美,女,1985年生,硕士研究生,主要研究方向为情感词汇自动获取和情感词汇消歧. 
 林鸿飞,男,1962年生,教授,博士生导师,现任《中文信息学报》编委,中文信息学会理事,中国中文信息学会信息检索专业委员会委员,中国人工智能学会离散数学专业委员会副主任,中国人工智能学会机器学习专业委员会委员.主要研究方向为搜索引擎、文本挖掘、情感计算和自然语言处理.主持多项国家自然科学基金和863计划项目,发表学术论文100余篇.
杨志豪,男,1973年生,副教授,博士,主要研究方向为文本挖掘和中文信息处理,发表学术论文20 余篇.
更新日期/Last Update: 2009-05-04