
Research on Methods for Constructing a Basic Chinese Sentiment Lexicon (cited by: 54)

Research on building Chinese basic semantic lexicon
Abstract: Determining the sentiment orientation of individual words is the foundation for analyzing the sentiment orientation of a text. A basic sentiment lexicon built from Chinese sentiment words provides a core subset for recognizing domain-specific sentiment words; it makes it possible to identify and expand the sentiment word set in a corpus effectively and to improve classification performance. Building on methods for computing Chinese word similarity, the paper proposes a method for computing the sentiment weight of Chinese sentiment words and, taking the HowNet sentiment word set as the baseline, constructs a basic Chinese sentiment lexicon. The lexicon is combined with TF-IDF feature weighting to determine the sentiment orientation of Chinese texts, and the experimental results show that the approach achieves good classification performance.
Authors: LIU Wei-ping (柳位平), ZHU Yan-hui (朱艳辉), LI Chun-liang (栗春亮), XIANG Hua-zheng (向华政), WEN Zhi-qiang (文志强) (Institute of Computer and Communication, Hunan University of Technology, Zhuzhou, Hunan 412008, China)
Source: Journal of Computer Applications (《计算机应用》), indexed in CSCD and the Peking University Core Journals list, 2009, No. 10, pp. 2875-2877 (3 pages)
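Funding: Natural Science Foundation of Hunan Province (05JJ30122); China National Packaging Corporation research grant (2008-XK13); Hunan Provincial Department of Education research grant (078014); Hunan University of Technology Graduate Innovation Fund (CX0812).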
Keywords: basic sentiment lexicon; orientation analysis; sentiment weight; seed word
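About the authors: LIU Wei-ping (b. 1981), male, from Shaoyang, Hunan, master's student; main research interest: text classification. ZHU Yan-hui (b. 1968), female, from Xiangtan, Hunan, professor, CCF senior member; main research interests: intelligent information processing, information retrieval, text classification. LI Chun-liang (b. 1984), male, from Handan, Hebei, master's student; main research interest: text classification. XIANG Hua-zheng (b. 1971), male, from Taoyuan, Hunan, associate professor, doctoral candidate; main research interest: intelligent information processing. WEN Zhi-qiang (b. 1973), male, from Xiangxiang, Hunan, associate professor, Ph.D.; main research interests: object detection, intelligent information processing.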
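
The pipeline outlined in the abstract (similarity-based sentiment weights, then TF-IDF-weighted scoring of a text) can be illustrated with a small sketch. The abstract does not give the paper's actual formulas, so everything here is an assumption: the weight of a word is taken as its mean similarity to positive seed words minus its mean similarity to negative seed words, `toy_similarity` is a hypothetical character-overlap stand-in for a HowNet-based similarity, and the seed words, candidate words, and documents are invented toy data.

```python
import math
from collections import Counter

def toy_similarity(a, b):
    """Hypothetical stand-in for a HowNet-based word similarity in [0, 1].
    Uses Jaccard overlap of characters, purely for illustration."""
    sa, sb = set(a), set(b)
    return len(sa & sb) / len(sa | sb)

def sentiment_weight(word, pos_seeds, neg_seeds, sim=toy_similarity):
    """Assumed weight: mean similarity to positive seeds minus
    mean similarity to negative seeds (not the paper's stated formula)."""
    pos = sum(sim(word, s) for s in pos_seeds) / len(pos_seeds)
    neg = sum(sim(word, s) for s in neg_seeds) / len(neg_seeds)
    return pos - neg

def build_lexicon(candidates, pos_seeds, neg_seeds, sim=toy_similarity):
    """Map each candidate word to its sentiment weight."""
    return {w: sentiment_weight(w, pos_seeds, neg_seeds, sim) for w in candidates}

def tf_idf(docs):
    """docs: list of token lists. Returns one {token: tf-idf} dict per document,
    with tf = count / doc length and idf = ln(N / df) + 1."""
    n = len(docs)
    df = Counter(t for d in docs for t in set(d))
    return [
        {t: (c / len(d)) * (math.log(n / df[t]) + 1.0) for t, c in Counter(d).items()}
        for d in docs
    ]

def classify(doc_tfidf, lexicon):
    """Sum tf-idf times sentiment weight over the lexicon words in the document."""
    score = sum(v * lexicon[t] for t, v in doc_tfidf.items() if t in lexicon)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"

# Toy data (assumed): seeds mean "good", "excellent" / "bad", "poor".
pos_seeds = ["好", "优秀"]
neg_seeds = ["坏", "差"]
lexicon = build_lexicon(["良好", "极差"], pos_seeds, neg_seeds)

# Pre-segmented toy documents: "service / fine" and "quality / very poor".
docs = [["服务", "良好"], ["质量", "极差"]]
labels = [classify(d, lexicon) for d in tf_idf(docs)]
print(lexicon)
print(labels)
```

In practice the similarity function would be replaced by a HowNet-based measure and the documents would come from a word segmenter; the sign-of-score decision rule is likewise only one possible choice.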
