首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于情感对象识别和情感规则的微博倾向性分析
引用本文:王泽辰,王树鹏,孙立远,张磊,王勇,郝冰川.基于情感对象识别和情感规则的微博倾向性分析[J].北京航空航天大学学报,2022,48(2):301-310.
作者姓名:王泽辰  王树鹏  孙立远  张磊  王勇  郝冰川
作者单位:1.中国科学院信息工程研究所, 北京 100193
基金项目:国家自然科学基金(61931019)~~;
摘    要:微博平台数据中含有大量反映用户情感喜恶的信息,对于涉及博文倾向性分析的应用尤为重要。现有的分析方法往往聚焦在博文情感的简单分类上,无法分析特定类型实体的微博倾向性。为解决微博倾向性分析问题,实现博文立场判定,采用半监督学习的方法,通过协同训练和主动学习,训练实体识别模型,并构建基于主成分分析的情感规则,提取句子的主成分,将口语化的文本规范化为指定格式。再利用指向性实体的正负面性、情感词的褒贬义及情感词充当的句子成分,实现情感分类的更深层次分析——立场判定。针对实际问题进行立场判定实验,在不同规模数据集上的自对比实验和他比实验显示,随着标注实体的博文数量增加,模型对博文立场判断的正确率持续提升,而且所提方法判断博文立场的正确率显著高于对比方法,相较已有研究方法分别提高了2.79%和10.00%。 

关 键 词:情感分析    立场判定    半监督学习    倾向性    情感规则    协同训练    主动学习
收稿时间:2020-08-09

Weibo tendency analysis based on sentimental object recognition and sentimental rules
WANG Zechen,WANG Shupeng,SUN Liyuan,ZHANG Lei,WANG Yong,HAO Bingchuan.Weibo tendency analysis based on sentimental object recognition and sentimental rules[J].Journal of Beijing University of Aeronautics and Astronautics,2022,48(2):301-310.
Authors:WANG Zechen  WANG Shupeng  SUN Liyuan  ZHANG Lei  WANG Yong  HAO Bingchuan
Institution:1.Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100193, China2.National Computer Network Emergency Response Technical Team/Coordination Center of China, Beijing 100085, China
Abstract:Weibo contains a large number of information reflecting users' likes and dislikes, which is important for popular trend judgment, precision marketing, public opinion monitoring, etc. However, the existing methods tend to focus on the classification of Weibo sentiment. In order to solve the problem of Weibo tendentiousness analysis and position detection, we employ semisupervised learning method, through collaborative training and active learning. We train entity recognition models and combine deep learning with emotional rules. Moreover, the sentiment rules based on principal component analysis are constructed to extract the main components of sentences, normalize the spoken text into the specified format. Then we use the positive and negative aspects of directional entities, the positive and negative meanings of emotional words, and the sentence components of emotional words to judge the tendency of blog posts, and conduct deeper analysis on position classification. Finally, the self comparison experiment and other comparison experiment on different scale data sets show that with the increase of the number of blog posts of labeled entities, the accuracy of the model continues to improve, and the accuracy of this method is significantly higher than the comparison method, which is 2.79% and 10.00% higher than the existing research methods. 
Keywords:
本文献已被 万方数据 等数据库收录!
点击此处可从《北京航空航天大学学报》浏览原始摘要信息
点击此处可从《北京航空航天大学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号