
A Method of Text Sentiment Classification by Extending Semantic Similar Sentiment Words

Posted by the Free考研考试 site editor, 2021-12-21

A Method of Text Sentiment Classification by Extending Semantic Similar Sentiment Words
Submitted: 2017-06-23
DOI:10.15918/j.tbit1001-0645.2018.11.009
Keywords: word embedding; Adaboost classification model; feature selection; Chinese comments; sentiment classification
Funding: Beijing Institute of Technology Basic Research Fund (20160542013); National "242" Program (2017A149)
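Authors (name; affiliation; e-mail):
Luo Senlin (罗森林); Information System & Security and Countermeasures Experiments Center, Beijing Institute of Technology, Beijing 100081
Mao Yanying (毛焱颖); Information System & Security and Countermeasures Experiments Center, Beijing Institute of Technology, Beijing 100081; 1023017632@qq.com
Pan Limin (潘丽敏); Information System & Security and Countermeasures Experiments Center, Beijing Institute of Technology, Beijing 100081
Chen Qianrou (陈倩柔); Information System & Security and Countermeasures Experiments Center, Beijing Institute of Technology, Beijing 100081
Wei Chao (魏超); Information System & Security and Countermeasures Experiments Center, Beijing Institute of Technology, Beijing 100081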
Abstract:
Text sentiment classification often suffers from underused sentiment-related semantic features and weak feature dimension reduction. To address both problems, this paper proposes a high-accuracy sentiment classification method for online comments that extends the feature set with semantically similar sentiment words and introduces statistical features between words. The method first trains a neural-network Skip-gram model to generate word embeddings, then uses an embedding similarity measure to add semantically similar words as sentiment features. Next, it reduces the feature dimension using statistical features between words. Finally, it builds an Adaboost classification model by weighting multiple weak classifiers and uses it to classify the sentiment of online comments. In experiments on public hotel-review and mobile-phone-review test sets, the method reaches classification accuracies of 90.96% and 93.67%, respectively. Extending semantically similar sentiment words enriches the sentiment semantic features of the text, and the inter-word statistical features give better feature dimension reduction; together they further improve text sentiment classification.
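The pipeline described in the abstract can be illustrated with a small, self-contained Python sketch. It is not the authors' implementation. The embedding vectors are hand-made stand-ins for Skip-gram output, the similarity threshold of 0.9 is an arbitrary assumption, and single-feature decision stumps are assumed as the weak classifiers. The statistical feature-reduction step is left out because the abstract does not say which statistic is used. All function and variable names are hypothetical.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def expand_sentiment_words(seeds, embeddings, threshold=0.9):
    """Add every vocabulary word whose similarity to some seed word is >= threshold."""
    expanded = set(seeds)
    for word, vec in embeddings.items():
        if any(s in embeddings and cosine(vec, embeddings[s]) >= threshold for s in seeds):
            expanded.add(word)
    return expanded

def stump_predict(x, j, polarity):
    """Weak classifier: vote `polarity` if feature j is present, else the opposite."""
    return polarity if x[j] else -polarity

def train_adaboost(X, y, rounds=3):
    """X: rows of 0/1 features; y: labels in {-1, +1}. Returns weighted stumps."""
    n = len(X)
    weights = [1.0 / n] * n
    model = []  # list of (alpha, feature index, polarity)
    for _ in range(rounds):
        best = None
        for j in range(len(X[0])):
            for polarity in (1, -1):
                err = sum(w for w, x, t in zip(weights, X, y)
                          if stump_predict(x, j, polarity) != t)
                if best is None or err < best[0]:
                    best = (err, j, polarity)
        err, j, polarity = best
        err = min(max(err, 1e-10), 1 - 1e-10)   # guard against log(0)
        alpha = 0.5 * math.log((1 - err) / err)  # weight of this weak classifier
        model.append((alpha, j, polarity))
        # Increase the weight of misclassified samples, then renormalize.
        weights = [w * math.exp(-alpha * t * stump_predict(x, j, polarity))
                   for w, x, t in zip(weights, X, y)]
        total = sum(weights)
        weights = [w / total for w in weights]
    return model

def predict(model, x):
    """Sign of the alpha-weighted vote of all stumps."""
    score = sum(alpha * stump_predict(x, j, polarity) for alpha, j, polarity in model)
    return 1 if score >= 0 else -1

# Toy embeddings (assumed values, standing in for trained Skip-gram vectors).
embeddings = {
    "good":  [0.90, 0.10, 0.00],
    "great": [0.85, 0.15, 0.05],
    "bad":   [-0.80, 0.20, 0.10],
    "awful": [-0.85, 0.10, 0.05],
    "room":  [0.00, 0.10, 0.90],
}
seeds = {"good", "bad"}
expanded = expand_sentiment_words(seeds, embeddings)
features = sorted(expanded)

def featurize(tokens):
    """Binary presence vector over the expanded sentiment vocabulary."""
    return [1 if f in tokens else 0 for f in features]

# Tiny labeled corpus: +1 for positive reviews, -1 for negative reviews.
reviews = [(["great", "room"], 1), (["good", "room"], 1),
           (["awful", "room"], -1), (["bad", "room"], -1)]
X = [featurize(tokens) for tokens, _ in reviews]
y = [label for _, label in reviews]
model = train_adaboost(X, y, rounds=3)
print(features, [predict(model, x) for x in X])
```

In this toy setting the seed words alone would miss reviews that contain only "great" or "awful"; the similarity-based expansion turns those words into features, which is the effect the abstract attributes to extending semantically similar sentiment words.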