Chinese micro-blog sentiment classification through a novel hybrid learning model

Fang-fang Li, Huan-ting Wang, Rong-chang Zhao, Xi-yao Liu, Yan-zhen Wang, Bei-ji Zou

Journal of Central South University, 2017, Vol. 24, Issue (10): 2322-2330.

DOI: 10.1007/s11771-017-3644-0

Abstract

With the rise and spread of micro-blogs, sentiment classification of short texts has become a research hotspot, and a number of methods have been developed over the past decade. However, because Chinese and English differ in syntax, semantics and pragmatics, sentiment classification methods that are effective for English Twitter may fail on Chinese micro-blogs. In addition, the colloquialism and conciseness of short Chinese texts introduce additional challenges to sentiment classification. In this work, a novel two-stage hybrid learning model was proposed for sentiment classification of Chinese micro-blogs. In the first stage, emotional scores were calculated over the whole dataset by an improved Chinese-oriented sentiment dictionary classification method, and data with extremely high or low scores were labeled directly. In the second stage, the remaining data were labeled by an integrated classification method based on a sentiment dictionary, support vector machine (SVM) and k-nearest neighbor (KNN). An improved feature selection method was adopted to enhance the discriminative power of the selected features. The two-stage hybrid framework made the proposed method effective for sentiment classification of Chinese micro-blogs. Experiments on the COAE2014 (Chinese Opinion Analysis Evaluation 2014) dataset show that the proposed method outperforms other schemes.
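The two-stage flow described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the toy lexicon, the thresholds `low` and `high`, the function names, and the use of a simple majority vote in the second stage are all assumptions made for illustration, since the abstract does not state how the dictionary, SVM and KNN outputs are integrated. The trained SVM and KNN models are represented by arbitrary callables that map a token list to +1 or -1, and the feature selection step is omitted.

```python
from typing import Callable, Dict, List, Sequence

# Hypothetical toy lexicon (word -> polarity weight). The paper uses an
# improved Chinese-oriented sentiment dictionary; these entries are made up.
TOY_LEXICON: Dict[str, float] = {
    "好": 1.0, "喜欢": 2.0, "棒": 2.0,
    "差": -1.0, "讨厌": -2.0, "烂": -2.0,
}

Classifier = Callable[[Sequence[str]], int]  # returns +1 or -1


def lexicon_score(tokens: Sequence[str],
                  lexicon: Dict[str, float] = TOY_LEXICON) -> float:
    """Sum the lexicon weights of the tokens (unknown tokens count as 0)."""
    return sum(lexicon.get(tok, 0.0) for tok in tokens)


def dictionary_vote(tokens: Sequence[str]) -> int:
    """Dictionary-based voter: +1 if the score is non-negative, else -1."""
    return 1 if lexicon_score(tokens) >= 0 else -1


def two_stage_classify(docs: Sequence[Sequence[str]],
                       classifiers: Sequence[Classifier],
                       low: float = -2.0,
                       high: float = 2.0) -> List[int]:
    """Label each tokenized document as +1 (positive) or -1 (negative).

    Stage 1: documents whose lexicon score is at or beyond the assumed
             thresholds are labeled directly.
    Stage 2: the remaining documents are labeled by a majority vote over
             the supplied classifiers (an assumed integration rule).
             Pass an odd number of classifiers so the vote cannot tie.
    """
    labels: List[int] = []
    for tokens in docs:
        score = lexicon_score(tokens)
        if score >= high:
            labels.append(1)
        elif score <= low:
            labels.append(-1)
        else:
            votes = sum(clf(tokens) for clf in classifiers)
            labels.append(1 if votes > 0 else -1)
    return labels
```

In practice, the second-stage voters would be the dictionary method plus wrappers around trained SVM and KNN models, for example `two_stage_classify(docs, [dictionary_vote, svm_predict, knn_predict])`, where `svm_predict` and `knn_predict` are hypothetical names for such wrappers and `docs` holds pre-segmented token lists.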

Keywords

Chinese micro-blog / short text / hybrid learning / sentiment classification

Cite this article

Fang-fang Li, Huan-ting Wang, Rong-chang Zhao, Xi-yao Liu, Yan-zhen Wang, Bei-ji Zou. Chinese micro-blog sentiment classification through a novel hybrid learning model. Journal of Central South University, 2017, 24(10): 2322-2330. DOI: 10.1007/s11771-017-3644-0


