Rumor detection with self-supervised learning on texts and social graph

Yuan GAO; Xiang WANG; Xiangnan HE; Huamin FENG; Yongdong ZHANG

doi:10.1007/s11704-022-1531-9

PDF(6925 KB)

Front. Comput. Sci. ›› 2023, Vol. 17 ›› Issue (4) : 174611. DOI: 10.1007/s11704-022-1531-9

Information Systems

RESEARCH ARTICLE

Rumor detection with self-supervised learning on texts and social graph

Author information +

History +

Abstract

Rumor detection has become an emerging and active research field in recent years. At the core is to model the rumor characteristics inherent in rich information, such as propagation patterns in social network and semantic patterns in post content, and differentiate them from the truth. However, existing works on rumor detection fall short in modeling heterogeneous information, either using one single information source only (e.g., social network, or post content) or ignoring the relations among multiple sources (e.g., fusing social and content features via simple concatenation).

Therefore, they possibly have drawbacks in comprehensively understanding the rumors, and detecting them accurately. In this work, we explore contrastive self-supervised learning on heterogeneous information sources, so as to reveal their relations and characterize rumors better. Technically, we supplement the main supervised task of detection with an auxiliary self-supervised task, which enriches post representations via post self-discrimination.

Specifically, given two heterogeneous views of a post (i.e., representations encoding social patterns and semantic patterns), the discrimination is done by maximizing the mutual information between different views of the same post compared to that of other posts. We devise cluster-wise and instance-wise approaches to generate the views and conduct the discrimination, considering different relations of information sources. We term this framework as self-supervised rumor detection (SRD). Extensive experiments on three real-world datasets validate the effectiveness of SRD for automatic rumor detection on social media.

Graphical abstract

Keywords

rumor detection / graph neural networks / self-supervised learning / social media

Cite this article

EndNote

Ris (Procite)

Bibtex

Download citation ▾

Yuan GAO, Xiang WANG, Xiangnan HE, Huamin FENG, Yongdong ZHANG. Rumor detection with self-supervised learning on texts and social graph. Front. Comput. Sci., 2023, 17(4): 174611 https://doi.org/10.1007/s11704-022-1531-9

This is a preview of subscription content, contact us for subscripton.

Yuan Gao received the MS degree in Electrical and Computer Engineering from University of Michigan, USA in 2019. He is now a PhD student in the School of Cyberspace Science and Technology at the University of Science and Technology of China (USTC), China. His research interest lies in fraud detection, representation learning, and graph learning

Xiang Wang is now a professor at the University of Science and Technology of China (USTC), China. He received his PhD degree from National University of Singapore, Singapore in 2019. His research interests include recommender systems, graph learning, AI explainability, and AI security. He has published some academic papers on international conferences such as NeurIPS, ICLR, KDD, WWW, SIGIR, and AAAI. He serves as a program committee member for several top conferences such as SIGIR and WWW

Xiangnan He is a professor at the University of Science and Technology of China (USTC), China. He received his PhD in Computer Science from the National University of Singapore (NUS), Singapore. His research interests span information retrieval, data mining, and multi-media analytics. He has over 80 publications that appeared in several top conferences such as SIGIR, WWW, and MM, and journals including TKDE, TOIS, and TMM. His work has received the Best Paper Award Honorable Mention in WWW 2018 and ACM SIGIR 2016. He is in the editorial board of journals including Frontiers in Big Data, AI Open. Moreover, he has served as the PC chair of CCIS 2019 and SPC/PC member for several top conferences including SIGIR, WWW, KDD, MM, WSDM, ICML, etc., and the regular reviewer for journals including TKDE, TOIS, TMM

Huamin Feng is now a professor at Beijing Electronic Science and Technology Institute, China. He received his PhD dergree from National University of Singapore, Singapore in 2005. His research interests include multimedia semantic analysis, recommend system, and Web content analysis. He has published some academic papers on international conferences such as WWW, SIGIR, and MMM

Yongdong Zhang (Senior Member, IEEE) received the PhD degree in electronic engineering from Tianjin University, China in 2002. He is currently a Professor with the University of Science and Technology of China. He has authored more than 100 refereed journal and conference papers. His current research interests include multimedia content analysis and understanding, multimedia content security, video encoding, and streaming media technology. He was a recipient of the Best Paper Award in PCM2013, ICIMCS 2013, and ICME 2010; and the Best Paper Candidate in ICME 2011. He serves as an Editorial Board Member for Multimedia Systems journal and Neurocomputing

References

Publishing order | Descend order by publishing year | Descend order by cited within

[1]	Farajtabar M, Yang J, Ye X, Xu H, Trivedi R, Khalil E, Li S, Song L, Zha H. Fake news mitigation via point process based intervention. In: Proceedings of the 34th International Conference on Machine Learning. 2017, 1097–1106

[2]

Jin Z, Cao J, Guo H, Zhang Y, Wang Y, Luo J. Detection and analysis of 2016 US presidential election related rumors on twitter. In: Proceedings of the 10th International Conference on Social Computing, Behavioral-Cultural Modeling and Prediction and Behavior Representation in Modeling and Simulation. 2017, 14–24

[3]	Ruchansky N, Seo S, Liu Y. CSI: a hybrid deep model for fake news detection. In: Proceedings of 2017 ACM on Conference on Information and Knowledge Management. 2017, 797–806

[4]	Shu K, Cui L, Wang S, Lee D, Liu H. dEFEND: explainable fake news detection. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2019, 395–405

[5]	Bian T, Xiao X, Xu T, Zhao P, Huang W, Rong Y, Huang J. Rumor detection on social media with bi-directional graph convolutional networks. In: Proceedings of the 34th AAAI Conference on Artificial Intelligence. 2020, 549–556

[6]	Wu Z, Pi D, Chen J, Xie M, Cao J . Rumor detection based on propagation graph neural network with attention mechanism. Expert Systems with Applications, 2020, 158: 113595

[7]	Ma J, Gao W, Wong K F. Rumor detection on twitter with tree-structured recursive neural networks. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. 2018, 1980–1989

[8]	Yang X, Lyu Y, Tian T, Liu Y, Liu Y, Zhang X. Rumor detection on social media with graph structured adversarial learning. In: Proceedings of the 29th International Joint Conference on Artificial Intelligence. 2020, 1417–1423

[9]	Nguyen V H, Sugiyama K, Nakov P, Kan M Y. FANG: leveraging social context for fake news detection using graph representation. In: Proceedings of the 29th ACM International Conference on Information & Knowledge Management. 2020, 1165–1174

[10]	Foroozani A, Ebrahimi M . Anomalous information diffusion in social networks: Twitter and Digg. Expert Systems with Applications, 2019, 134: 249–266

[11]	Zubiaga A, Liakata M, Procter R, Hoi G W S, Tolmie P . Analysing how people orient to and spread rumours in social media by looking at conversational threads. PLoS One, 2016, 11( 3): e0150989

[12]	Chen T, Kornblith S, Norouzi M, Hinton G. A simple framework for contrastive learning of visual representations. In: Proceedings of the 37th International Conference on Machine Learning. 2020, 149

[13]	Devlin J, Chang M W, Lee K, Toutanova K. BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2019, 4171–4186

[14]	He K, Fan H, Wu Y, Xie S, Girshick R. Momentum contrast for unsupervised visual representation learning. In: Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020, 9726–9735

[15]	Castillo C, Mendoza M, Poblete B. Information credibility on twitter. In: Proceedings of the 20th International Conference on World Wide Web. 2011, 675–684

[16]	Yang F, Liu Y, Yu X, Yang M. Automatic detection of rumor on Sina Weibo. In: Proceedings of the ACM SIGKDD Workshop on Mining Data Semantics. 2012, 13

[17]	Ma J, Gao W, Wei Z, Lu Y, Wong K F. Detect rumors using time series of social context information on microblogging websites. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. 2015, 1751–1754

[18]	Liu X, Nourbakhsh A, Li Q, Fang R, Shah S. Real-time rumor debunking on twitter. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. 2015, 1867–1870

[19]	Ma J, Gao W, Wong K F. Detect rumors in microblog posts using propagation structure via kernel learning. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. 2017, 708–717

[20]	Ma J, Gao W, Mitra P, Kwon S, Jansen B J, Wong K F, Cha M. Detecting rumors from microblogs with recurrent neural networks. In: Proceedings of the 25th International Joint Conference on Artificial Intelligence. 2016, 3818–3824

[21]	Jin Z, Cao J, Guo H, Zhang Y, Luo J. Multimodal fusion with recurrent neural networks for rumor detection on microblogs. In: Proceedings of the 25th ACM international conference on Multimedia. 2017, 795–816

[22]	Chen T, Li X, Yin H, Zhang J. Call attention to rumors: deep attention based recurrent neural networks for early rumor detection. In: Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining. 2018, 40–52

[23]	Guo H, Cao J, Zhang Y, Guo J, Li J. Rumor detection with hierarchical social attention network. In: Proceedings of the 27th ACM International Conference on Information and Knowledge Management. 2018, 943–951

[24]	Li Q, Zhang Q, Si L. Rumor detection by exploiting user credibility information, attention and multi-task learning. In: Proceedings of the 57th Conference of the Association for Computational Linguistics. 2019, 1173–1179

[25]	Yu F, Liu Q, Wu S, Wang L, Tan T. A convolutional approach for misinformation identification. In: Proceedings of the 26th International Joint Conference on Artificial Intelligence. 2017, 3901–3907

[26]	Kumar S, Asthana R, Upadhyay S, Upreti N, Akbar M . Fake news detection using deep learning models: a novel approach. Transactions on Emerging Telecommunications Technologies, 2020, 31( 2): e3767

[27]	Liu Y, Wu Y F B. Early detection of fake news on social media through propagation path classification with recurrent and convolutional networks. In: Proceedings of the 32nd AAAI Conference on Artificial Intelligence. 2018, 354–361

[28]	Rao D, Miao X, Jiang Z, Li R. STANKER: stacking network based on level-grained attention-masked BERT for rumor detection on social media. In: Proceedings of 2021 Conference on Empirical Methods in Natural Language Processing. 2021, 3347–3363

[29]	Song Y Z, Chen Y S, Chang Y T, Weng S Y, Shuai H H. Adversary-aware rumor detection. In: Proceedings of the 59th Findings of the Association for Computational Linguistics. 2021, 1371–1382

[30]	Li J, Ni S, Kao H Y. Meet the truth: leverage objective facts and subjective views for interpretable rumor detection. In: Proceedings of the 59th Findings of the Association for Computational Linguistics. 2021, 705–715

[31]	Kipf T N, Welling M. Semi-supervised classification with graph convolutional networks. In: Proceedings of the 5th International Conference on Learning Representations. 2017

[32]	Zhang M, Chen Y. Link prediction based on graph neural networks. In: Proceedings of the 32nd International Conference on Neural Information Processing Systems. 2018, 5171–5181

[33]	Pan S, Wu J, Zhu X, Zhang C, Yu P S . Joint structure feature exploration and regularization for multi-task graph classification. IEEE Transactions on Knowledge and Data Engineering, 2016, 28( 3): 715–728

[34]	Wang D, Cui P, Zhu W. Structural deep network embedding. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2016, 1225–1234

[35]	You J, Ying R, Ren X, Hamilton W L, Leskovec J. GraphRNN: generating realistic graphs with deep auto-regressive models. In: Proceedings of the 35th International Conference on Machine Learning. 2018

[36]	Wu Z, Pan S, Chen F, Long G, Zhang C, Yu P S . A comprehensive survey on graph neural networks. IEEE Transactions on Neural Networks and Learning Systems, 2021, 32( 1): 4–24

[37]	Niepert M, Ahmed M, Kutzkov K. Learning convolutional neural networks for graphs. In: Proceedings of the 33rd International Conference on Machine Learning. 2016, 2014–2023

[38]	Hamilton W L, Ying R, Leskovec J. Inductive representation learning on large graphs. In: Proceedings of the 31st International Conference on Neural Information Processing Systems. 2017, 1025–1035

[39]	Veličković P, Cucurull G, Casanova A, Romero A, Liò P, Bengio Y. Graph attention networks. In: Proceedings of the 6th International Conference on Learning Representations. 2018

[40]	He X, Deng K, Wang X, Li Y, Zhang Y, Wang M. LightGCN: simplifying and powering graph convolution network for recommendation. In: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 2020, 639–648

[41]	Wang X, He X, Cao Y, Liu M, Chua T S. KGAT: knowledge graph attention network for recommendation. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2019, 950–958

[42]	Zeng J, Wang X, Liu J, Chen Y, Liang Z, Chua T S, Chua Z L. SHADEWATCHER: recommendation-guided cyber threat analysis using system audit records. In: Proceedings of 2022 IEEE Symposium on Security and Privacy. 2022

[43]	Lin H, Ma J, Cheng M, Yang Z, Chen L, Chen G. Rumor detection on twitter with claim-guided hierarchical graph attention networks. In: Proceedings of 2021 Conference on Empirical Methods in Natural Language Processing. 2021, 10035−10047

[44]	van den Oord A, Kalchbrenner N, Vinyals O, Espeholt L, Graves A, Kavukcuoglu K. Conditional image generation with PixelCNN decoders. In: Proceedings of the 30th International Conference on Neural Information Processing Systems. 2016, 4797–4805

[45]	Hjelm R D, Fedorov A, Lavoie-Marchildon S, Grewal K, Bachman P, Trischler A, Bengio Y. Learning deep representations by mutual information estimation and maximization. In: Proceedings of the 7th International Conference on Learning Representations. 2019

[46]	Goodfellow I J, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y. Generative adversarial nets. In: Proceedings of the 27th International Conference on Neural Information Processing Systems. 2014, 2672–2680

[47]	Gutmann M, Hyvärinen A. Noise-contrastive estimation: a new estimation principle for unnormalized statistical models. In: Proceedings of the 13th International Conference on Artificial Intelligence and Statistics. 2010, 297–304

[48]	Liu X, Zhang F, Hou Z, Mian L, Wang Z, Zhang J, Tang J. Self-supervised learning: generative or contrastive. IEEE Transactions on Knowledge and Data Engineering, 2021, doi: 10.1109/TKDE.2021.3090866

[49]	van den Oord A, Li Y, Vinyals O. Representation learning with contrastive predictive coding. 2019, arXiv preprint arXiv: 1807.03748

[50]	Tschannen M, Djolonga J, Rubenstein P K, Gelly S, Lucic M. On mutual information maximization for representation learning. In: Proceedings of the 8th International Conference on Learning Representations. 2020

[51]	Caron M, Bojanowski P, Joulin A, Douze M. Deep clustering for unsupervised learning of visual features. In: Proceedings of the 15th European Conference on Computer Vision. 2018, 139–156

[52]	Caron M, Misra I, Mairal J, Goyal P, Bojanowski P, Joulin A. Unsupervised learning of visual features by contrasting cluster assignments. In: Proceedings of the 34th International Conference on Neural Information Processing Systems. 2020, 831

[53]	Alwassel H, Mahajan D, Korbar B, Torresani L, Ghanem B, Tran D. Self-supervised learning by cross-modal audio-video clustering. In: Proceedings of the 34th International Conference on Neural Information Processing Systems. 2020, 818

[54]	Veličković P, Fedus W, Hamilton W L, Liò P, Bengio Y, Hjelm R D. Deep graph infomax. In: Proceedings of the 7th International Conference on Learning Representations. 2019

[55]	Kim D, Oh A. How to find your friendly neighborhood: graph attention design with self-supervision. In: Proceedings of the 9th International Conference on Learning Representations. 2021

[56]	Wu J, Wang X, Feng F, He X, Chen L, Lian J, Xie X. Self-supervised graph learning for recommendation. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2020, 726–735

[57]	Liu Y, Xu S, Tourassi G D. Detecting rumors through modeling information propagation networks in a social media environment. In: Proceedings of 8th International Conference, SBP 2015. 2015

[58]	Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez A N, Kaiser Ł, Polosukhin I. Attention is all you need. In: Proceedings of the 31st International Conference on Neural Information Processing Systems. 2017, 6000–6010

[59]	Bello I, Zoph B, Le Q, Vaswani A, Shlens J. Attention augmented convolutional networks. In: Proceedings of 2019 IEEE/CVF International Conference on Computer Vision. 2019, 3285–3294

[60]	Wang X, Girshick R, Gupta A, He K. Non-local neural networks. In: Proceedings of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2018, 7794–7803

[61]	Kim Y. Convolutional neural networks for sentence classification. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. 2014, 1746–1751

[62]	Zhang Z K, Liu C, Zhan X X, Lu X, Zhang C X, Zhang Y C . Dynamics of information diffusion and its applications on complex networks. Physics Reports, 2016, 651: 1–34

[63]	Huang Q, Zhou C, Wu J, Liu L, Wang B. Deep spatial–temporal structure learning for rumor detection on twitter. Neural Computing and Applications, 2020, doi: 10.1007/s00521-020-05236-4

[64]

Wei L, Hu D, Zhou W, Yue Z, Hu S. Towards propagation uncertainty: edge-enhanced Bayesian graph convolutional networks for rumor detection. In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing. 2021, 3845−3854

[65]	Kingma D P, Ba J. Adam: a method for stochastic optimization. In: Proceedings of the 3rd International Conference on Learning Representations. 2015

[66]	Loshchilov I, Hutter F. SGDR: stochastic gradient descent with warm restarts. In: Proceedings of the 5th International Conference on Learning Representations. 2017

[67]	Mikolov T, Sutskever I, Chen K, Corrado G, Dean J. Distributed representations of words and phrases and their compositionality. In: Proceedings of the 26th International Conference on Neural Information Processing Systems. 2013, 3111−3119

[68]	Glorot X, Bengio Y. Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the 13th International Conference on Artificial Intelligence and Statistics. 2010, 249−256

[69]	Li Q, Han Z, Wu X M. Deeper insights into graph convolutional networks for semi-supervised learning. In: Proceedings of the 32th AAAI Conference on Artificial Intelligence. 2018, 3538−3545

[70]	Wang F, Liu H. Understanding the behaviour of contrastive loss. In: Proceedings of 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021, 2495−2504

[71]	Farinneya P, Pour M M A, Hamidian S, Diab M. Active learning for rumor identification on social media. In: Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021. 2021, 4556−4565

[72]	Xia R, Xuan K, Yu J. A state-independent and time-evolving network for early rumor detection in social media. In: Proceedings of 2020 Conference on Empirical Methods in Natural Language Processing. 2020, 9042−9051

[73]	Chami I, Ying R, Re C, Leskovec J. Hyperbolic graph convolutional neural networks. In: Proceedings of the 33rd International Conference on Neural Information Processing Systems. 2019, 438

[74]	Wu Y X, Wang X, Zhang A, He X, Chua T S. Discovering invariant rationales for graph neural networks. In: Proceedings of the ICLR 2022. 2022

[75]	Wang X, Wu Y X, Zhang A, He X, Chua T S. Towards multi-grained explainability for graph neural networks. In: Proceedings of the 34th Annual Conference on Neural Information Processing Systems. 2021

[76]	Li Y, Wang X, Xiao J, Ji W, Chua T S. Invariant grounding for video question answering. 2022, arXiv preprint arXiv: 2206.02349

Acknowledgements

This work was supported by the National Key Research and Development Program of China (2020AAA0106000), the National Natural Science Foundation of China (Grant Nos. U21B2026, 62121002), and the CCCD Key Lab of Ministry of Culture and Tourism.