Exploiting user comments for early detection of fake news prior to users’ commenting

Qiong NAN , Qiang SHENG , Juan CAO , Yongchun ZHU , Danding WANG , Guang YANG , Jintao LI

Front. Comput. Sci. ›› 2025, Vol. 19 ›› Issue (10) : 1910354

PDF (2155KB)
Front. Comput. Sci. ›› 2025, Vol. 19 ›› Issue (10) : 1910354 DOI: 10.1007/s11704-024-40674-6
Artificial Intelligence
RESEARCH ARTICLE

Exploiting user comments for early detection of fake news prior to users’ commenting

Author information +
History +
PDF (2155KB)

Abstract

Both accuracy and timeliness are key factors in detecting fake news on social media. However, most existing methods encounter an accuracy-timeliness dilemma: Content-only methods guarantee timeliness but perform moderately because of limited available information, while social context-based ones generally perform better but inevitably lead to latency because of social context accumulation needs. To break such a dilemma, a feasible but not well-studied solution is to leverage social contexts (e.g., comments) from historical news for training a detection model and apply it to newly emerging news without social contexts. This requires the model to (1) sufficiently learn helpful knowledge from social contexts, and (2) be well compatible with situations that social contexts are available or not. To achieve this goal, we propose to absorb and parameterize useful knowledge from comments in historical news and then inject it into a content-only detection model. Specifically, we design the Comments ASsisted FakENews Detection method (CAS-FEND), which transfers useful knowledge from a comment-aware teacher model to a content-only student model and detects newly emerging news with the student model. Experiments show that the CAS-FEND student model outperforms all content-only methods and even comment-aware ones with 1/4 comments as inputs, demonstrating its superiority for early detection.

Graphical abstract

Keywords

fake news detection / knowledge distillation / early detection

Cite this article

Download citation ▾
Qiong NAN, Qiang SHENG, Juan CAO, Yongchun ZHU, Danding WANG, Guang YANG, Jintao LI. Exploiting user comments for early detection of fake news prior to users’ commenting. Front. Comput. Sci., 2025, 19(10): 1910354 DOI:10.1007/s11704-024-40674-6

登录浏览全文

4963

注册一个新账户 忘记密码

References

[1]

BBC. Mast fire probe amid 5G coronavirus claims, 2020

[2]

Fang L, Feng K, Zhao K, Hu A, Li T . Unsupervised rumor detection based on propagation tree VAE. IEEE Transactions on Knowledge and Data Engineering, 2023, 35( 10): 10309–10323

[3]

Shu K, Cui L, Wang S, Lee D, Liu H. dEFEND: explainable fake news detection. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2019, 395−405

[4]

Zhang X, Cao J, Li X, Sheng Q, Zhong L, Shu K. Mining dual emotion for fake news detection. In: Proceedings of the Web Conference 2021. 2021, 3465−3476

[5]

Castillo C, Mendoza M, Poblete B. Information credibility on twitter. In: Proceedings of the 20th International Conference on World Wide Web. 2011, 675−684

[6]

Shu K, Wang S, Liu H. Beyond news contents: the role of social context for fake news detection. In: Proceedings of the 12th ACM International Conference on Web Search and Data Mining. 2019, 312−320

[7]

Shu K, Wang S, Liu H. Understanding user profiles on social media for fake news detection. In: Proceedings of 2018 IEEE Conference on Multimedia Information Processing and Retrieval. 2018, 430−435

[8]

Potthast M, Kiesel J, Reinartz K, Bevendorff J, Stein B. A stylometric inquiry into hyperpartisan and fake news. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. 2018, 231−240

[9]

Przybyla P. Capturing the style of fake news. In: Proceedings of the 34th AAAI Conference on Artificial Intelligence. 2020, 490−497

[10]

Wang Y, Ma F, Jin Z, Yuan Y, Xun G, Jha K, Su L, Gao J. EANN: event adversarial neural networks for multi-modal fake news detection. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2018, 849−857

[11]

Kaliyar R K, Goswami A, Narang P . FakeBERT: fake news detection in social media with a BERT-based deep learning approach. Multimedia Tools and Applications, 2021, 80( 8): 11765–11788

[12]

Shu K, Zheng G, Li Y, Mukherjee S, Awadallah A H, Ruston S, Liu H. Early detection of fake news with multi-source weak social supervision. In: Proceedings of European Conference on Machine Learning and Knowledge Discovery in Databases. 2020, 650−666

[13]

Shu K, Sliva A, Wang S, Tang J, Liu H . Fake news detection on social media: a data mining perspective. ACM SIGKDD Explorations Newsletter, 2017, 19( 1): 22–36

[14]

Guo B, Ding Y, Sun Y, Ma S, Li K, Yu Z . The mass, fake news, and cognition security. Frontiers of Computer Science, 2021, 15( 3): 153806

[15]

Ma J, Gao W, Mitra P, Kwon S, Jansen B J, Wong K F, Cha M. Detecting rumors from microblogs with recurrent neural networks. In: Proceedings of the 25th International Joint Conference on Artificial Intelligence. 2016, 3818−3824

[16]

Hussain M, Cheng C, Xu R, Afzal M . CNN-Fusion: an effective and lightweight phishing detection method based on multi-variant ConvNet. Information Sciences, 2023, 631: 328–345

[17]

Silva A, Luo L, Karunasekera S, Leckie C. Embracing domain differences in fake news: cross-domain fake news detection using multi-modal data. In: Proceedings of the 35th AAAI conference on artificial intelligence. 2021, 557−565

[18]

Nan Q, Wang D, Zhu Y, Sheng Q, Shi Y, Cao J, Li J. Improving fake news detection of influential domain via domain- and instance-level transfer. In: Proceedings of the 29th International Conference on Computational Linguistics. 2022, 2834−2848

[19]

Huang Y, Gao M, Wang J, Yin J, Shu K, Fan Q, Wen J . Meta-prompt based learning for low-resource false information detection. Information Processing & Management, 2023, 60( 3): 103279

[20]

Lin H, Ma J, Chen L, Yang Z, Cheng M, Guang C. Detect rumors in Microblog posts for low-resource domains via adversarial contrastive learning. In: Proceedings of Findings of the Association for Computational Linguistics: NAACL 2022. 2022, 2543−2556

[21]

Zhu Y, Sheng Q, Cao J, Li S, Wang D, Zhuang F. Generalizing to the future: mitigating entity bias in fake news detection. In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2022, 2120−2125

[22]

Sheng Q, Cao J, Zhang X, Li R, Wang D, Zhu Y. Zoom out and observe: news environment perception for fake news detection. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics. 2022, 4543−4556

[23]

Sheng Q, Zhang X, Cao J, Zhong L. Integrating pattern- and fact-based fake news detection via model preference learning. In: Proceedings of the 30th ACM International Conference on Information & Knowledge Management. 2021, 1640−1650

[24]

Qi P, Cao J, Li X, Liu H, Sheng Q, Mi X, He Q, Lv Y, Guo C, Yu Y. Improving fake news detection by using an entity-enhanced framework to fuse diverse multimodal clues. In: Proceedings of the 29th ACM International Conference on Multimedia. 2021, 1212−1220

[25]

Wu L, Liu P, Zhang Y. See how you read? Multi-reading habits fusion reasoning for multi-modal fake news detection. In: Proceedings of the 37th AAAI Conference on Artificial Intelligence. 2023, 13736−13744

[26]

Sun M, Zhang X, Ma J, Xie S, Liu Y, Yu P S . Inconsistent matters: a knowledge-guided dual-consistency network for multi-modal rumor detection. IEEE Transactions on Knowledge and Data Engineering, 2023, 35( 12): 12736–12749

[27]

Hu L, Chen Z, Zhao Z, Yin J, Nie L . Causal inference for leveraging image-text matching bias in multi-modal fake news detection. IEEE Transactions on Knowledge and Data Engineering, 2023, 35( 11): 11141–11152

[28]

Hu L, Zhao Z, Qi W, Song X, Nie L . Multimodal matching-aware co-attention networks with mutual knowledge distillation for fake news detection. Information Sciences, 2024, 664: 120310

[29]

Bu Y, Sheng Q, Cao J, Qi P, Wang D, Li J. Combating online misinformation videos: characterization, detection, and future directions. In: Proceedings of the 31st ACM International Conference on Multimedia. 2023, 8770−8780

[30]

Liu Y, Wu Y F B . FNED: a deep network for fake news early detection on social media. ACM Transactions on Information Systems (TOIS), 2020, 38( 3): 25

[31]

Nguyen V H, Sugiyama K, Nakov P, Kan M Y. FANG: leveraging social context for fake news detection using graph representation. In: Proceedings of the 29th ACM International Conference on Information & Knowledge Management. 2020, 1165−1174

[32]

Lu Y J, Li C T. GCAN: graph-aware co-attention networks for explainable fake news detection on social media. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020, 505−514

[33]

Wang H, Tang P, Kong H, Jin Y, Wu C, Zhou L . DHCF: dual disentangled-view hierarchical contrastive learning for fake news detection on social media. Information Sciences, 2023, 645: 119323

[34]

Gao Y, Wang X, He X, Feng H, Zhang Y . Rumor detection with self-supervised learning on texts and social graph. Frontiers of Computer Science, 2023, 17( 4): 174611

[35]

Nan Q, Sheng Q, Cao J, Hu B, Wang D, Li J. Let silence speak: enhancing fake news detection with generated comments from large language models. In: Proceedings of the 33rd ACM International Conference on Information and Knowledge Management. 2024, 1732−1742

[36]

Wan H, Feng S, Tan Z, Wang H, Tsvetkov Y, Luo M. DELL: generating reactions and explanations for LLM-based misinformation detection. In: Proceedings of Findings of the Association for Computational Linguistics: ACL 2024. 2024, 2637−2667

[37]

Wei J, Liao X, Zheng H, Chen G, Cheng X . Learning from context: a mutual reinforcement model for Chinese microblog opinion retrieval. Frontiers of Computer Science, 2018, 12( 4): 714–724

[38]

Huang Y, Yu Z, Xiang Y, Yu Z, Guo J . Exploiting comments information to improve legal public opinion news abstractive summarization. Frontiers of Computer Science, 2022, 16( 6): 166333

[39]

Wang T, Chen Y, Wang Y, Wang B, Wang G, Li X, Zheng H, Zhao B Y . The power of comments: fostering social interactions in Microblog networks. Frontiers of Computer Science, 2016, 10( 5): 889–907

[40]

Li S, Zheng Y, Shi Y, Huang S, Chen S . KD-Crowd: a knowledge distillation framework for learning from crowds. Frontiers of Computer Science, 2025, 19( 1): 191302

[41]

Ji Z, Ni J, Liu X, Pang Y . Teachers cooperation: team-knowledge distillation for multiple cross-domain few-shot learning. Frontiers of Computer Science, 2023, 17( 2): 172312

[42]

Gou J, Yu B, Maybank S J, Tao D . Knowledge distillation: a survey. International Journal of Computer Vision, 2021, 129( 6): 1789–1819

[43]

Hinton G, Vinyals O, Dean J. Distilling the knowledge in a neural network. 2015, arXiv preprint arXiv: 1503.02531

[44]

Zhou H, Song L, Chen J, Zhou Y, Wang G, Yuan J, Zhang Q. Rethinking soft labels for knowledge distillation: a bias-variance tradeoff perspective. In: Proceedings of the 9th International Conference on Learning Representations. 2021

[45]

Romero A, Ballas N, Kahou S E, Chassang A, Gatta C, Bengio Y. FitNets: hints for thin deep nets. In: Proceedings of the 3rd International Conference on Learning Representations. 2015

[46]

Zagoruyko S, Komodakis N. Paying more attention to attention: improving the performance of convolutional neural networks via attention transfer. In: Proceedings of the 5th International Conference on Learning Representations. 2017

[47]

Zhang D, Zhou Y, Zhao J, Yang Z, Dong H, Yao R, Ma H . Multi-granularity semantic alignment distillation learning for remote sensing image semantic segmentation. Frontiers of Computer Science, 2022, 16( 4): 164351

[48]

Chen Y, Wang N, Zhang Z. DarkRank: accelerating deep metric learning via cross sample similarities transfer. In: Proceedings of the 32nd AAAI Conference on Artificial Intelligence, AAAI’18. 2018, 2852−2859

[49]

Tian Y, Krishnan D, Isola P. Contrastive representation distillation. In: Proceedings of the 8th International Conference on Learning Representations. 2020

[50]

Passalis N, Tzelepi M, Tefas A. Heterogeneous knowledge distillation using information flow modeling. In: Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020, 2336−2345

[51]

Jang Y, Lee H, Hwang S J, Shin J. Learning what and where to transfer. In: Proceedings of the 36th International Conference on Machine Learning. 2019, 3030−3039

[52]

Li K, Guo B, Liu J, Wang J, Ren H, Yi F, Yu Z . Dynamic probabilistic graphical model for progressive fake news detection on social media platform. ACM Transactions on Intelligent Systems and Technology (TIST), 2022, 13( 5): 86

[53]

Lu J, Yang J, Batra D, Parikh D. Hierarchical question-image co-attention for visual question answering. In: Proceedings of the 30th International Conference on Neural Information Processing Systems. 2016, 289−297

[54]

Bird S, Klein E, Loper E. Natural Language Processing with Python: Analyzing Text with the Natural Language Toolkit. Beijing: O’Reilly Media, Inc., 2009

[55]

Mohammad S. Word affect intensities. In: Proceedings of the 11th International Conference on Language Resources and Evaluation, 2018

[56]

Dong Z, Dong Q . HowNet-a hybrid language and knowledge resource. In: Proceedings of International Conference on Natural Language Processing and Knowledge Engineering, 2003, 2003: 820–824

[57]

Nan Q, Cao J, Zhu Y, Wang Y, Li J. MDFEND: multi-domain fake news detection. In: Proceedings of the 30th ACM International Conference on Information & Knowledge Management. 2021, 3343−3347

[58]

Shu K, Mahudeswaran D, Wang S, Lee D, Liu H . FakeNewsNet: a data repository with news content, social context, and spatiotemporal information for studying fake news on social media. Big Data, 2020, 8( 3): 171–188

[59]

Devlin J, Chang M W, Lee K, Toutanova K. BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2019, 4171−4186

[60]

Wang B, Ma J, Lin H, Yang Z, Yang R, Tian Y, Chang Y. Explainable fake news detection with large language model via defense among competing wisdom. In: Proceedings of the ACM Web Conference 2024. 2024, 2452−2463

[61]

Xiao M, Mayer J. The challenges of machine learning for trust and safety: a case study on misinformation detection. 2023, arXiv preprint arXiv: 2308.12215

[62]

McClish D K . Analyzing a portion of the roc curve. Medical Decision Making, 1989, 9( 3): 190–195

[63]

Zhu Y, Sheng Q, Cao J, Nan Q, Shu K, Wu M, Wang J, Zhuang F . Memory-guided multi-view multi-domain fake news detection. IEEE Transactions on Knowledge and Data Engineering, 2023, 35( 7): 7178–7191

[64]

Mu Y, Bontcheva K, Aletras N. It’s about time: rethinking evaluation on rumor detection benchmarks using chronological splits. In: Proceedings of Findings of the Association for Computational Linguistics: EACL 2023. 2023, 736−743

[65]

Hu B, Sheng Q, Cao J, Zhu Y, Wang D, Wang Z, Jin Z. Learn over past, evolve for future: forecasting temporal trends for fake news detection. In: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics. 2023, 116−125

[66]

Hu B, Sheng Q, Cao J, Shi Y, Li Y, Wang D, Qi P. Bad actor, good advisor: exploring the role of large language models in fake news detection. In: Proceedings of the 38th AAAI Conference on Artificial Intelligence. 2024, 22105−22113

[67]

Liu A, Sheng Q, Hu X. Preventing and detecting misinformation generated by large language models. In: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2024, 3001−3004

RIGHTS & PERMISSIONS

Higher Education Press

AI Summary AI Mindmap
PDF (2155KB)

1290

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/