基于多特征图与GCN-GAT的谣言检测模型
Rumor Detection Model Based on Multi-Feature Graphs and GCN-GAT
DOI: 10.12677/csa.2025.1510259, PDF,    科研立项经费支持
作者: 黄 媛, 苏庆鸥, 苏敬科, 刘柏霆*:广西民族师范学院数学与计算机科学学院,广西 崇左
关键词: 谣言检测多特征图RoBERTa图卷积网络图注意力网络Rumor Detection Multi-Feature Graph RoBERTa Graph Convolutional Network Graph Attention Network
摘要: 社交媒体中的谣言文本通常是非正式且语法不连贯的,这为准确提取语义信息带来了巨大挑战。为解决这一问题,本文提出了RoBERTa-MGAT模型,一种融合多特征图构建、并行图神经网络编码器与基于注意力的融合机制的谣言检测模型。具体而言,该模型利用RoBERTa和Word2Vec生成丰富的词向量表示,并通过构建三种异构图(词性图、词共现图和语义依存图)从不同角度捕捉多样化的语言特征。针对每个图结构,本文并行采用图卷积网络(Graph Convolutional Network, GCN)和图注意力网络(Graph Attention Network, GAT)来联合学习互补的结构化表征,GCN专注于捕获局部平滑特征,而GAT则用于建模特征异构性与重要性。最后通过自注意力机制聚合所有图的输出表示,使模型能够有效整合多视角特征。在Weibo20和Weibo21两个公开谣言检测数据集上的实验结果表明,RoBERTa-MGAT在准确率和F1分数上均优于同类型模型,展现出卓越的性能。
Abstract: Rumor texts on social networks are often informal and grammatically incoherent, posing significant challenges for extracting accurate semantic information. To address this issue, we propose RoBERTa-MGAT, a rumor detection model that integrates multi-feature graph construction with parallel graph neural encoders and an attention-based fusion mechanism. Specifically, the model leverages RoBERTa and Word2Vec to generate rich word embeddings and constructs three heterogeneous graphs—a part-of-speech graph, a word co-occurrence graph, and a semantic dependency graph—to capture diverse linguistic features from different perspectives. For each graph, Graph Convolutional Network (GCN) and Graph Attention Network (GAT) are applied in parallel to jointly learn complementary structural representations, with GCN capturing local smoothness and GAT modeling feature heterogeneity and importance. The outputs from all graphs are then aggregated using a self-attention mechanism, allowing the model to effectively integrate multi-view features. Experimental results on two public rumor detection datasets, Weibo20 and Weibo21, demonstrate that RoBERTa-MGAT achieves superior performance, outperforming existing state-of-the-art methods in both accuracy and F1-score.
文章引用:黄媛, 苏庆鸥, 苏敬科, 刘柏霆. 基于多特征图与GCN-GAT的谣言检测模型[J]. 计算机科学与应用, 2025, 15(10): 176-188. https://doi.org/10.12677/csa.2025.1510259

参考文献

[1] Devlin, J., Chang, M.W., Lee, K. and Toutanova, K. (2018) BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding. arXiv: 1810.04805.
[2] Liu, Y., Ott, M., Goyal, N., Du, J., Joshi, M., Chen, D., Levy, O., Lewis, M., Zettlemoyer, L. and Stoyanov, V. (2019) RoBERTa: A Robustly Optimized BERT Pretraining Approach. arXiv: 1907.11692.
[3] Yao, L., Mao, C. and Luo, Y. (2019) Graph Convolutional Networks for Text Classification. Proceedings of the AAAI Conference on Artificial Intelligence, 33, 7370-7377. [Google Scholar] [CrossRef
[4] Lin, Y., Meng, Y., Sun, X., Han, Q., Kuang, K., Li, J. and Wu, F. (2021) BERTGCN: Transductive Text Classification by Combining GCN and BERT. arXiv: 2105.05727.
[5] Veličković, P., Cucurull, G., Casanova, A., Romero, A., Lio, P. and Bengio, Y. (2017) Graph Attention Networks. arXiv: 1710.10903.
[6] 戚力鑫, 万书振, 唐斌, 徐义春. 基于注意力机制的多模态融合谣言检测方法[J]. 计算机工程与应用, 2022, 58(19): 209-217.
[7] 米源, 唐恒亮. 基于图卷积网络的谣言鉴别研究[J]. 计算机工程与应用, 2021, 57(13): 161-167.
[8] 强子珊, 顾益军. 融合动态传播和社区结构的社交媒体谣言检测模型[J]. 计算机工程与应用, 2024, 60(18): 198-207.
[9] Bian, T., Xiao, X., Xu, T., Zhao, P., Huang, W., Rong, Y., et al. (2020) Rumor Detection on Social Media with Bi-Directional Graph Convolutional Networks. Proceedings of the AAAI Conference on Artificial Intelligence, 34, 549-556. [Google Scholar] [CrossRef
[10] 王昕岩, 宋玉蓉, 宋波. 一种加权图卷积神经网络的新浪微博谣言检测方法[J]. 小型微型计算机系统, 2021, 42(8): 1780-1786.
[11] Sun, T., Qian, Z., Dong, S., Li, P. and Zhu, Q. (2022) Rumor Detection on Social Media with Graph Adversarial Contrastive Learning. Proceedings of the ACM Web Conference 2022, Lyon, 25-29 April 2022, 2789-2797. [Google Scholar] [CrossRef
[12] Choi, J., Ko, T., Choi, Y., Byun, H. and Kim, C. (2021) Dynamic Graph Convolutional Networks with Attention Mechanism for Rumor Detection on Social Media. PLOS ONE, 16, e0256039. [Google Scholar] [CrossRef] [PubMed]
[13] Ding, X., Teng, C. and Ji, D. (2023) Fake News Detection with Context Awareness of the Publisher. International Conferences on Software Engineering and Knowledge Engineering, South San Francisco, 1-10 July 2023, 548-553. [Google Scholar] [CrossRef
[14] Thirumoorthy, K., Haripriya, R., Shreenee, N., et al. (2024) Rumor Detection Using BERT-Based Social Circle and Interaction Network Model. Social Network Analysis and Mining, 14, Article No. 195. [Google Scholar] [CrossRef
[15] Che, W., Feng, Y., Qin, L. and Liu, T. (2021) N-LTP: An Open-Source Neural Language Technology Platform for Chinese. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, Online, 7-11 November 2021, 42-49. [Google Scholar] [CrossRef
[16] Wolf, T., Debut, L., Sanh, V., Chaumond, J., Delangue, C., Moi, A., et al. (2020) Transformers: State-of-the-Art Natural Language Processing. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, Online, 16-20 November 2020, 38-45. [Google Scholar] [CrossRef
[17] Mikolov, T., Chen, K., Corrado, G. and Dean, J. (2013) Efficient Estimation of Word Representations in Vector Space. arXiv: 1301.3781.
[18] Rao, D., Miao, X., Jiang, Z. and Li, R. (2021) Stanker: Stacking Network Based on Level-Grained Attention-Masked BERT for Rumor Detection on Social Media. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, Punta Cana, 7-11 November 2021, 3347-3363. [Google Scholar] [CrossRef
[19] Nan, Q., Cao, J., Zhu, Y., Wang, Y. and Li, J. (2021) MDFEND: Multi-Domain Fake News Detection. Proceedings of the 30th ACM International Conference on Information & Knowledge Management, Queensland, 1-5 November 2021, 3343-3347. [Google Scholar] [CrossRef
[20] Misra, D. (2019) Mish: A Self Regularized Non-Monotonic Activation Function. arXiv: 1908.08681.