Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yinhao Xiao

VMID: A Multimodal Fusion LLM Framework for Detecting and Identifying Misinformation of Short Videos

Nov 15, 2024

Weihao Zhong, Yinhao Xiao, Minghui Xu, Xiuzhen Cheng

Figure 1 for VMID: A Multimodal Fusion LLM Framework for Detecting and Identifying Misinformation of Short Videos

Figure 2 for VMID: A Multimodal Fusion LLM Framework for Detecting and Identifying Misinformation of Short Videos

Figure 3 for VMID: A Multimodal Fusion LLM Framework for Detecting and Identifying Misinformation of Short Videos

Figure 4 for VMID: A Multimodal Fusion LLM Framework for Detecting and Identifying Misinformation of Short Videos

Abstract:Short video platforms have become important channels for news dissemination, offering a highly engaging and immediate way for users to access current events and share information. However, these platforms have also emerged as significant conduits for the rapid spread of misinformation, as fake news and rumors can leverage the visual appeal and wide reach of short videos to circulate extensively among audiences. Existing fake news detection methods mainly rely on single-modal information, such as text or images, or apply only basic fusion techniques, limiting their ability to handle the complex, multi-layered information inherent in short videos. To address these limitations, this paper presents a novel fake news detection method based on multimodal information, designed to identify misinformation through a multi-level analysis of video content. This approach effectively utilizes different modal representations to generate a unified textual description, which is then fed into a large language model for comprehensive evaluation. The proposed framework successfully integrates multimodal features within videos, significantly enhancing the accuracy and reliability of fake news detection. Experimental results demonstrate that the proposed approach outperforms existing models in terms of accuracy, robustness, and utilization of multimodal information, achieving an accuracy of 90.93%, which is significantly higher than the best baseline model (SV-FEND) at 81.05%. Furthermore, case studies provide additional evidence of the effectiveness of the approach in accurately distinguishing between fake news, debunking content, and real incidents, highlighting its reliability and robustness in real-world applications.

* arXiv admin note: text overlap with arXiv:2211.10973 by other authors

Via

Access Paper or Ask Questions

Multi-View Pre-Trained Model for Code Vulnerability Identification

Aug 10, 2022

Xuxiang Jiang, Yinhao Xiao, Jun Wang, Wei Zhang

Figure 1 for Multi-View Pre-Trained Model for Code Vulnerability Identification

Figure 2 for Multi-View Pre-Trained Model for Code Vulnerability Identification

Figure 3 for Multi-View Pre-Trained Model for Code Vulnerability Identification

Figure 4 for Multi-View Pre-Trained Model for Code Vulnerability Identification

Abstract:Vulnerability identification is crucial for cyber security in the software-related industry. Early identification methods require significant manual efforts in crafting features or annotating vulnerable code. Although the recent pre-trained models alleviate this issue, they overlook the multiple rich structural information contained in the code itself. In this paper, we propose a novel Multi-View Pre-Trained Model (MV-PTM) that encodes both sequential and multi-type structural information of the source code and uses contrastive learning to enhance code representations. The experiments conducted on two public datasets demonstrate the superiority of MV-PTM. In particular, MV-PTM improves GraphCodeBERT by 3.36\% on average in terms of F1 score.

* Accepted By WASA'2022

Via

Access Paper or Ask Questions