Open Access iconOpen Access

ARTICLE

crossmark

Research on Multimodal AIGC Video Detection for Identifying Fake Videos Generated by Large Models

Yong Liu1,2, Tianning Sun3,*, Daofu Gong1,4, Li Di5, Xu Zhao1

1 College of Cyberspace Security, Information Engineering University, Zhengzhou, 450001, China
2 Henan Key Laboratory of Cyberspace Situation Awareness, Zhengzhou, 450001, China
3 College of Computer Science and Technology, Zhejiang University, Hangzhou, 310027, China
4 Key Laboratory of Cyberspace Security, Ministry of Education, Zhengzhou, 450001, China
5 State Grid Henan Electric Power Company, Zhengzhou, 450040, China

* Corresponding Author: Tianning Sun. Email: email

Computers, Materials & Continua 2025, 85(1), 1161-1184. https://doi.org/10.32604/cmc.2025.062330

Abstract

To address the high-quality forged videos, traditional approaches typically have low recognition accuracy and tend to be easily misclassified. This paper tries to address the challenge of detecting high-quality deepfake videos by promoting the accuracy of Artificial Intelligence Generated Content (AIGC) video authenticity detection with a multimodal information fusion approach. First, a high-quality multimodal video dataset is collected and normalized, including resolution correction and frame rate unification. Next, feature extraction techniques are employed to draw out features from visual, audio, and text modalities. Subsequently, these features are fused into a multilayer perceptron and attention mechanisms-based multimodal feature matrix. Finally, the matrix is fed into a multimodal information fusion layer in order to construct and train a deep learning model. Experimental findings show that the multimodal fusion model achieves an accuracy of 93.8% for the detection of video authenticity, showing significant improvement against other unimodal models, as well as affirming better performance and resistance of the model to AIGC video authenticity detection.

Keywords

Multimodal information fusion; artificial intelligence generated content; authenticity detection; feature extraction; multi-layer perceptron; attention mechanism

Cite This Article

APA Style
Liu, Y., Sun, T., Gong, D., Di, L., Zhao, X. (2025). Research on Multimodal AIGC Video Detection for Identifying Fake Videos Generated by Large Models. Computers, Materials & Continua, 85(1), 1161–1184. https://doi.org/10.32604/cmc.2025.062330
Vancouver Style
Liu Y, Sun T, Gong D, Di L, Zhao X. Research on Multimodal AIGC Video Detection for Identifying Fake Videos Generated by Large Models. Comput Mater Contin. 2025;85(1):1161–1184. https://doi.org/10.32604/cmc.2025.062330
IEEE Style
Y. Liu, T. Sun, D. Gong, L. Di, and X. Zhao, “Research on Multimodal AIGC Video Detection for Identifying Fake Videos Generated by Large Models,” Comput. Mater. Contin., vol. 85, no. 1, pp. 1161–1184, 2025. https://doi.org/10.32604/cmc.2025.062330



cc Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 2657

    View

  • 2082

    Download

  • 0

    Like

Share Link