Few-Shot Deepfake Face Detection Method Based on Vision-Language Model
YANG Hongyu1,2, LI Xinghang1, CHENG Xiang3, HU Ze1
1. School of Safety Science and Engineering, Civil Aviation University of China, Tianjin 300300; 2. College of Computer Science and Technology, Civil Aviation University of China, Tianjin 300300; 3. College of Information Engineering, Yangzhou University, Yangzhou 225127
Abstract Aiming at the limitations of existing deepfake face detection methods in model complexity, sample size requirements, and adaptability to new deepfake techniques, a few-shot deepfake face detection method based on a vision-language model (FDFD-VLM) is proposed. FDFD-VLM is built upon contrastive language-image pre-training (CLIP). Visual features are optimized through a face region extraction and high-frequency feature enhancement module. Prompt adaptability is improved by a class-agnostic differentiated prompt optimization module, while multimodal feature representation is strengthened by a CLIP encoding attention optimization module. Additionally, a triplet loss function is introduced to improve the model's discriminative capability. Experimental results demonstrate that FDFD-VLM outperforms existing methods on multiple deepfake face datasets and achieves efficient detection performance in few-shot deepfake face detection scenarios.
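The triplet loss mentioned in the abstract follows the standard FaceNet-style hinge formulation: pull an anchor embedding toward a same-class (positive) embedding and push it away from an opposite-class (negative) embedding by at least a margin. The sketch below is a minimal illustration of that objective only; the margin value, embedding dimensions, and the use of squared Euclidean distance are illustrative assumptions, not the paper's actual hyperparameters.

```python
import numpy as np

def triplet_loss(anchor, positive, negative, margin=0.2):
    """Hinge-style triplet loss over batches of embeddings.

    In a deepfake-detection setting, the anchor and positive might be
    embeddings of two real faces and the negative an embedding of a
    forged face (margin=0.2 is an illustrative assumption).
    """
    d_pos = np.sum((anchor - positive) ** 2, axis=-1)  # squared distance anchor->positive
    d_neg = np.sum((anchor - negative) ** 2, axis=-1)  # squared distance anchor->negative
    return np.maximum(0.0, d_pos - d_neg + margin).mean()

# Toy 2-D embeddings: anchor is close to the positive and far from the
# negative, so the hinge is inactive and the loss is zero.
a = np.array([[1.0, 0.0]])
p = np.array([[0.9, 0.1]])
n = np.array([[-1.0, 0.0]])
print(triplet_loss(a, p, n))  # prints 0.0
```

Minimizing this loss pushes real and fake embeddings apart by at least the margin, which is what sharpens the decision boundary when only a few labeled samples are available.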
Fund: National Natural Science Foundation of China (No. U2433205), National Natural Science Foundation of China (No. 62201576, U1833107), Jiangsu Provincial Basic Research Program Natural Science Foundation-Youth Fund (No. BK20230558)
Corresponding Author: YANG Hongyu, Ph.D., professor. His research interests include network and system security.
About authors: LI Xinghang, Master's student. His research interests include AI security. CHENG Xiang, Ph.D., lecturer. His research interests include network and system security. HU Ze, Ph.D., lecturer. His research interests include natural language processing.
YANG Hongyu, LI Xinghang, CHENG Xiang, et al. Few-Shot Deepfake Face Detection Method Based on Vision-Language Model[J]. Pattern Recognition and Artificial Intelligence, 2025, 38(3): 205-220.