模式识别与人工智能
Sunday, Jul. 27, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
Pattern Recognition and Artificial Intelligence  2025, Vol. 38 Issue (3): 205-220    DOI: 10.16451/j.cnki.issn1003-6059.202503002
Papers and Reports Current Issue| Next Issue| Archive| Adv Search |
Few-Shot Deepfake Face Detection Method Based on Vision-Language Model
YANG Hongyu1,2, LI Xinghang1, CHENG Xiang3, HU Ze1
1. School of Safety Science and Engineering, Civil Aviation University of China, Tianjin 300300;
2. College of Computer Science and Technology, Civil Aviation University of China, Tianjin 300300;
3. College of Information Engineering, Yangzhou University, Yangzhou 225127

Download: PDF (2818 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  Aiming at the limitations of existing deepfake face detection methods in terms of model complexity, sample size requirements and adaptability to new deepfake techniques, a few-shot deepfake face detection method based on visual-language model(FDFD-VLM) is proposed. FDFD-VLM is built upon contrastive language-image pre-training(CLIP). Visual features are optimized through a face region extraction and high-frequency feature enhancement module. Prompt adaptability is improved by a classless differentiated prompt optimization module, while multimodal feature representation is strengthened by CLIP encoding attention optimization module. Additionally, a triplet loss function is introduced to improve the model discriminative capability. Experimental results demonstrate that FDFD-VLM outperforms existing methods on multiple deepfake face datasets and achieves efficient detection performance in few-shot deepfake face detection scenarios.
Key wordsKey Words Deepfake Detection      Visual-Language Model      Prompt Engineering      Few-Shot Detection     
Received: 13 December 2024     
ZTFLH: TP391.41  
Fund:National Natural Science Foundation of China(No.U2433205), National Natural Science Foundation of China(No.62201576,U1833107), Jiangsu Provincial Basic Research Program Natural Science Foundation-Youth Fund(No.BK20230558)
Corresponding Authors: YANG Hongyu, Ph.D., professor. His research interests include network and system security.   
About author:: LI Xinghang, Master student. His research interests include AI security.
CHENG Xiang, Ph.D., lecturer. His research interests include network and system security.
HU Ze, Ph.D., lecturer. His research interests include natural language processing.
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
YANG Hongyu
LI Xinghang
CHENG Xiang
HU Ze
Cite this article:   
YANG Hongyu,LI Xinghang,CHENG Xiang等. Few-Shot Deepfake Face Detection Method Based on Vision-Language Model[J]. Pattern Recognition and Artificial Intelligence, 2025, 38(3): 205-220.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.202503002      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2025/V38/I3/205
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn