WU Chen^{1,2}, ZHANG Ruqing^{1,2}, GUO Jiafeng^{1,2}, FAN Yixing^{1,2}
1. Key Laboratory of Network Data Science and Technology, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; 2. School of Computer and Control Engineering, University of Chinese Academy of Sciences, Beijing 100190, China
Abstract: Ranking competition is prevalent in Web retrieval, and this adversarial attack behavior causes undesirable effects on retrieval quality. Studying attack methods is therefore conducive to designing more robust ranking models. Existing attack methods produce perturbations that are easily recognized by humans and cannot attack neural ranking models effectively. In this paper, a gradient-based adversarial attack method (GARA) is proposed, consisting of gradient-based word importance ranking, gradient-based adversarial ranking attack, and embedding-based word replacement. Given a target ranking model, backpropagation is first performed on a constructed ranking-based adversarial attack objective. The most important words of a specific document are then identified from the gradient information, and these words are perturbed in the word embedding space via projected gradient descent. Finally, using the counter-fitting technique, the document perturbation is completed by substituting each important word with a synonym that is semantically similar to the original word and nearest to the perturbed word vector. Experiments on the MQ2007 and MS MARCO datasets demonstrate the effectiveness of the proposed method.
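To make the three-stage pipeline concrete, the sketch below implements it in PyTorch under stated assumptions: the ranking model is assumed to be a callable that scores a (query embedding, document embedding) pair, and the counter-fitted synonym table is assumed to map each token id to candidate ids and their vectors. All names here (gara_attack, model, synonyms) are illustrative placeholders, not the authors' released code.

import torch

def gara_attack(model, embedding, query_ids, doc_ids,
                synonyms, k=5, eps=1.0, alpha=0.1, steps=10):
    """Perturb the k most important document words so the (query, doc)
    relevance score rises, then snap each perturbed embedding to its
    nearest counter-fitted synonym. `model(q_emb, d_emb)` is assumed
    to return a scalar relevance score."""
    query_emb = embedding(query_ids).detach()
    doc_emb = embedding(doc_ids).detach().clone().requires_grad_(True)

    # 1. Gradient-based word importance ranking: backpropagate the
    #    ranking-based attack objective (negative relevance score, so
    #    descending it raises the score) and rank words by the L2 norm
    #    of their embedding gradients.
    loss = -model(query_emb, doc_emb)
    loss.backward()
    importance = doc_emb.grad.norm(dim=-1)       # one score per word
    top_idx = importance.topk(k).indices         # k most important words

    # 2. Gradient-based adversarial ranking attack: projected gradient
    #    descent in embedding space, restricted to the important
    #    positions and an eps-ball around the original embeddings.
    orig = doc_emb.detach().clone()
    mask = torch.zeros_like(orig)
    mask[top_idx] = 1.0
    adv = orig.clone()
    for _ in range(steps):
        adv.requires_grad_(True)
        loss = -model(query_emb, adv)
        grad = torch.autograd.grad(loss, adv)[0]
        with torch.no_grad():
            adv = adv - alpha * grad.sign()              # descend attack loss
            delta = (adv - orig).clamp(-eps, eps) * mask # project, keep top-k
            adv = orig + delta

    # 3. Embedding-based word replacement: substitute each important
    #    word with the counter-fitted synonym nearest to its perturbed
    #    vector, keeping the substitution semantically close.
    adv_ids = doc_ids.clone()
    for i in top_idx.tolist():
        tok = doc_ids[i].item()
        if tok not in synonyms:
            continue
        cand_ids, cand_emb = synonyms[tok]       # (n,) ids, (n, dim) vectors
        dist = (cand_emb - adv[i]).norm(dim=-1)
        adv_ids[i] = cand_ids[dist.argmin()]
    return adv_ids

In practice one would also cap the number of replaced words and check that the rewritten document remains fluent and semantically close to the original; the sketch omits these constraints for brevity.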