首页 | 本学科首页   官方微博 | 高级检索  
     检索      

一种基于攻击距离的对抗样本攻击组筛选方法
引用本文:刘洪毅,方宇彤,文伟平.一种基于攻击距离的对抗样本攻击组筛选方法[J].北京航空航天大学学报,2022,48(2):339-347.
作者姓名:刘洪毅  方宇彤  文伟平
作者单位:北京大学 软件与微电子学院, 北京 102600
基金项目:国家自然科学基金(U1736218)~~;
摘    要:黑盒对抗样本生成过程中通常会指定1个攻击组,包括1个原始样本和1个目标样本,使得生成的对抗样本与原始样本范数差别不大,但被分类器识别为目标样本的分类。针对攻击组的攻击难度不同导致攻击不稳定的问题,以图像识别领域为例,设计了基于决策边界长度的攻击距离度量方法,为攻击组的攻击难易程度提供了度量方法。在此基础上,设计了基于攻击距离的对抗样本攻击组筛选方法,在攻击开始前就筛去难以攻击的攻击组,从而实现在不修改攻击算法的前提下,提升攻击效果。实验表明:相比于筛选前的攻击组,筛选后的攻击组的总体效果提升了42.07%,攻击效率提升了24.99%,方差降低了76.23%。利用攻击组的对抗样本生成方法在攻击前先进行攻击组筛选,可以稳定并提高攻击效果。 

关 键 词:对抗样本    黑盒    决策边界    筛选    图像识别
收稿时间:2020-09-21

A method for filtering the attack pairs of adversarial examples based on attack distance
LIU Hongyi,FANG Yutong,WEN Weiping.A method for filtering the attack pairs of adversarial examples based on attack distance[J].Journal of Beijing University of Aeronautics and Astronautics,2022,48(2):339-347.
Authors:LIU Hongyi  FANG Yutong  WEN Weiping
Institution:School of Software & Microelectronics, Peking University, Beijing 102600, China
Abstract:During the generation of black-box adversarial examples, an attack pair is usually specified, including a source example and a target example. The purpose is to let the generated adversarial example only have little norm difference from the source example, but it is recognized by the classifier as the classification of the target sample. In order to solve the problem of the instability of adversarial attacks caused by different attack difficulty of attack pairs, taking the image recognition field as an example, this paper presented an attack distance measurement method based on the length of the decision boundary, which provided a measurement method for the attack difficulty of attack pairs. Then this paper designed a filtering method based on attack distance of the attack pairs, which filtered out attack pairs that were difficult to attack before the attack started, so this method can improve the attack performance without modifying the attack algorithm. Experiments show that, compared with the attack pairs before filtering, the filtered attack pairs improve the overall attack performance by 42.07%, improve the attack efficiency by 24.99%, and stabilize the variance by 76.23%. It is recommended that all methods of generating adversarial examples using attack pairs should filter attack pairs before attack to stabilize and improve the attack performance. 
Keywords:
本文献已被 万方数据 等数据库收录!
点击此处可从《北京航空航天大学学报》浏览原始摘要信息
点击此处可从《北京航空航天大学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号