中文版 | English
题名

Effective and Imperceptible Adversarial Textual Attack Via Multi-objectivization

作者
通讯作者Tang, Ke
发表日期
2024-07-23
DOI
发表期刊
ISSN
2688-299X
EISSN
2688-3007
卷号4
摘要
The field of adversarial textual attack has significantly grown over the past few years, where the commonly considered objective is to craft adversarial examples (AEs) that can successfully fool the target model. However, the imperceptibility of attacks, which is also essential for practical attackers, is often left out by previous studies. In consequence, the crafted AEs tend to have obvious structural and semantic differences from the original human-written text, making them easily perceptible. In this work, we advocate leveraging multi-objectivization to address such an issue. Specifically, we reformulate the problem of crafting AEs as a multi-objective optimization problem, where the attack imperceptibility is considered as an auxiliary objective. Then, we propose a simple yet effective evolutionary algorithm, dubbed HydraText, to solve this problem. HydraText can be effectively applied to both score-based and decision-based attack settings. Exhaustive experiments involving 44,237 instances demonstrate that HydraText consistently achieves competitive attack success rates and better attack imperceptibility than the recently proposed attack approaches. A human evaluation study also shows that the AEs crafted by HydraText are more indistinguishable from human-written text. Finally, these AEs exhibit good transferability and can bring notable robustness improvement to the target model by adversarial training.
© 2024 Copyright held by the owner/author(s).
收录类别
语种
英语
学校署名
第一 ; 通讯
资助项目
This work was supported in part by the National Key Research and Development Program of China under Grant No. 2022YFA1004102, and in part by Guangdong Major Project of Basic and Applied Basic Research (Grant No. 2023B0303000010).
出版者
EI入藏号
20243216807059
EI主题词
Semantics
EI分类号
Optimization Techniques:921.5
来源库
EV Compendex
引用统计
成果类型期刊论文
条目标识符http://sustech.caswiz.com/handle/2SGJ60CL/807081
专题工学院
南方科技大学
工学院_计算机科学与工程系
作者单位
1.Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, College of Engineering, South Tower, 1088 Xueyuan Avenue, Guangdong, Shenzhen; 518055, China
2.Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong; 999077, Hong Kong
3.National Engineering Laboratory for Big Data System Computing Technology, Shenzhen University, Shenzhen, 10F, Zhizhen Building, Canghai Campus, 3688 Nanhai Avenue, Guangdong, Shenzhen, China
4.National Key Laboratory for Novel Software Technology, School of Artificial Intelligence, Nanjing University, Nanjing, International College Area, Xianlin Campus, 163 Xianlin Avenue, Jiangsu, Nanjing, China
第一作者单位工学院;  计算机科学与工程系
通讯作者单位工学院;  计算机科学与工程系
第一作者的第一单位工学院;  计算机科学与工程系
推荐引用方式
GB/T 7714
Liu, Shengcai,Lu, Ning,Hong, Wenjing,et al. Effective and Imperceptible Adversarial Textual Attack Via Multi-objectivization[J]. ACM Transactions on Evolutionary Learning and Optimization,2024,4.
APA
Liu, Shengcai,Lu, Ning,Hong, Wenjing,Qian, Chao,&Tang, Ke.(2024).Effective and Imperceptible Adversarial Textual Attack Via Multi-objectivization.ACM Transactions on Evolutionary Learning and Optimization,4.
MLA
Liu, Shengcai,et al."Effective and Imperceptible Adversarial Textual Attack Via Multi-objectivization".ACM Transactions on Evolutionary Learning and Optimization 4(2024).
条目包含的文件
条目无相关文件。
个性服务
原文链接
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
导出为Excel格式
导出为Csv格式
Altmetrics Score
谷歌学术
谷歌学术中相似的文章
[Liu, Shengcai]的文章
[Lu, Ning]的文章
[Hong, Wenjing]的文章
百度学术
百度学术中相似的文章
[Liu, Shengcai]的文章
[Lu, Ning]的文章
[Hong, Wenjing]的文章
必应学术
必应学术中相似的文章
[Liu, Shengcai]的文章
[Lu, Ning]的文章
[Hong, Wenjing]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
[发表评论/异议/意见]
暂无评论

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。