Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PivotAttack: Rethinking the Search Trajectory in Hard-Label Text Attacks via Pivot Words

About

Existing hard-label text attacks often rely on inefficient "outside-in" strategies that traverse vast search spaces. We propose PivotAttack, a query-efficient "inside-out" framework. It employs a Multi-Armed Bandit algorithm to identify Pivot Sets-combinatorial token groups acting as prediction anchors-and strategically perturbs them to induce label flips. This approach captures inter-word dependencies and minimizes query costs. Extensive experiments across traditional models and Large Language Models demonstrate that PivotAttack consistently outperforms state-of-the-art baselines in both Attack Success Rate and query efficiency.

Yuzhi Liang, Shiliang Xiao, Jingsong Wei, Qiliang Lin, Xia Li• 2026

Related benchmarks

TaskDatasetResultRank
Adversarial AttackYelp
ASR39.8
49
Adversarial AttackYahoo
ASR93.5
22
Adversarial AttackMR
ASR62.2
22
Adversarial AttackAMAZON
ASR34.9
22
Adversarial AttackSST-2
Attack Success Rate (ASR)54.6
22
Textual EntailmentSNLI
ASR25.8
8
Textual EntailmentMNLI-m
ASR47.7
8
Textual EntailmentMNLI mm
ASR54.8
8
Showing 8 of 8 rows

Other info

Follow for update