Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

BiRD: A Bidirectional Ranking Defense Mechanism for Retrieval Augmented Generation

About

The growing adoption of Retrieval-Augmented Generation (RAG) has led to a rise in adversarial attacks. Existing defenses, relying on semantic analysis or voting, face a trade-off between high computational cost and limited robustness under strong poisoning attacks. Their fundamental limitation is the exclusive focus on semantic content relevance, while neglecting the retrieval context that is critically defined by ranking structures. To this end, we investigate the bidirectional ranking behavior of poisoned and benign documents, and discover a key discriminative pattern: poisoned documents exhibit significantly stronger alignment between their backward rankings and the query's forward ranking. Capitalizing on this, we propose BiRD, a bidirectional ranking defense mechanism built upon a dual-signal framework that leverages forward ranking to assess semantic content relevance and backward ranking to quantify ranking context consistency. This design directly addresses the fundamental limitation of prior approaches, enabling simultaneous efficiency and robustness. Extensive evaluation across 3 datasets with 3 retrievers and 3 LLMs under 2 attack scenarios validates BiRD's effectiveness. Notably, BiRD reduces the attack success rate of PoisonedRAG by up to 54% while simultaneously improving task accuracy by up to 56%, with average additional latency under 1 second.

Chengcai Gao, Zhihong Sun, Xiaochuan Shi, Qiufeng Wang, Chao Liang• 2026

Related benchmarks

TaskDatasetResultRank
Question AnsweringHotpotQA PIA (test)
ASR50
62
Question AnsweringHotpotQA PoisonedRAG (test)
Abstention Rate (ASR)17
45
Retrieval-Augmented GenerationMS Marco
Accuracy (Clean)83
45
Question AnsweringHotpotQA Clean (test)
Accuracy78
45
Retrieval-Augmented Question AnsweringNQ
Clean Accuracy79
45
Showing 5 of 5 rows

Other info

Follow for update