BiRD: A Bidirectional Ranking Defense Mechanism for Retrieval Augmented Generation

About

The growing adoption of Retrieval-Augmented Generation (RAG) has led to a rise in adversarial attacks. Existing defenses, relying on semantic analysis or voting, face a trade-off between high computational cost and limited robustness under strong poisoning attacks. Their fundamental limitation is the exclusive focus on semantic content relevance, while neglecting the retrieval context that is critically defined by ranking structures. To this end, we investigate the bidirectional ranking behavior of poisoned and benign documents, and discover a key discriminative pattern: poisoned documents exhibit significantly stronger alignment between their backward rankings and the query's forward ranking. Capitalizing on this, we propose BiRD, a bidirectional ranking defense mechanism built upon a dual-signal framework that leverages forward ranking to assess semantic content relevance and backward ranking to quantify ranking context consistency. This design directly addresses the fundamental limitation of prior approaches, enabling simultaneous efficiency and robustness. Extensive evaluation across 3 datasets with 3 retrievers and 3 LLMs under 2 attack scenarios validates BiRD's effectiveness. Notably, BiRD reduces the attack success rate of PoisonedRAG by up to 54% while simultaneously improving task accuracy by up to 56%, with average additional latency under 1 second.

Chengcai Gao, Zhihong Sun, Xiaochuan Shi, Qiufeng Wang, Chao Liang• 2026

Related benchmarks

Task	Dataset	Result
Question Answering	HotpotQA PIA (test)	ASR50	62
Question Answering	HotpotQA PoisonedRAG (test)	Abstention Rate (ASR)17	45
Retrieval-Augmented Generation	MS Marco	Accuracy (Clean)83	45
Question Answering	HotpotQA Clean (test)	Accuracy78	45
Retrieval-Augmented Question Answering	NQ	Clean Accuracy79	45

Showing 5 of 5 rows

Other info

Follow for update

@wizwand_team Discord