| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Information Retrieval | SciFact (test) | NDCG@100.906 | 65 | |
| Information Retrieval | SciFact BEIR (test) | nDCG@1076.6 | 22 | |
| Information Retrieval | SciFact | nDCG@1077.77 | 19 | |
| Scientific Fact Verification | SciFact | Macro F183.03 | 16 | |
| Information Retrieval | Scifact | nDCG82 | 15 | |
| Fact-checking | SCIFact | Balanced Acc90.3 | 15 | |
| Logical Retrieval | SciFact BEIR v1 (test) | nDCG@100.64 | 12 | |
| Claim Verification | SCIFACT | Accuracy94.32 | 12 | |
| Retrieval | SciFact-G | R@1035.1 | 10 | |
| Sentence-Level Confidence Prediction | SciFact | AUROC0.544 | 10 | |
| Information Retrieval | SciFact | NDCG@1072.07 | 7 | |
| Scientific Claim Verification | SciFact | Accuracy40.5 | 6 | |
| Adversarial Attack | SciFact | Contriever Score29.08 | 6 | |
| Information Retrieval | SciFact | Accuracy75.14 | 6 | |
| Information Retrieval | SciFact BEIR | nDCG0.708 | 5 | |
| Information Retrieval | SciFact Scientific | MRR98 | 4 | |
| Dense Retrieval | SciFact | Relative Improvement (%)4.8 | 4 | |
| Information Retrieval | SciFact BEIR (test) | Throughput (QPS)2,585 | 4 | |
| Health misinformation detection | SciFact | Macro Precision86.1 | 4 | |
| Information Retrieval | SciFact | QPS952.92 | 4 | |
| RAG (Retrieval-Augmented Generation) | SciFact 5K docs | Context Precision0.3229 | 1 | |
| Retrieval | SciFact | Spearman Correlation0.861 | 1 |