| Dataset Name | SOTA Method | Metric | Trend | | |
|---|---|---|---|---|---|
| MRPC | CAME | Avg Accuracy 89.9 | 89 | 4d ago | |
| QQP (test) | MoE-DiffuSeq | Accuracy 95.3 | 51 | 4d ago | |
| MSRP | | Accuracy 80.4 | 34 | 4d ago | |
| MRPC GLUE (val) | SVD-Based Selection | Accuracy 85.54 | 27 | 4d ago | |
| MRPC | Calibrated Soft Suffix Learning | Delta Accuracy 0 | 24 | 4d ago | |
| PAWS | FLAN-T5 | Accuracy 94.6 | 24 | 3d ago | |
| PAWS original (test) | HyperPELT | Accuracy 91.79 | 23 | 4d ago | |
| Microsoft Paraphrase Corpus | RoBERTa | Accuracy 89.5 | 21 | 4d ago | |
| PAWS-QQP | DeBERTa-ASA | Accuracy 96 | 16 | 4d ago | |
| Microsoft Paraphrase Corpus (MRPC) (test) | | Accuracy 77.4 | 15 | 3d ago | |
| MRPC | | Accuracy 90.43 | 14 | 3d ago | |
| QQP source: RTE (test) | BERT | Accuracy 71.5 | 12 | 4d ago | |
| MRPC | EvalRank | Spearman Correlation (x100) 30.87 | 12 | 4d ago | |
| QQP | Deep-Ens-LoRA | F1 Score 89 | 10 | 4d ago | |
| QQP | Se² | Accuracy 79.2 | 9 | 3d ago | |
| IndicXPara | MuRIL | Accuracy 60.8 | 9 | 3d ago | |
| PAWS Wiki | EBT-GRC | Accuracy 47.5 | 8 | 4d ago | |
| QQP IID | CRvNN | Accuracy 84.8 | 8 | 4d ago | |
| QQP | PERFECT | Average Accuracy 71.2 | 8 | 4d ago | |
| Twitter Out-of-Domain (test) | BERT-BASE+AA-GAN | Accuracy 88.34 | 8 | 4d ago | |
| QQP In-Domain (test) | ROBERTA-BASE+AA-GAN | Accuracy 91.66 | 8 | 4d ago | |
| MRPC (val) | | F1 Score 93.8 | 6 | 4d ago | |
| MRPC (dev) | EFL | F1 Score 91 | 6 | 3d ago | |
| QQP (dev) | Megatron-3.9B | Accuracy 92.7 | 6 | 3d ago | |
| QQP | R-Kalman | Total Running Time (s) 8,279 | 5 | 4d ago | |