| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Downstream retrieval | RAR-B | ARC nDCG@5: 16.2 | 24 |
| Autoregressive Visual Watermarking | RAR-XL generation | Fidelity Score (Baseline): 1 | 10 |
| Medical Reasoning | RaR Medicine | WR vs Base: 57.6 | 8 |
| Medical Question Answering | RaR-Medicine (test) | Length: 1,395 | 5 |
| Pairwise Preference Evaluation | RaR Medicine | Pairwise Win Rate: 60.6 | 4 |