| nuScenes (val) | BIRANet | mAP72.3 | | 23 | 1mo ago |
| TweetEval hate | SC | Macro F167.53 | | 21 | 1mo ago |
| KUMC | PolypSegTrack | F1 Score91.1 | | 20 | 1mo ago |
| Synthetic IoUT deployment (test) | HFL-NoCoop | Participation Rate100 | | 16 | 22d ago |
| Unseen Datasets Average | ViT | Accuracy77.12 | | 14 | 17d ago |
| KUMC (test) | FPRL | F1 Score89.8 | | 14 | 19d ago |
| AVG | Early Fusion | AUC0.897 | | 10 | 1mo ago |
| CocoGlide | | AUC0.778 | | 10 | 1mo ago |
| T-IC13 adapted (test) | DocShield | Accuracy91.2 | | 9 | 12d ago |
| T-SROIE dense text for robustness (test) | Gemini-2.5-Pro | Accuracy99.9 | | 8 | 12d ago |
| RealText proposed (test) | DocShield | Accuracy91.4 | | 8 | 12d ago |
| StethoBench (test) | StethoLM | BERTScore70.4 | | 8 | 1mo ago |
| StethoBench | StethoLM | ROUGE-143.4 | | 8 | 1mo ago |
| BEANS | BioMamba | dcase0.426 | | 7 | 1mo ago |
| OCHuman (test) | BBox-Mask-Pose | bbox AP35.9 | | 7 | 1mo ago |
| RIVA challenge (val) | Co-DINO-Swin | mAP60.9 | | 6 | 15d ago |
| Clinical Expert Evaluation set (N=200) | Teacher | Accuracy88.5 | | 6 | 1mo ago |
| Sperm whale coda dataset (test) | CLAP | Accuracy96.8 | | 6 | 1mo ago |
| Ethos race | GPT-J (DC) | Macro F151.4 | | 6 | 1mo ago |
| Hate speech18 | GPT-J (DC) | Macro F10.573 | | 6 | 1mo ago |
| TweetEval offensive | GPT-J (DC) | Macro F168.3 | | 6 | 1mo ago |
| TweetEval irony | GPT-3 (DC) | Macro F162.7 | | 6 | 1mo ago |
| TweetEval stance-feminist (test) | GPT-J (DC) | Macro F141.3 | | 6 | 1mo ago |
| VOC | Regression With Correction | Failure Rate0 | | 6 | 1mo ago |
| nuScenes | Regression With Correction | Failure Rate0 | | 6 | 1mo ago |