| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Information Retrieval | Robust04 | P@20 | 46.67 | 72 |
| Information Retrieval | robust | Recall@100 | 46.5 | 19 |
| Document Reranking | Robust04 Description | MAP | 0.4084 | 13 |
| Relevance Assessment Label Alignment | Robust 2004 | Cohen's Kappa (κ) | 0.56 | 11 |
| Document Retrieval | Robust TREC 2004 (test) | P@20 | 51 | 10 |
| Pseudotime estimation | Robust V2 (pooled donor-holdout) | Mean Difference | -0.293 | 9 |
| Document Retrieval | Robust04 EN | NDCG@10 | 56.38 | 8 |
| Information Retrieval | ROBUST04 (test) | AP@1000 | 27.47 | 8 |
| Information Retrieval | Robust04 BEIR (test) | nDCG@10 | 0.567 | 7 |
| Information Retrieval | Robust04 Title queries (test) | MAP | 29.04 | 7 |
| Passage Reranking | Robust04 (test) | MAP | 0.2901 | 5 |
| Stage classification | Robust cells V2 (test) | Balanced Accuracy | 56.7 | 4 |
| Pseudotime inference | Robust cells V2 (test) | Spearman Correlation (Pseudotime-depth) | 0.249 | 2 |
| CD4/CD8 identification | Robust cells V2 (test) | AUROC | 0.867 | 2 |
| Branch classification | Robust V2 (test) | Balanced Acc | 82.8 | 2 |
| Trustworthiness Evaluation | Robust (human evaluation) | Control Wins | 100 | 1 |