| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| LFQA | | Pairwise Preference Accuracy | 77.24 | 13 | 1mo ago |
| EyePACS, DHCI, and TAD66k average | | Average Human Annotation Count | 4,950 | 12 | 25d ago |
| WildVision Arena in-domain | | Accuracy (w/ Tie) | 62 | 11 | 1mo ago |
| OSE | NPRM BERT | NDCG | 99.7 | 4 | 1mo ago |
| NewsEla-En | NPRM BERT | NDCG | 0.999 | 4 | 1mo ago |
| Seven Harm Categories | | Insult Pairwise Score | 83.1 | 3 | 1mo ago |
| FLORES-200 | | Accuracy | 100 | 3 | 1mo ago |
| Vikidia-En (test) | NPRM | NDCG | 0.991 | 2 | 1mo ago |
| OSE (test) | NPRM | NDCG | 98.3 | 2 | 1mo ago |