| Dataset Name | SOTA Method | Metric | Value | Trend | Last Updated |
|---|---|---|---|---|---|
| Natural Instructions task459_matres_static_classification | Logitext | Correctness | 69 | 3 | 4d ago |
| Natural Instructions task457_matres_conditional_classification | | Correctness | 87 | 3 | 4d ago |
| Natural Instructions task108_contextualabusedetection_classification | | Correctness | 75 | 3 | 4d ago |
| Natural Instructions task022_cosmosqa_passage_inappropriate_binary | Logitext | Correctness | 80 | 3 | 4d ago |
| Natural Instructions task021_mctaco_grammatical_logical | Logitext | Correctness | 0.5 | 3 | 4d ago |
| Legalbench | Logitext | Warranty Duration (CUAD) | 61 | 3 | 4d ago |