| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| LLM Routing | In-domain datasets Cost First, alpha=0.8 | Accuracy 93 | 11 |
| LLM Routing | In-domain datasets Balance, alpha=0.5 | Accuracy 93 | 11 |
| LLM Routing | In-domain datasets Performance First, alpha=0.2 | Accuracy 93 | 11 |
| Claim Verification | 9 In-Domain datasets (FEVER, ClaimDecomp, HoVer, FEVEROUS, WiCE, Ex-FEVER, PubHealth, PubMedClaim, FoolMeTwice) | FEVER Accuracy 74.1 | 6 |