| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Out-of-distribution detection | OOD Detection Source LLM GPT-4o (test) | XSUM Score98.7 | 19 | |
| Out-of-distribution detection | OOD Detection Source LLM: Claude-3.5-Haiku (test) | XSUM0.977 | 19 | |
| Out-of-distribution detection | OOD Detection Source LLM: Gemini-2.5-Flash (test) | XSUM Score96 | 19 |