Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

OOD Detection

Benchmarks

Task NameDataset NameSOTA ResultTrend
Out-of-distribution detectionOOD Detection Source LLM GPT-4o (test)
XSUM Score98.7
19
Out-of-distribution detectionOOD Detection Source LLM: Claude-3.5-Haiku (test)
XSUM0.977
19
Out-of-distribution detectionOOD Detection Source LLM: Gemini-2.5-Flash (test)
XSUM Score96
19
Showing 3 of 3 rows