OOD Evaluation Suite

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Generalization across real-world cross-view spatial reasoning benchmarks | OOD Evaluation Suite Aggregate | Avg OOD Performance: 40 | 13 |