
All Datasets

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Retrieval-Augmented Generation | All Datasets Aggregated | Average Performance Score: 76.6 | 40 |
| Generalized Category Discovery | All Datasets Avg | Overall Accuracy: 75.1 | 12 |
| Lesion Segmentation | All Datasets | BBox Score: 0.777 | 6 |
| Image Generation | All Datasets | Fidelity: 54 | 4 |
| Preference Prediction | All Datasets Total | Significant Features Count (S): 43 | 2 |
| Alpha-law validation | All datasets | Clean Accuracy: 31.3 | 1 |