Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ICBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Image Captioning EvaluationICBench Long Caption
Human Score76.57
40
Image CaptioningICBench short captions (test)
Fluency80.13
23
Vision-Language-Action instruction followingICBench Goal Suite (test)
Success Rate (SR)96.2
12
Vision-Language-Action instruction followingICBench Object Suite (test)
Success Rate (SR)94.2
12
Vision-Language-Action instruction followingICBench Spatial Suite (test)
Success Rate99.6
12
Intrinsic Concept ExtractionICBench D1
SIMT-T Score (Object)28
2
Showing 6 of 6 rows