Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

KRIS-Bench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Image EditingKRIS-Bench
Factual Knowledge Score81.67
74
Instruction-based Image EditingKRIS Bench 38 (test)
Factual Score79.8
27
Image EditingKris-Bench Natural Science
VC0.8275
8
Instruction-based Image EditingKRIS-Bench 1.0 (test)
Attribute Perception81.02
7
Showing 4 of 4 rows