Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Cell-Level Visual Question Answering on PBSBench (In-domain)

77True/False Accuracy

GPT-4o

37.4847.745868.26Apr 19, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.04
7790234891465
2026.04
73911438131863
2026.04
73964754364072
2026.04
708834161060
2026.04
67811627101462
2026.04
65781531141963
2026.04
63763174759
2026.04
63510152452
2026.04
613691691359
2026.04
59722544850
2026.04
59351117141756
2026.04
59526105856
2026.04
58467104754
2026.04
5429010111556
2026.04
512012191141
2026.04
4527067943
2026.04
392400125