Share your thoughts, 1 month free Claude Pro on usSee more

Part-level Contact MCQ on PARSE-10K (test)

86.2Accuracy

Ours

Updated 1mo ago

Evaluation Results

Method	Links
Ours 2026.03		86.2
Gemini-2.5-Pro 2026.03		75.6
GPT-5 2026.03		75.2
Claude-Opus-4 2026.03		73.2
Qwen3-VL 2026.03		60.4
Robobrain2.0 2026.03		37.2