Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Part-level Contact MCQ on PARSE-10K (test)
Loading...
86.2
Accuracy
Ours
35.24
48.47
61.7
74.93
Mar 8, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Ours
Evaluation protocol=Fi...
2026.03
86.2
Gemini-2.5-Pro
Evaluation protocol=Ze...
2026.03
75.6
GPT-5
Evaluation protocol=Ze...
2026.03
75.2
Claude-Opus-4
Evaluation protocol=Ze...
2026.03
73.2
Qwen3-VL
Evaluation protocol=Ze...
2026.03
60.4
Robobrain2.0
Evaluation protocol=Ze...
2026.03
37.2
Feedback
Search any
task
Search any
task