Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Full Structure Coreference on BIOVISTA
Loading...
81.5
Recall
BIOMINER-INSTRUCT
54.148
61.249
68.35
75.451
Apr 23, 2026
Recall
Precision
F1 Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Recall
Precision
F1 Score
BIOMINER-INSTRUCT
2026.04
81.5
81.2
81.3
Gemini 2.0-flash
2026.04
78.6
78.3
78.4
Qwen3-VL-235B
2026.04
78.5
78.2
78.3
GPT-4.1-mini
2026.04
78.1
77.7
77.9
Qwen3-VL-32B
2026.04
76.6
76.2
76.4
Claude haiku-4-5
2026.04
72.2
71.9
72
Grok-4-fast
2026.04
55.9
55.8
55.8
GPT-4o-mini
2026.04
55.2
55
55.1
Feedback
Search any
task
Search any
task