Document Understanding on EHR Dataset 4
[Chart: Macro F1. Highlighted point: Gemini 3.0 Flash, 82, Apr 6, 2026]
Updated 10d ago
Evaluation Results
Method            Model Type     Date      Macro F1
Gemini 3.0 Flash  Large Models   2026.04   82
Gemini 3.0 Pro    Large Models   2026.04   81
MedGemma 1.5 4B   Small Models   2026.04   64
Gemma 3 27B       Small Models   2026.04   52
Gemma 3 4B        Small Models   2026.04   41
MedGemma 1 4B     Small Models   2026.04   25
MedGemma 1 27B    Small Models   2026.04   5
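The page does not say how Macro F1 is computed for this benchmark. As a rough reference, the sketch below follows the standard definition: compute F1 separately for each class, then take the unweighted mean. The function name `macro_f1` and the example labels are hypothetical, and the scores above appear to be on a 0 to 100 scale, so the result would need to be multiplied by 100 to be comparable (an assumption, not something the page states).

```python
from collections import Counter


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over every label seen in either list.

    Standard definition only; the benchmark's exact scoring rules are
    not given on the page, so treat this as an illustrative sketch.
    """
    labels = sorted(set(y_true) | set(y_pred))
    if not labels:
        return 0.0

    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative

    scores = []
    for label in labels:
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the denominator is 0
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical labels, for illustration only
example_true = ["a", "a", "b", "b"]
example_pred = ["a", "b", "b", "b"]
example_score = macro_f1(example_true, example_pred)
```

Because every class counts equally regardless of how often it occurs, a model that does poorly on rare classes can score low on Macro F1 even when its overall accuracy is high.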