Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Forgery Detection on MedForge-90K Implant Forgery Cross-Model
Loading...
94.86
Accuracy
MedForge-Reasoner
49.1728
61.0339
72.895
84.7561
Mar 19, 2026
Accuracy
F1 Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
F1 Score
MedForge-Reasoner
Category=Generic MLLMs
2026.03
94.86
94.63
SIDA-13B
Category=Specialized D...
2026.03
85.86
83.17
AIGC-Holmes
Category=Specialized D...
2026.03
80.91
84.25
FakeVLM
Category=Specialized D...
2026.03
79.82
79.16
SIDA-7B
Category=Specialized D...
2026.03
76.57
69.85
Gemini3-Pro
Category=Generic MLLMs
2026.03
74.8
71.02
Gemini3-Flash
Category=Generic MLLMs
2026.03
60.14
71.11
Qwen3VL-Plus
Category=Generic MLLMs
2026.03
56.03
54.99
Qwen3VL-Flash
Category=Generic MLLMs
2026.03
50.93
52.69
Feedback
Search any
task
Search any
task